Skip to content

Accept image pastes from the clipboard for multi-modal LLMs to consume #19517

@TheJoeFin

Description

@TheJoeFin

Description of the new feature

When using CLI based multi-modal LLMs for coding, it is common to paste a screenshot or image to the LLM for reference. In Windows I have to take a screenshot then drag and drop the file into the terminal chat which works, but is awkward and requires context switching.

Proposed technical implementation details

I'm not sure how nix based shells handle this, but it is very cool and handy. You can see in the original Claude Code demo they paste the image directly into the shell.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Issue-FeatureComplex enough to require an in depth planning process and actual budgeted, scheduled work.Needs-Tag-FixDoesn't match tag requirementsNeeds-TriageIt's a new issue that the core contributor team needs to triage at the next triage meeting

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions