Add image and audio to prompt/exec #3
Merged
vaibhavpandeyvpz merged 1 commit into one710:main from Mar 16, 2026
Adds support for multiple prompt content types in `prompt` and `exec`: text (existing), image, and audio. Users can pass files with `--image` and `--audio` (repeatable). Content is sent as ACP content blocks (text, image, audio) in `session/prompt` requests.

Usage
`--image <path>`, `--audio <path>` (global; repeatable). Must appear before the command.

- `codeye --image screenshot.png exec "what is shown here?"`
- `codeye --image a.png --image b.png cursor prompt <session-id> "compare these"`
- `codeye --audio meeting.wav exec "transcribe and list action items"`

Implementation
ACP
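A minimal sketch of the `PromptPart` type and `TextPrompt` helper this section introduces. The field and function names come from the PR description; the JSON tags and the `"text"` type string are assumptions based on the content-block types named above, not copied from the codeye source.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// PromptPart is a reconstruction of the content-block type named in this PR.
// The JSON tags are assumptions for illustration.
type PromptPart struct {
	Type     string `json:"type"`               // "text", "image", or "audio"
	Text     string `json:"text,omitempty"`     // set for text parts
	MimeType string `json:"mimeType,omitempty"` // set for image and audio parts
	Data     string `json:"data,omitempty"`     // base64 payload for image and audio parts
}

// TextPrompt wraps a plain string as a single text part, so callers that
// only have text can still produce a []PromptPart.
func TextPrompt(text string) []PromptPart {
	return []PromptPart{{Type: "text", Text: text}}
}

func main() {
	out, err := json.Marshal(TextPrompt("what is shown here?"))
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```

With a single part type carrying optional fields, text-only call sites stay one-liners while image and audio parts reuse the same slice.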
- `PromptTextPart` replaced by `PromptPart` with `Type`, `Text`, `MimeType`, `Data` (base64).
- `TextPrompt(text) []PromptPart` helper for text-only prompts.
- `SessionPromptRequest.Prompt` is now `[]PromptPart`.

Client & runtime
- `Client.Prompt(ctx, sessionID, parts []acp.PromptPart)`; all session runtime prompt/exec entrypoints take `[]acp.PromptPart` instead of a single string.
- `acp.TextPrompt("...")` for backward compatibility.

CLI
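The CLI helper described in this section could look roughly like the sketch below. It is not the code in `internal/cli/prompt_parts.go`: the local `PromptPart` stand-in, the `fileParts` helper, the MIME strings, the error return, and the part ordering (text first, then images, then audio) are all assumptions. Only the function name, its three inputs, and the extension lists come from the PR description.

```go
package main

import (
	"encoding/base64"
	"fmt"
	"os"
	"path/filepath"
	"strings"
)

// PromptPart stands in for acp.PromptPart; only the fields named in the PR are included.
type PromptPart struct {
	Type     string
	Text     string
	MimeType string
	Data     string
}

// Extensions are taken from the PR description; the MIME strings are assumed.
var imageMIME = map[string]string{
	".png": "image/png", ".jpg": "image/jpeg", ".jpeg": "image/jpeg",
	".gif": "image/gif", ".webp": "image/webp",
}

var audioMIME = map[string]string{
	".wav": "audio/wav", ".mp3": "audio/mpeg", ".mpeg": "audio/mpeg",
	".ogg": "audio/ogg", ".flac": "audio/flac", ".m4a": "audio/mp4",
}

// fileParts (hypothetical helper) reads each path, infers its MIME type from
// the extension, and base64-encodes the contents.
func fileParts(kind string, mimeByExt map[string]string, paths []string) ([]PromptPart, error) {
	parts := make([]PromptPart, 0, len(paths))
	for _, path := range paths {
		ext := strings.ToLower(filepath.Ext(path))
		mime, ok := mimeByExt[ext]
		if !ok {
			return nil, fmt.Errorf("unsupported %s extension %q", kind, ext)
		}
		raw, err := os.ReadFile(path)
		if err != nil {
			return nil, fmt.Errorf("read %s: %w", path, err)
		}
		parts = append(parts, PromptPart{
			Type:     kind,
			MimeType: mime,
			Data:     base64.StdEncoding.EncodeToString(raw),
		})
	}
	return parts, nil
}

// BuildPromptParts assembles the text part followed by image and audio parts.
func BuildPromptParts(text string, imagePaths, audioPaths []string) ([]PromptPart, error) {
	parts := []PromptPart{{Type: "text", Text: text}}
	images, err := fileParts("image", imageMIME, imagePaths)
	if err != nil {
		return nil, err
	}
	audio, err := fileParts("audio", audioMIME, audioPaths)
	if err != nil {
		return nil, err
	}
	parts = append(parts, images...)
	return append(parts, audio...), nil
}

func main() {
	dir, err := os.MkdirTemp("", "prompt-parts")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(dir)

	// Placeholder bytes; a real run would point at an actual image file.
	img := filepath.Join(dir, "screenshot.png")
	if err := os.WriteFile(img, []byte("not a real PNG"), 0o600); err != nil {
		panic(err)
	}

	parts, err := BuildPromptParts("what is shown here?", []string{img}, nil)
	if err != nil {
		panic(err)
	}
	for _, p := range parts {
		fmt.Printf("%s mime=%q base64-length=%d\n", p.Type, p.MimeType, len(p.Data))
	}
}
```

Checking the extension before reading the file means an unsupported type is rejected without touching the disk.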
- `internal/cli/prompt_parts.go`: `BuildPromptParts(text, imagePaths, audioPaths)` builds the parts list, reads files, infers MIME type, and base64-encodes.
- Supported extensions: image `.png`, `.jpg`, `.jpeg`, `.gif`, `.webp`; audio `.wav`, `.mp3`, `.mpeg`, `.ogg`, `.flac`, `.m4a`.
- `dispatch` for `prompt` and `exec` builds parts from positional text plus `flags.AudioPaths` and `flags.ImagePaths`.

Queue
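The backward-compatible fallback this section describes might be sketched as follows. `Request` is trimmed here to the two fields the PR names, `effectiveParts` is a hypothetical helper name, and treating "present" as "non-empty" is an assumption about the server's check.

```go
package main

import "fmt"

// PromptPart stands in for acp.PromptPart; only the fields named in the PR are included.
type PromptPart struct {
	Type     string
	Text     string
	MimeType string
	Data     string
}

// Request is reduced to the two fields relevant to the fallback.
type Request struct {
	Prompt      string       // legacy text-only field
	PromptParts []PromptPart // optional; takes precedence when set
}

// effectiveParts (hypothetical name) returns PromptParts when the client
// supplied any, and otherwise wraps Prompt as a single text part.
func effectiveParts(r Request) []PromptPart {
	if len(r.PromptParts) > 0 {
		return r.PromptParts
	}
	return []PromptPart{{Type: "text", Text: r.Prompt}}
}

func main() {
	legacy := Request{Prompt: "summarize the diff"}
	rich := Request{PromptParts: []PromptPart{
		{Type: "text", Text: "what is shown here?"},
		{Type: "image", MimeType: "image/png", Data: "AAAA"},
	}}

	fmt.Println(len(effectiveParts(legacy)), effectiveParts(legacy)[0].Type)
	fmt.Println(len(effectiveParts(rich)), effectiveParts(rich)[1].Type)
}
```

Older clients that only send `Prompt` keep working unchanged, while newer clients can opt in to multi-part content.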
- `Request.PromptParts []acp.PromptPart` (optional). When present, the server uses it; otherwise it uses `Prompt` as a single text part (backward compatible).
- `Handler.Prompt(ctx, sessionID, parts []acp.PromptPart)` updated; queue tests adjusted.

Docs
- `--audio`/`--image` in global options and a short usage note.

Notes