Give Claude Code eyes and ears - perception for real-time screen vision, voice, and audio understanding.
Explore the docs ->
Report Issues
Claude.Pair.Programmer.mp4
# Add marketplace (one-time)
/plugin marketplace add video-db/claude-code
# Install Pair Programmer
/plugin install pair-programmer@claude-code
# Configure with your VideoDB API key
/pair-programmer:setupImportant: After installation and config setup, exit your current Claude Code session and start a new one. The plugin registers MCP servers and skills that only load on session startup.
# Start recording
/pair-programmer:recordYou can also start recording from the MacOS Tray Menu (System Tray Icon).
The Problem: Traditional AI coding assistants are context-blind. When you code with Claude Code, you're constantly copy-pasting screenshots, describing what's on screen, and repeating yourself. Context switching breaks your flow.
The Solution: Pair Programmer gives Claude Code perception - eyes to see your screen, ears to hear your voice and system audio, and memory to track context automatically.
Like a programmer sitting next to you who sees your terminal errors, hears your questions, and remembers recent context without you repeating anything.
A perception layer for Claude Code that streams your screen, microphone, and system audio to VideoDB's capture SDK, runs real-time AI indexing, and feeds context directly into Claude Code.
- Screen Vision: Real-time visual indexing with AI-generated scene descriptions. Semantic search across screen history.
- Voice Hearing: Live microphone transcription with intent classification (question, command, thinking aloud).
- Audio Awareness: System audio capture (meetings, tutorials) with source classification and summarization.
- Context Continuity: FIFO buffers keep last N items in memory. Claude remembers recent context automatically.
- Start recording via
/pair-programmer:record- continuously captures screen, mic, and system audio in the background - Context buffers fill up automatically as you work (last 50 items by default)
- Trigger AI analysis anytime via keyboard shortcut (
Cmd+Shift+Aby default) or/pair-programmer:cortex - Multi-agent system analyzes buffered context:
- cortex - Orchestrator that correlates reports and synthesizes answers
- code-eye - Reads visual screen context (files, errors, activities)
- voice - Classifies speech intent and extracts keywords
- hearing - Classifies system audio source
- narrator - Shows status and responses in overlay
- Answer appears in always-on-top overlay window with analysis and suggestions
No copy-pasting. No context switching. Just keep working and ask when you need help.
Recording & Streaming
- Real-time screen, mic, and system audio capture via VideoDB SDK
- Multi-channel streaming with AI visual indexing, transcription, and audio summarization
- FIFO context buffers (default 50 items each)
Multi-Agent System
- Parallel sense agents report independently to orchestrator
- Adaptive reading strategies based on context size
- Semantic search across rtstream history
Control Interfaces
- MacOS Tray Menu: Start/stop recording, show/hide overlay
- CLI Commands: Slash commands for config, status, search
- Keyboard shortcut: Configurable trigger for AI analysis (default
Cmd+Shift+A) - Overlay Window: Always-on-top responses and loading states
Session Features
- Cloudflare tunnel for webhook delivery
- Exported video saved to your VideoDB account
| Command | Description |
|---|---|
/pair-programmer:record |
Start or stop recording (opens source picker on first start) |
/pair-programmer:setup |
Initial setup: API key, install dependencies |
/pair-programmer:config |
Configure API key, buffer sizes, prompts, keyboard shortcut |
/pair-programmer:record-status |
Check recording state, rtstream IDs, buffer sizes |
/pair-programmer:refresh-context |
Fetch current screen/mic/system_audio context |
/pair-programmer:what-happened |
Summarize recent activity with timeline |
/pair-programmer:cortex |
Trigger AI analysis of buffered context |
Access via /pair-programmer:config. Key settings:
| Setting | Default | Description |
|---|---|---|
videodb_api_key |
- | Get your API key |
context_buffer_size_* |
50 |
Max entries per buffer (screen/mic/audio) |
assistant_shortcut |
CommandOrControl+Shift+A |
Global keyboard shortcut |
visual_index.enabled |
true |
Enable screen indexing |
mic_index.enabled |
true |
Enable microphone transcription |
system_audio_index.enabled |
true |
Enable system audio indexing |
- macOS 12+ (Monterey or later)
- Node.js 18+
- Claude Code CLI - Install guide
- VideoDB API Key - Sign up
macOS Permissions (System Settings > Privacy & Security):
- Microphone Access
- Screen Recording
- System Audio Recording
- Accessibility (optional, for overlay always-on-top)
- Plugin Issues: GitHub Issues
- VideoDB SDK Issues: VideoDB Capture Quickstart
- Documentation: docs.videodb.io
- Discord: Join community
Made with ❤️ by the VideoDB team