Codebuff

Codebuff is a tool for editing codebases via natural language instruction to Buffy, an expert AI programming assistant.

Project Goals

Developer Productivity: Reduce time and effort for common programming tasks
Learning and Adaptation: Develop a system that learns from user interactions
Focus on power users: Make expert software engineers move even faster

Key Technologies

TypeScript: Primary programming language
Bun: Package manager and runtime
WebSockets: Real-time communication between client and server
LLMs: Multiple providers (Anthropic, OpenAI, Gemini, etc.) for various coding tasks

Main Components

LLM Integration: Processes natural language instructions and generates code changes
WebSocket Server: Handles real-time communication between client and backend
File Management: Reads, parses, and modifies project files
Action Handling: Processes various client and server actions
Knowledge Management: Handles creation, updating, and organization of knowledge files
Terminal Command Execution: Allows running shell commands in user's terminal

WebSocket Communication Flow

Client connects to WebSocket server
Client sends user input and file context to server
Server processes input using LLMs
Server streams response chunks back to client
Client receives and displays response in real-time
Server sends file changes to client for application

Tool Handling System

Tools are defined in backend/src/tools/definitions/list.ts and implemented in npm-app/src/tool-handlers.ts
Available tools: read_files, write_file, str_replace, run_terminal_command, code_search, browser_logs, spawn_agents, web_search, read_docs, run_file_change_hooks, and others
Backend uses tool calls to request additional information or perform actions
Client-side handles tool calls and sends results back to server

Agent System

LLM-based Agents: Traditional agents defined in .agents/ subdirectories using prompts and LLM models
Programmatic Agents: Custom agents using JavaScript/TypeScript generator functions in .agents/
Dynamic Agent Templates: User-defined agents in TypeScript files with handleSteps generator functions
Agent templates define available tools, spawnable sub-agents, and execution behavior
Programmatic agents allow complex orchestration logic, conditional flows, and iterative refinement
Generator functions execute in secure QuickJS sandbox for safety
Both types integrate seamlessly through the same tool execution system

Shell Shims (Direct Commands)

Codebuff supports shell shims for direct command invocation without the codebuff prefix.

Cross-platform: Works on Windows (CMD/PowerShell), macOS, and Linux (bash/zsh/fish)
Store integration: Uses fully qualified agent IDs from the agent store
Easy management: Install, update, list, and uninstall shims via CLI commands### Quick Start (Recommended)

# One-step setup: install and add to PATH automatically
codebuff shims install codebuff/base-lite@1.0.0

# Use immediately in current session (follow the printed instruction)
eval "$(codebuff shims env)"

# Now use direct commands!
base-lite "fix this bug"             # Works right away!

Package Management

Use Bun for all package management operations
Run commands with bun instead of npm (e.g., bun install not npm install)
Use bun run for script execution

TypeScript Build State Management

Cleaning Build State

Use bun run clean-ts to remove all TypeScript build artifacts (.tsbuildinfo files and .next cache)
This resolves infinite loop issues in the typechecker caused by corrupted or stale build cache

Common Issues

Typechecker infinite loops are often caused by stale .tsbuildinfo files or circular project references
Always clean build state when encountering persistent type errors or infinite loops
The monorepo structure with project references can sometimes create dependency cycles

Error Handling Philosophy

Prefer ErrorOr<T> return types over throwing errors.

Return type ErrorOr<T> for operations that fail
Return success(value) or failure(error) from common/src/util/error.ts
- e.g. return failure(new Error('File not found'))
Allows callers to handle errors explicitly without try-catch
Makes error cases visible in function signatures

Error Handling and Debugging

Error messages are logged to console and debug log files
WebSocket errors are caught and logged in server and client code

Security Considerations

Project uses environment variables for sensitive information (API keys)
WebSocket connections should be secured in production (WSS)
User input is validated and sanitized before processing
File operations are restricted to project directory

API Endpoint Architecture

Dependency Injection Pattern

All API endpoints in web/src/app/api/v1/ follow a consistent dependency injection pattern for improved testability and maintainability.

Structure:

Implementation file (web/src/api/v1/<endpoint>.ts) - Contains business logic with injected dependencies
Route handler (web/src/app/api/v1/<endpoint>/route.ts) - Minimal wrapper that injects dependencies
Contract types (common/src/types/contracts/<domain>.ts) - Type definitions for injected functions
Unit tests (web/src/api/v1/__tests__/<endpoint>.test.ts) - Comprehensive tests with mocked dependencies

Example:

// Implementation file - Contains business logic
export async function myEndpoint(params: {
  req: NextRequest
  getDependency: GetDependencyFn
  logger: Logger
  anotherDep: AnotherDepFn
}) {
  // Business logic here
}

// Route handler - Minimal wrapper
export async function GET(req: NextRequest) {
  return myEndpointGet({ req, getDependency, logger, anotherDep })
}

// Contract type (in common/src/types/contracts/)
export type GetDependencyFn = (params: SomeParams) => Promise<SomeResult>

Benefits:

Easy to mock dependencies in unit tests
Type-safe function contracts shared across the codebase
Clear separation between routing and business logic
Consistent pattern across all endpoints

Contract Types Location: All contract types live in common/src/types/contracts/.

Contract Type Pattern: For generic function types, use separate Input/Output types:

// Define input type
export type MyFunctionInput<T> = {
  param1: string
  param2: T
}

// Define output type
export type MyFunctionOutput<T> = Promise<SomeResult<T>>

// Define function type using Input/Output
export type MyFunctionFn = <T>(
  params: MyFunctionInput<T>,
) => MyFunctionOutput<T>

Testing Guidelines

Dependency Injection (Primary Approach)

Prefer dependency injection over mocking. Design functions to accept dependencies as parameters with contract types defined in common/src/types/contracts/.

// ✅ Good: Dependency injection with contract types
import type { TrackEventFn } from '@codebuff/common/types/contracts/analytics'
import type { Logger } from '@codebuff/common/types/contracts/logger'

export async function myFunction(params: {
  trackEvent: TrackEventFn
  logger: Logger
  getData: GetDataFn
}) {
  const { trackEvent, logger, getData } = params
  // Use injected dependencies
}

// Test with simple mock implementations
const mockTrackEvent: TrackEventFn = mock(() => {})
const mockLogger: Logger = {
  error: mock(() => {}),
  // ... other methods
}

Benefits:

No need for spyOn() or mock.module()
Clear, type-safe dependencies
Easy to test with simple mock objects
Better code architecture and maintainability

When to Use spyOn (Secondary Approach)

Use spyOn() only when dependency injection is impractical:

Mocking global functions (Date.now, setTimeout)
Testing legacy code without DI
Overriding internal module behavior temporarily

// Use spyOn for globals
spyOn(Date, 'now').mockImplementation(() => 1234567890)

Avoid mock.module()

Never use mock.module() for functions. It pollutes global state and carries over between test files.

Only use for overriding module constants when absolutely necessary:

Use wrapper functions in @codebuff/common/testing/mock-modules.ts
Call clearMockedModules() in afterAll

Test Setup Patterns

Extract duplicative mock state to beforeEach:

describe('My Tests', () => {
  let mockLogger: Logger
  let mockTrackEvent: TrackEventFn

  beforeEach(() => {
    mockLogger = {
      error: mock(() => {}),
      warn: mock(() => {}),
      info: mock(() => {}),
      debug: mock(() => {}),
    }
    mockTrackEvent = mock(() => {})
  })

  afterEach(() => {
    mock.restore()
  })

  test('works with injected dependencies', async () => {
    await myFunction({ logger: mockLogger, trackEvent: mockTrackEvent })
    expect(mockTrackEvent).toHaveBeenCalled()
  })
})

Constants and Configuration

Important constants are centralized in common/src/constants.ts:

CREDITS_REFERRAL_BONUS: Credits awarded for successful referral
Credit limits for different user types

Referral System

IMPORTANT: Referral codes must be applied through the npm-app CLI, not through the web interface.

Web onboarding flow shows instructions for entering codes in CLI
Users must type their referral code in the Codebuff terminal after login
Auto-redemption during web login was removed to prevent abuse
The handleReferralCode function in npm-app/src/client.ts handles CLI redemption
The redeemReferralCode function in web/src/app/api/referrals/helpers.ts processes the actual credit granting

OAuth Referral Code Preservation

Problem: NextAuth doesn't preserve referral codes through OAuth flow because:

NextAuth generates its own state parameter for CSRF/PKCE protection
Custom state parameters are ignored/overwritten
OAuth callback URLs don't always survive the round trip

Solution: Multi-layer approach implemented in SignInButton and ReferralRedirect components:

Primary: Use absolute callback URLs with referral codes for better NextAuth preservation
Fallback: Store referral codes in localStorage before OAuth starts
Recovery: ReferralRedirect component on home page catches missed referrals and redirects to onboard page

Environment Variables

This project uses Infisical for secret management. All secrets are injected at runtime.

Release Process

The release mechanism uses the CODEBUFF_GITHUB_TOKEN environment variable directly. The old complex GitHub App token generation system has been removed in favor of using a simple personal access token or the infisical-managed token.

To run any service locally, use the exec runner script from root package.json, which wraps commands with infisical run --.

Example: bun run exec -- bun --cwd backend dev

Environment variables are defined and validated in packages/internal/src/env.ts. This module provides type-safe env objects for use throughout the monorepo.

Bun Wrapper Script

The .bin/bun script automatically wraps bun commands with infisical when secrets are needed. It prevents nested infisical calls by checking for NEXT_PUBLIC_INFISICAL_UP environment variable, ensuring infisical runs only once at the top level while nested bun commands inherit the environment variables.

Worktree Support: The wrapper automatically detects and loads .env.worktree files when present, allowing worktrees to override Infisical environment variables (like ports) for local development. This enables multiple worktrees to run simultaneously on different ports without conflicts.

The wrapper also loads environment variables in the correct precedence order:

Infisical secrets are loaded first (if needed)
.env.worktree is loaded second to override any conflicting variables
This ensures worktree-specific overrides (like custom ports) always take precedence over cached Infisical defaults

The wrapper looks for .env.worktree in the project root directory, making it work consistently regardless of the current working directory when bun commands are executed.

Performance Optimizations: The wrapper uses --silent flag with Infisical to reduce CLI output overhead and sets INFISICAL_DISABLE_UPDATE_CHECK=true to skip version checks for faster startup times.

Infisical Caching: The wrapper implements robust caching of environment variables in .infisical-cache with a 15-minute TTL (configurable via INFISICAL_CACHE_TTL). This reduces startup time from ~1.2s to ~0.16s (87% improvement). The cache uses infisical export which outputs secrets directly in KEY='value' format, ensuring ONLY Infisical-managed secrets are cached (no system environment variables). Multi-line secrets like RSA private keys are handled correctly using source command. Cache automatically invalidates when .infisical.json is modified or after TTL expires. Uses subshell execution to avoid changing the main shell's working directory.

Session Validation: The wrapper detects expired Infisical sessions using infisical export with a robust 10-second timeout implementation that works cross-platform (macOS and Linux). Uses background processes with polling to prevent hanging on interactive prompts. Valid sessions output environment variables in KEY='value' format, while expired sessions either output interactive prompts or timeout. Provides clear error messages directing users to run infisical login.

Python Package

A Python package skeleton exists in python-app. Currently a placeholder that suggests installing the npm version.

Project Templates

Codebuff provides starter templates for initializing new projects:

codebuff --create <template> [project-name]

Templates are maintained in the codebuff community repo. Each directory corresponds to a template usable with the --create flag.

Database Schema and Migrations

Important: When adding database indexes or schema changes, modify the schema file directly (common/src/db/schema.ts) using Drizzle's index syntax, then run the migration generation script to create the actual migration files.

Do NOT write migration SQL files directly. The proper workflow is:

Update common/src/db/schema.ts with new indexes using Drizzle syntax
Run the migration generation script to create the SQL migration files
Apply the migrations using the deployment process

Example of adding performance indexes:

index('idx_table_optimized')
  .on(table.column1, table.column2)
  .where(sql`${table.status} = 'completed'`)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Codebuff

Project Goals

Key Technologies

Main Components

WebSocket Communication Flow

Tool Handling System

Agent System

Shell Shims (Direct Commands)

Package Management

TypeScript Build State Management

Cleaning Build State

Common Issues

Error Handling Philosophy

Error Handling and Debugging

Security Considerations

API Endpoint Architecture

Dependency Injection Pattern

Testing Guidelines

Dependency Injection (Primary Approach)

When to Use spyOn (Secondary Approach)

Avoid mock.module()

Test Setup Patterns

Constants and Configuration

Referral System

OAuth Referral Code Preservation

Environment Variables

Release Process

Bun Wrapper Script

Python Package

Project Templates

Database Schema and Migrations

FilesExpand file tree

knowledge.md

Latest commit

History

knowledge.md

File metadata and controls

Codebuff

Project Goals

Key Technologies

Main Components

WebSocket Communication Flow

Tool Handling System

Agent System

Shell Shims (Direct Commands)

Package Management

TypeScript Build State Management

Cleaning Build State

Common Issues

Error Handling Philosophy

Error Handling and Debugging

Security Considerations

API Endpoint Architecture

Dependency Injection Pattern

Testing Guidelines

Dependency Injection (Primary Approach)

When to Use spyOn (Secondary Approach)

Avoid mock.module()

Test Setup Patterns

Constants and Configuration

Referral System

OAuth Referral Code Preservation

Environment Variables

Release Process

Bun Wrapper Script

Python Package

Project Templates

Database Schema and Migrations