Skip to content

docs: add official Ollama/llama.cpp/vLLM local model examples #3036

@Yeachan-Heo

Description

@Yeachan-Heo

Source

Discord #claw-code support: users asked whether Claw Code can run all-local with Ollama/local Qwen/llama.cpp and how to configure a non-Anthropic provider.

Problem

Community guides exist, but local/OpenAI-compatible setup should have official known-good examples and warnings.

Suggested scope

Add docs for local OpenAI-compatible providers:

  • Ollama
  • llama.cpp server
  • vLLM or other OpenAI-compatible servers

Include examples such as:

export OPENAI_BASE_URL="http://127.0.0.1:11434/v1"
export OPENAI_API_KEY="dummy"
claw --model "openai/qwen3:latest" prompt "Reply exactly HELLO_WORLD_123"

Explain:

  • use openai/ prefix for OpenAI-compatible route
  • model id must match what the local server exposes
  • raw /v1/chat/completions smoke test
  • tool-call support may be required for slash/tool workflows
  • local routes may be rougher than first-party Anthropic/OpenAI

Acceptance criteria

  • Users can configure at least one local provider from official docs.
  • Docs include a one-shot prompt smoke test.
  • Docs warn about tool-call/response-shape limitations.


[repo owner's gaebal-gajae (clawdbot) 🦞]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions