Explorbot

The vibe-testing agent for web applications.

Explorbot explores your web app like a curious human would — clicking around, filling forms, finding bugs, and learning as it goes. No test scripts required. Just point it at your app and let it work.

explorbot start https://your-app.com

Explorbot is your first assitant in testing. It will do its best to use your application with no babysitting. It will use application and provide you valuable feedback.

Use Cases

Autonomously test web application or its parts
Discover test scenarios and get automated tests for them
Write manual test cases from exploring website
24h/7d of monkey-testing for web application that can reveal hidden errors
Quick-test for MVPs and prototypes

Explorbot can start testing features which were not covered by unit tests or browser tests.

Demo

Requirements

Bun (not Node.js)
AI provider API key — Groq, Cerebras, OpenAI, or Anthropic
Modern terminal — iTerm2, WARP, Kitty, Ghostty. WSL if running on Windows
Compatible web app — Check docs/prerequisites.md to verify your app works with Explorbot

Quick Start

1. Install dependencies

bun install
bunx playwright install

2. Initialize config

explorbot init

3. Edit explorbot.config.js — set your app URL and AI provider:

Important

Explorbot uses two tiers of models. Most agents (Tester, Navigator, Researcher) are token-hungry — they process full HTML and ARIA snapshots on every iteration. Use a fast, cheap model like gpt-oss-20b for these. Pilot is different — it only reads compact action logs and makes high-level decisions, so it benefits from a smarter reasoning model. Recommended providers for base model (100+ TPS): Groq, Cerebras. See OpenRouter for fastest models.

Groq is used in this example but you can use any provider supported by Vercel AI SDK. See docs/providers.md for other providers.

import { createGroq } from '@ai-sdk/groq';
import { createOpenAI } from '@ai-sdk/openai';

const groq = createGroq({
  apiKey: process.env.GROQ_API_KEY,
});

const openai = createOpenAI({
  apiKey: process.env.OPENAI_API_KEY,
});

export default {
  playwright: {
    browser: 'chromium',
    url: 'https://your-app.com',     // <-- Your app URL
  },
  ai: {
    provider: groq,
    model: 'gpt-oss-20b',            // Fast cheap model for most agents
    visionModel: 'llama-scout-4',    // Fast vision model
    agents: {
      pilot: { provider: openai, model: 'gpt-5' },  // Smarter model for Pilot
    },
  },
};

Tip

Suggested Pilot models: GPT-5, Claude Sonnet, Kimi K2, Qwen 3. Pilot barely uses tokens (just action summaries), so a smarter model here costs very little while significantly improving test quality.

4. Add knowledge (optional but recommended)

If your app requires authentication, tell Explorbot how to log in:

# Interactive mode
explorbot know

# Or via CLI
explorbot know "/login" "Use credentials: admin@example.com / secret123"

Tip

Use --session to persist browser cookies and localStorage between runs. Log in once, and Explorbot will restore the session on next start:

explorbot start /login --session          # saves to output/session.json
explorbot start /dashboard --session      # restores session, skips login
explorbot start /app --session auth.json  # custom session file

Note

Use * as URL pattern to add general knowledge that applies to all pages. See docs/knowledge.md for more.

5. Run

explorbot start /admin/users

Start from a small functional area of your app (admin panel, settings, any CRUD section) so Explorbot can quickly understand its business purpose and context.

Browser runs headless by default — use --show to see it:

explorbot start /settings --show

Requires a modern terminal (iTerm2, WARP, Kitty, Ghostty, Windows Terminal). On Windows, use WSL.

How It Works

Explorbot explores websites, analyzes their UI, and proposes tests — which it can then execute. It controls its own browser through CodeceptJS → Playwright (no MCP involved).

flowchart LR
    N[🧭 Navigator] --> R[🔍 Researcher] --> P[📋 Planner] --> T[🧪 Tester]
    Pi[🎯 Pilot] -.->|supervises| T

🧭 Navigator	🔍 Researcher	📋 Planner	🧪 Tester
Opens pages	Analyzes UI	Generates test scenarios	Executes tests
Clicks buttons, fills forms	Discovers all interactive elements	Assigns priorities (HIGH/MED/LOW)	Adapts when things fail
Self-heals broken selectors	Expands hidden content	Balances positive & negative cases	Documents results

Run /explore in TUI or use explorbot explore from CLI to watch the cycle: research → plan → test → repeat.

Supporting components:

Pilot — supervises Tester from a separate conversation: reviews action logs, detects stuck patterns, makes final pass/fail decisions. Uses a smarter model since it only processes compact summaries, not raw HTML
Historian — saves sessions as CodeceptJS code, learns from experience
Quartermaster — analyzes pages for A11y issues (axe-core + semantic)
Reporter — sends test results to Testomat.io

Basic Usage

Once in the terminal UI:

/explore              # Full cycle: research → plan → test
/research             # Analyze current page
/plan                 # Generate test scenarios
/test                 # Run next test
/navigate /settings   # Go to a page

You can also run CodeceptJS commands directly:

I.click('Login')
I.fillField('email', 'test@example.com')
I.see('Welcome')

See docs/commands.md for all commands.

Note

Most TUI commands also have CLI equivalents that run headless and exit. For example, explorbot research <url> and explorbot plan <path> work without launching TUI. See docs/commands.md for the full mapping.

What You Get

Output	Location	Description
Test files	`output/tests/*.js`	CodeceptJS tests you can run independently
Test plans	`output/plans/*.md`	Markdown documentation of scenarios
Experience	`./experience/`	What Explorbot learned about your app

Two Ways to Run

Interactive mode — Launch TUI, guide exploration, get real-time feedback:

explorbot start https://your-app.com

Autonomous mode — Non-interactive testing and planning:

explorbot explore /admin/users

Freesail mode — Fully autonomous, continuous exploration across multiple pages:

explorbot freesail /admin              # explore and test pages indefinitely
explorbot freesail /app --deep         # depth-first: explore nearby pages first
explorbot freesail /app --shallow      # breadth-first: spread across many pages
explorbot freesail /app --scope /admin # restrict to URLs under /admin

Freesail navigates to a page, researches it, runs tests, then moves on to the next least-visited page — repeating until stopped. Also available as /freesail in TUI.

Core Philosophy

Strategic decisions are deterministic — The workflow (research → plan → test) is predictable and consistent.

Tactical decisions are AI-driven — How to click that button, what to do when a modal appears, how to recover from errors.

Cheap workers, smart manager — Tester, Navigator, and Researcher are token-hungry agents that chew through HTML and ARIA on every step. They run on fast, cheap models. Pilot is the manager — it reads only compact action logs, thinks about what went wrong, and guides Tester. Give Pilot a smarter model for better results at negligible extra cost.

Explorbot learns from its failures — It uses previous experience interacting with a web page for faster and better decisions on next runs.

Explorbot needs your knowledge — You adjust Explorbot prompts by passing suggestions, UI explanations, and domain knowledge as text files, which are loaded when the corresponding page is opened.

When tuned, Explorbot can run autonomously for hours navigating a web application and trying different scenarios. You don't need to watch it. The more Explorbot runs, the more it learns and the more complex scenarios it can test.

Teaching Explorbot

Knowledge (./knowledge/) — Tell Explorbot about your app: credentials, form rules, navigation quirks. See docs/knowledge.md.
Experience (./experience/) — Explorbot learns automatically from successful interactions and saves what works.

FAQ

Can I run it in Cursor? or Claude Code? No, Explorbot is a separate application designed for constant testing. Cursor, Codex, or Claude Code are coding agents — not relevant here.

Why do you hate Opus? Opus is great for coding. Here we need a simple model that can consume lots of HTML tokens to find the relevant ones. Leave more interesting tasks to Opus.

Is that expensive? No. It costs ~$1 per hour of running if you use Groq Cloud with gpt-oss-20b.

Does Explorbot have MCP? Not yet.

Can I build my own agents with it? Yes, use the programmatic API. See docs/scripting.md.

Ok, but I can do the same in Cursor with Playwright MCP! Good luck running it on CI! Also, you'll need to check on it every 10 seconds to see how it's running the browser.

Explorbot learns as it explores. The more it tests your app, the better it gets at testing your app. That's vibe-testing.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
.claude		.claude
.cursor		.cursor
.github/workflows		.github/workflows
assets		assets
bin		bin
docs		docs
prompts		prompts
src		src
test-data		test-data
tests		tests
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
AGENTS.md		AGENTS.md
Bunoshfile.js		Bunoshfile.js
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
README.md		README.md
biome.json		biome.json
bun.lock		bun.lock
bunfig.toml		bunfig.toml
explorbot.config.example.ts		explorbot.config.example.ts
explorbot.config.js		explorbot.config.js
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Explorbot

Use Cases

Demo

Requirements

Quick Start

How It Works

Basic Usage

What You Get

Two Ways to Run

Core Philosophy

Teaching Explorbot

Further Reading

FAQ

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

testomatio/explorbot

Folders and files

Latest commit

History

Repository files navigation

Explorbot

Use Cases

Demo

Requirements

Quick Start

How It Works

Basic Usage

What You Get

Two Ways to Run

Core Philosophy

Teaching Explorbot

Further Reading

FAQ

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages