Skip to content

feat: /debate skill — structured multi-model debate with convergence detection#736

Open
stedfn wants to merge 3 commits intogarrytan:mainfrom
stedfn:stedfn/debate-skill
Open

feat: /debate skill — structured multi-model debate with convergence detection#736
stedfn wants to merge 3 commits intogarrytan:mainfrom
stedfn:stedfn/debate-skill

Conversation

@stedfn
Copy link
Copy Markdown
Contributor

@stedfn stedfn commented Apr 1, 2026

Summary

  • /debate pits Claude against Codex (or a Claude adversarial subagent when Codex isn't installed) in structured rounds. Each round: position, evidence with file:line citations, rebuttal, concession, recommended resolution. Debate stops when both sides converge, one concedes, or the round cap hits. User is always the final judge.
  • Supports --interactive / -i for opt-in mid-debate checkpoints where you can bail, redirect, or continue after seeing each round's arguments. Without the flag, rounds run automatically.
  • Graceful Codex fallback: if Codex CLI isn't installed, dispatches a Claude adversarial subagent with the same structured output contract. Transcript labels the source (Codex vs Claude (adversarial fallback)).
  • Cost profile. This skill is intentionally expensive. A 3-round debate costs roughly the same as a full /codex review. We tested lower reasoning effort and it produced shallower arguments that missed real issues. The cost lever is --max-rounds (default 3, hard cap 5), not reasoning quality. Optimizations can be added in follow-ups.

Test plan

  • bun test passes (skill validation picks up new debate/ directory)
  • E2E test in test/skill-e2e-debate.test.ts covers core debate flow and convergence detection
  • Manual test: /debate should we use Redis or Postgres for session storage produces structured rounds with file:line evidence
  • Manual test: /debate -i <topic> pauses after round 1 with AskUserQuestion
  • Manual test without Codex: uninstall codex temporarily, verify Claude adversarial subagent fallback works

🤖 Generated with Claude Code

stedfn and others added 3 commits April 1, 2026 11:13
…detection

Two AI models (Claude + Codex) argue a specific issue in structured rounds.
Each round: position, evidence with file:line citations, rebuttal, concession,
recommended resolution. Stops on convergence, concession, or round cap.
Supports --interactive/-i for mid-debate checkpoints. Falls back to Claude
adversarial subagent when Codex is unavailable.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@stedfn stedfn changed the title feat: /debate skill — structured multi-model debate with convergence detection (v0.15.2.0) feat: /debate skill — structured multi-model debate with convergence detection Apr 1, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant