15 changes: 15 additions & 0 deletions CHANGELOG.md
@@ -1,5 +1,20 @@
# Changelog

## [0.15.2.0] - 2026-04-01 - Multi-Model Debate

You can now pit Claude against Codex in a structured debate on any codebase question. `/debate should we use Redis or Postgres` runs multiple rounds where each model cites file:line evidence, rebuts the other's arguments, and recommends a resolution. The debate stops when both sides converge, one concedes, or the round cap hits. You're always the final judge.

### Added

- **`/debate` skill.** Structured multi-model debate with convergence detection. Each round produces: position, evidence with file:line citations, rebuttal, concession, and recommended resolution. Default 3 rounds, hard cap 5.
- **`--interactive` / `-i` flag.** Opt-in mid-debate checkpoints where you see each round's arguments and choose whether to continue, stop, or redirect. Without the flag, rounds run automatically.
- **Codex fallback.** If Codex CLI isn't installed, the debate uses a Claude adversarial subagent instead. Same structured output contract, same convergence detection. Transcript labels the source so you know which model argued what.
- **Debate transcripts.** Every debate saves a full transcript to `.context/debate-transcript-{timestamp}.md` with all rounds, convergence checks, synthesis, and your final decision.
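
As a rough illustration of the round loop and stop conditions described above, here is a minimal sketch. It is not the skill's actual implementation: the `RoundResult` field names, the `run_debate` and `clamp_rounds` helpers, the string-equality convergence check, and the timestamp format are all assumptions made for the example. Only the default of 3 rounds, the hard cap of 5, and the transcript path pattern come from the changelog.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Callable, List, Optional, Tuple

DEFAULT_ROUNDS = 3  # default stated in the changelog
HARD_CAP = 5        # hard cap stated in the changelog


@dataclass
class RoundResult:
    """One side's structured output for a round (field names are hypothetical)."""
    position: str
    evidence: List[str]   # "file:line" citations
    rebuttal: str
    concession: bool      # True when this side concedes
    resolution: str       # recommended resolution


# A debater takes the question and the history so far and returns its round output.
Debater = Callable[[str, list], RoundResult]


def clamp_rounds(requested: Optional[int]) -> int:
    """Apply the default and keep the round count within 1..HARD_CAP."""
    if requested is None:
        return DEFAULT_ROUNDS
    return max(1, min(requested, HARD_CAP))


def run_debate(question: str, side_a: Debater, side_b: Debater,
               max_rounds: Optional[int] = None) -> Tuple[list, str]:
    """Run rounds until one side concedes, both converge, or the cap is reached."""
    history: list = []
    for _ in range(clamp_rounds(max_rounds)):
        a = side_a(question, history)
        b = side_b(question, history)
        history.append((a, b))
        if a.concession or b.concession:
            return history, "conceded"
        # Assumption: convergence is modelled as identical resolutions.
        # The real skill's convergence detection is not shown in this diff.
        if a.resolution == b.resolution:
            return history, "converged"
    return history, "round cap"


def transcript_path(now: datetime) -> str:
    """Fill the {timestamp} slot; the timestamp format here is an assumption."""
    return f".context/debate-transcript-{now.strftime('%Y%m%dT%H%M%SZ')}.md"


if __name__ == "__main__":
    def stubborn(q: str, h: list) -> RoundResult:
        return RoundResult("Postgres", ["db/schema.sql:12"], "n/a", False, "use Postgres")

    def persuadable(q: str, h: list) -> RoundResult:
        # Switches to the other side's resolution after the first round.
        res = "use Redis" if not h else "use Postgres"
        return RoundResult("Redis", ["cache/client.py:40"], "n/a", False, res)

    rounds, reason = run_debate("Redis or Postgres?", persuadable, stubborn)
    print(len(rounds), reason)
    print(transcript_path(datetime.now(timezone.utc)))
```

In this sketch the caller remains the final judge: `run_debate` only reports why the loop stopped and leaves the decision to whoever reads the history.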

### Cost profile

This skill is intentionally expensive. A 3-round debate costs roughly the same as a full `/codex review`. We tested lower reasoning effort and it produced shallower arguments that missed real issues. The cost lever is `--max-rounds`, not reasoning quality.

## [0.15.1.0] - 2026-04-01 - Design Without Shotgun

You can now run `/design-html` without having to run `/design-shotgun` first. The skill detects what design context exists (CEO plans, design review artifacts, approved mockups) and asks how you want to proceed. Start from a plan, a description, or a provided PNG, not just an approved mockup.
1 change: 1 addition & 0 deletions CLAUDE.md
@@ -86,6 +86,7 @@ gstack/
├── benchmark/ # /benchmark skill (performance regression detection)
├── canary/ # /canary skill (post-deploy monitoring loop)
├── codex/ # /codex skill (multi-AI second opinion via OpenAI Codex CLI)
├── debate/ # /debate skill (multi-model structured debate with convergence)
├── land-and-deploy/ # /land-and-deploy skill (merge → deploy → canary verify)
├── office-hours/ # /office-hours skill (YC Office Hours - startup diagnostic + builder brainstorm)
├── investigate/ # /investigate skill (systematic root-cause debugging)
2 changes: 1 addition & 1 deletion VERSION
@@ -1 +1 @@
-0.15.1.0
+0.15.2.0
912 changes: 912 additions & 0 deletions debate/SKILL.md

Large diffs are not rendered by default.
