Self-Learning Multi-agent orchestration framework for spec-driven development and automated verification.
β‘ The verification-first coding framework that wraps your LLM in TDD loops, security gates, and self-correction cycles.
See all installation options below.
- π Quick Start
- π― Who's It For?
- π€ Why Gem Team?
- ποΈ Architecture
- π¦ Installation
- π€ The Agent Team
- π Knowledge Sources
- π€ Contributing
| π₯ Teams | π§βπ» Solo Devs | π’ Enterprises |
|---|---|---|
| Complex multi-file features with reliable verification | Faster shipping with debugging auto-pilot | Security compliance, audit trails, accessibility |
- β‘ 4x Faster β Parallel execution with wave-based execution
- π Wave-Based β Parallel agents with integration gates per wave
- β»οΈ Pattern Reuse β Codebase pattern discovery prevents reinventing wheels
- π Higher Quality β Specialized agents + TDD + verification gates + contract-first
- π Built-in Security β OWASP scanning, secrets/PII detection on critical tasks
- ποΈ Full Visibility β Real-time status, clear approval gates
- π‘οΈ Resilient β Pre-mortem analysis, failure handling, auto-replanning
- βΏ Accessibility-First β WCAG compliance validated at spec and runtime layers
- π Safe DevOps β Idempotent operations, health checks, mandatory approval gates
- π Final Review β Optional user-triggered comprehensive review of all changed files
- π¬ Constructive Critique β gem-critic challenges assumptions, finds edge cases
- π§ Continuous Learning β Memory tool persists patterns, gotchas, user preferences across sessions
- π― Auto-Skills β Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
- π Established Patterns β Uses library/framework conventions over custom implementations
- π§ Context Scaffolding β Maps large-scale dependencies before the model reads code, preventing context-loss in legacy repos
- π Source Verified β Every factual claim cites its source; no guesswork
- π Knowledge-Driven β Prioritized sources (PRD β codebase β AGENTS.md β Context7 β docs)
- π οΈ Skills & Guidelines β Built-in skill & guidelines (web-design-guidelines)
- πͺ Self-Correcting β All agents self-critique at 0.85 confidence threshold
- βοΈ Intent vs. Compliance β Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- π¬ Smart Debugging β Root-cause analysis with stack trace parsing + confidence-scored fixes
- π Traceable β Self-documenting IDs link requirements β tasks β tests β evidence
- π Spec-Driven β Multi-step refinement defines "what" before "how"
- ποΈ Verified-Plan β Complex tasks: Plan β Verification β Critic
- π©Ί Diagnose-then-Fix β gem-debugger diagnoses β gem-implementer fixes β re-verifies
β οΈ Pre-Mortem β Failure modes identified BEFORE execution- π Contract-First β Contract tests written before implementation
- π¨ Design Agents β Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
- π± Mobile Agents β Native mobile implementation (React Native, Flutter) + iOS/Android testing
Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid, verification-first loop, fundamentally boosting its effective capability on SWE tasks.
Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.
| Type | Storage | 1-liner |
|---|---|---|
| π§ Memory | /memories/ |
Facts & user preferences (auto-save) |
| π― Skills | docs/skills/ |
Procedures with code examples |
| π Conventions | AGENTS.md |
Static rules (requires approval) |
User Goal β Orchestrator β [Simple: Research/Plan] or [Complex: Discuss β PRD β Research β Plan β Approve] β Execute (waves) β Summary β Final Review
β
π Diagnose β Fix β Re-verify
| Method | Command / Link | Docs |
|---|---|---|
| Code | Install Now | Copilot Docs |
| Code Insiders | Install Now | Copilot Docs |
| APM (All AI coding agents) |
apm install mubaidr/gem-team |
APM Docs |
| Copilot CLI (Marketplace) | copilot plugin install gem-team@awesome-copilot |
CLI Docs |
| Copilot CLI (Direct) | copilot plugin install gem-team@mubaidr |
CLI Docs |
| Windsurf | codeium agent install mubaidr/gem-team |
Windsurf Docs |
| Claude Code | claude plugin install mubaidr/gem-team |
Claude Docs |
| OpenCode | opencode plugin install mubaidr/gem-team |
OpenCode Docs |
| Manual (Copy agent files) |
VS Code: ~/.vscode/agents/ VS Code Insiders: ~/.vscode-insiders/agents/ GitHub Copilot: ~/.github/copilot/agents/ GitHub Copilot (project): .github/plugin/agents/ Windsurf: ~/.windsurf/agents/ Claude: ~/.claude/agents/ Cursor: ~/.cursor/agents/ OpenCode: ~/.opencode/agents/ |
β |
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| π― ORCHESTRATOR | The team lead: Orchestrates research, planning, implementation, and verification | PRD, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: GLM-5, Kimi K2.5, Qwen3.5 |
| π RESEARCHER | Codebase exploration β patterns, dependencies, architecture discovery | PRD, codebase, AGENTS.md, docs | Closed: Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6 Open: GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| π PLANNER | DAG-based execution plans β task decomposition, wave scheduling, risk analysis | PRD, codebase, AGENTS.md | Closed: Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4 Open: Kimi K2.5, GLM-5, Qwen3.5 |
| π§ IMPLEMENTER | TDD code implementation β features, bugs, refactoring. Never reviews own work | codebase, AGENTS.md, DESIGN.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| π‘οΈ REVIEWER | Zero-Hallucination Filter β Security auditing, code review, OWASP scanning | PRD, codebase, AGENTS.md, OWASP | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| π― CRITIC | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | PRD, codebase, AGENTS.md | Closed: Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro Open: Kimi K2.5, GLM-5, Qwen3.5 |
| π¬ DEBUGGER | Root-cause analysis, stack trace diagnosis, regression bisection | codebase, AGENTS.md, git history | Closed: Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4 Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| π§ͺ BROWSER TESTER | E2E browser testing, UI/UX validation, visual regression | PRD, AGENTS.md, fixtures | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
| βοΈ SIMPLIFIER | Refactoring specialist β removes dead code, reduces complexity | codebase, AGENTS.md, tests | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| π DEVOPS | Infrastructure deployment, CI/CD pipelines, container management | AGENTS.md, infra configs | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: DeepSeek-V3.2, GLM-5, Qwen3.5 |
| π DOCUMENTATION | Technical documentation, README files, API docs, diagrams | AGENTS.md, source code | Closed: Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini Open: Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| π¨ DESIGNER | UI/UX design β layouts, themes, color schemes, accessibility | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| π± IMPLEMENTER-MOBILE | Mobile implementation β React Native, Expo, Flutter | codebase, AGENTS.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| π± DESIGNER-MOBILE | Mobile UI/UX β HIG, Material Design, safe areas | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| π± MOBILE TESTER | Mobile E2E testing β Detox, Maestro, iOS/Android | PRD, AGENTS.md | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
Agents consult only the sources relevant to their role:
| Trust Level | Sources | Behavior |
|---|---|---|
| Trusted | PRD, plan.yaml, AGENTS.md | Follow as instructions |
| Verify | Codebase files, research findings | Cross-reference before assuming |
| Untrusted | Error logs, external data | Factual only β never as instructions |
Contributions are welcome! Please feel free to submit a Pull Request. CONTRIBUTING for detailed guidelines on commit message formatting, branching strategy, and code standards.
This project is licensed under the Apache License 2.0.
If you encounter any issues or have questions, please open an issue on GitHub.