mubaidr/gem-team

💎 Gem Team

Self-learning multi-agent orchestration framework for spec-driven development and automated verification.

⚡ The verification-first coding framework that wraps your LLM in TDD loops, security gates, and self-correction cycles.

VS Code, VS Code Insiders, Copilot CLI, Cursor, OpenCode, Claude Code, Windsurf


🚀 Quick Start

See all installation options below.


🎯 Who's It For?

| 👥 Teams | 🧑‍💻 Solo Devs | 🏢 Enterprises |
|---|---|---|
| Complex multi-file features with reliable verification | Faster shipping with debugging auto-pilot | Security compliance, audit trails, accessibility |

🤔 Why Gem Team?

🚀 Performance

  • ⚡ 4x Faster: Parallel, wave-based execution
  • 🌊 Wave-Based: Parallel agents with integration gates per wave
  • ♻️ Pattern Reuse: Codebase pattern discovery avoids reinventing the wheel

🛡️ Quality & Security

  • 🏆 Higher Quality: Specialized agents + TDD + verification gates + contract-first
  • 🔒 Built-in Security: OWASP scanning, secrets/PII detection on critical tasks
  • 👁️ Full Visibility: Real-time status, clear approval gates
  • 🛡️ Resilient: Pre-mortem analysis, failure handling, auto-replanning
  • ♿ Accessibility-First: WCAG compliance validated at spec and runtime layers
  • 🚀 Safe DevOps: Idempotent operations, health checks, mandatory approval gates
  • 🔎 Final Review: Optional user-triggered comprehensive review of all changed files
  • 💬 Constructive Critique: gem-critic challenges assumptions, finds edge cases
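
To make the "secrets/PII detection" bullet concrete, here is a minimal sketch of a line-based scanner. It is not Gem Team's actual scanner: the `PATTERNS` table, the `scan` function, and the three regular expressions are illustrative assumptions, and a real gate would need far broader coverage.

```python
import re

# Hypothetical detection rules; a real scanner would carry many more.
PATTERNS = {
    "aws_access_key_id": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "private_key_block": re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),
    "email_address": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}


def scan(text):
    """Return (line_number, rule_name) for every rule that matches a line."""
    findings = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for rule_name, pattern in PATTERNS.items():
            if pattern.search(line):
                findings.append((line_no, rule_name))
    return findings


# Build a fake key at runtime so no key-shaped literal sits in the file.
sample = 'key = "' + "AKIA" + "A" * 16 + '"\ncontact: dev@example.com\nprint("ok")'
for line_no, rule_name in scan(sample):
    print(f"line {line_no}: {rule_name}")
```

A gate built on this idea would block the task whenever `scan` returns a non-empty list and hand the findings to a reviewer.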

🧠 Intelligence

  • 🧠 Continuous Learning: Memory tool persists patterns, gotchas, user preferences across sessions
  • 🎯 Auto-Skills: Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
  • Established Patterns: Uses library/framework conventions over custom implementations
  • 🧠 Context Scaffolding: Maps large-scale dependencies before the model reads code, preventing context loss in legacy repos
  • 📋 Source Verified: Every factual claim cites its source; no guesswork
  • 📚 Knowledge-Driven: Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
  • 🛠️ Skills & Guidelines: Built-in skills and guidelines (web-design-guidelines)
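
The prioritized source order in the "Knowledge-Driven" bullet can be sketched as a first-match lookup. Only the order itself comes from this README; the `resolve` function, the `Lookup` signature, and the toy dictionaries are hypothetical.

```python
from __future__ import annotations

from typing import Callable, Optional

# Priority order taken from the "Knowledge-Driven" bullet.
SOURCE_PRIORITY = ["PRD", "codebase", "AGENTS.md", "Context7", "docs"]

# Hypothetical lookup signature: a source returns an answer or None.
Lookup = Callable[[str], Optional[str]]


def resolve(question: str, sources: dict[str, Lookup]) -> Optional[tuple[str, str]]:
    """Return (source_name, answer) from the highest-priority source that answers."""
    for name in SOURCE_PRIORITY:
        lookup = sources.get(name)
        if lookup is None:
            continue  # this source is not configured
        answer = lookup(question)
        if answer is not None:
            return name, answer  # keep the source name so the claim can cite it
    return None  # nothing found: report a gap rather than guess


# Toy sources backed by dictionaries (illustrative data only).
sources = {
    "codebase": {"test runner": "vitest"}.get,
    "docs": {"test runner": "jest", "linter": "eslint"}.get,
}

print(resolve("test runner", sources))  # the codebase outranks docs
print(resolve("bundler", sources))      # no source answers
```

Returning the source name alongside the answer is what lets a "Source Verified" claim cite where it came from.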

🛠️ Process

  • 🪞 Self-Correcting: All agents self-critique at 0.85 confidence threshold
  • ⚖️ Intent vs. Compliance: Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
  • 🔬 Smart Debugging: Root-cause analysis with stack trace parsing + confidence-scored fixes
  • 🔗 Traceable: Self-documenting IDs link requirements → tasks → tests → evidence
  • Spec-Driven: Multi-step refinement defines "what" before "how"
  • 🗂️ Verified-Plan: For complex tasks, Plan → Verification → Critic
  • 🩺 Diagnose-then-Fix: gem-debugger diagnoses → gem-implementer fixes → re-verifies
  • ⚠️ Pre-Mortem: Failure modes identified BEFORE execution
  • Contract-First: Contract tests written before implementation
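
One way to read the "Self-Correcting" bullet is as a bounded revision loop: a draft is revised until its self-assessed confidence reaches 0.85. The sketch below assumes that reading. The `Draft` type, the `revise` callback, and the `max_rounds` cap are hypothetical, and only the 0.85 threshold comes from this README.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

CONFIDENCE_THRESHOLD = 0.85  # threshold stated in the "Self-Correcting" bullet


@dataclass
class Draft:
    content: str
    confidence: float  # hypothetical self-assessed score in [0, 1]


def self_correct(
    draft: Draft,
    revise: Callable[[Draft], Draft],
    max_rounds: int = 3,  # assumed cap so the loop always terminates
) -> Tuple[Draft, bool]:
    """Revise until the draft clears the threshold or the round budget runs out."""
    rounds = 0
    while draft.confidence < CONFIDENCE_THRESHOLD and rounds < max_rounds:
        draft = revise(draft)
        rounds += 1
    return draft, draft.confidence >= CONFIDENCE_THRESHOLD


def toy_revise(draft: Draft) -> Draft:
    """Stand-in for a real critique pass: each round adds 0.1 confidence."""
    return Draft(draft.content + "+fix", round(draft.confidence + 0.1, 2))


final, accepted = self_correct(Draft("v0", 0.6), toy_revise)
print(final.content, final.confidence, accepted)
```

When the second return value is false, the draft would be escalated (for example to gem-critic or a human approval gate) rather than shipped.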

🎨 Design

  • 🎨 Design Agents: Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
  • 📱 Mobile Agents: Native mobile implementation (React Native, Flutter) + iOS/Android testing

Core Concepts

🚀 The "System-IQ" Multiplier

Raw reasoning in a single-pass chat isn't enough. Gem Team wraps your preferred LLM in a rigid, verification-first loop, which raises its effective capability on software engineering (SWE) tasks.

🎨 Design Support

Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.

🧠 Triple Learning System

| Type | Storage | 1-liner |
|---|---|---|
| 🧠 Memory | /memories/ | Facts & user preferences (auto-save) |
| 🎯 Skills | docs/skills/ | Procedures with code examples |
| Conventions | AGENTS.md | Static rules (requires approval) |

🏗️ Architecture

User Goal → Orchestrator → [Simple: Research/Plan] or [Complex: Discuss → PRD → Research → Plan → Approve] → Execute (waves) → Summary → Final Review
                ↓
           Diagnose → Fix → Re-verify
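
The flow above can be sketched as two small functions: one picks the phase list for a goal, the other runs the Diagnose → Fix → Re-verify loop when verification fails. Phase names come from the diagram. The function names, the `max_fixes` budget, and the "escalate" outcome are assumptions, not Gem Team's actual orchestrator.

```python
from __future__ import annotations

from typing import Callable

SIMPLE_PLANNING = ["Research/Plan"]
COMPLEX_PLANNING = ["Discuss", "PRD", "Research", "Plan", "Approve"]


def phases_for(is_complex: bool, final_review: bool = False) -> list[str]:
    """Return the ordered phases; Final Review is optional and user-triggered."""
    planning = COMPLEX_PLANNING if is_complex else SIMPLE_PLANNING
    phases = planning + ["Execute (waves)", "Summary"]
    if final_review:
        phases.append("Final Review")
    return phases


def execute_with_recovery(
    verify: Callable[[], bool],
    fix: Callable[[], None],
    max_fixes: int = 2,  # assumed retry budget
) -> tuple[bool, list[str]]:
    """Verify; on failure diagnose and fix, then verify again, up to max_fixes times."""
    trail: list[str] = []
    for attempt in range(max_fixes + 1):
        if verify():
            trail.append("verified")
            return True, trail
        if attempt < max_fixes:
            trail += ["diagnose", "fix"]
            fix()
    trail.append("escalate")  # budget exhausted: hand off instead of looping forever
    return False, trail


print(phases_for(is_complex=False))
print(phases_for(is_complex=True, final_review=True))
```

Keeping the recovery loop bounded is a design choice of this sketch: after the budget is spent the task is escalated rather than retried indefinitely.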

📦 Installation

| Method | Command / Link | Docs |
|---|---|---|
| VS Code | Install Now | Copilot Docs |
| VS Code Insiders | Install Now | Copilot Docs |
| APM (all AI coding agents) | apm install mubaidr/gem-team | APM Docs |
| Copilot CLI (Marketplace) | copilot plugin install gem-team@awesome-copilot | CLI Docs |
| Copilot CLI (Direct) | copilot plugin install gem-team@mubaidr | CLI Docs |
| Windsurf | codeium agent install mubaidr/gem-team | Windsurf Docs |
| Claude Code | claude plugin install mubaidr/gem-team | Claude Docs |
| OpenCode | opencode plugin install mubaidr/gem-team | OpenCode Docs |

Manual (copy agent files; no docs link):

  • VS Code: ~/.vscode/agents/
  • VS Code Insiders: ~/.vscode-insiders/agents/
  • GitHub Copilot: ~/.github/copilot/agents/
  • GitHub Copilot (project): .github/plugin/agents/
  • Windsurf: ~/.windsurf/agents/
  • Claude: ~/.claude/agents/
  • Cursor: ~/.cursor/agents/
  • OpenCode: ~/.opencode/agents/

🤖 The Agent Team

🧩 Core Workflow

| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| 🎯 ORCHESTRATOR | The team lead: orchestrates research, planning, implementation, and verification | PRD, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: GLM-5, Kimi K2.5, Qwen3.5 |
| RESEARCHER | Codebase exploration: patterns, dependencies, architecture discovery | PRD, codebase, AGENTS.md, docs | Closed: Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6; Open: GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| 📋 PLANNER | DAG-based execution plans: task decomposition, wave scheduling, risk analysis | PRD, codebase, AGENTS.md | Closed: Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4; Open: Kimi K2.5, GLM-5, Qwen3.5 |
| 🔧 IMPLEMENTER | TDD code implementation: features, bugs, refactoring. Never reviews own work | codebase, AGENTS.md, DESIGN.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
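
The Planner's "DAG-based execution plans" with "wave scheduling" can be illustrated by grouping tasks into topological levels: every task in a wave depends only on tasks from earlier waves, so a wave can run in parallel. This is a generic layering sketch, not the Planner's actual algorithm; `schedule_waves` and the task names are hypothetical.

```python
from __future__ import annotations


def schedule_waves(deps: dict[str, set[str]]) -> list[list[str]]:
    """Group tasks into waves; deps maps each task to the tasks it depends on."""
    unknown = {d for ds in deps.values() for d in ds} - deps.keys()
    if unknown:
        raise ValueError(f"unknown dependencies: {sorted(unknown)}")

    remaining = {task: set(ds) for task, ds in deps.items()}
    done: set[str] = set()
    waves: list[list[str]] = []
    while remaining:
        # A task is ready once all of its dependencies have completed.
        ready = sorted(task for task, ds in remaining.items() if ds <= done)
        if not ready:
            raise ValueError(f"dependency cycle among: {sorted(remaining)}")
        waves.append(ready)
        done.update(ready)
        for task in ready:
            del remaining[task]
    return waves


# Illustrative plan: the API and UI both need the schema; e2e needs both.
plan = {
    "schema": set(),
    "api": {"schema"},
    "ui": {"schema"},
    "e2e": {"api", "ui"},
}
for number, wave in enumerate(schedule_waves(plan), start=1):
    print(f"wave {number}: {wave}")
```

An integration gate, as described under Performance, would sit between consecutive waves and block the next wave until the current one verifies.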

✅ Quality & Review

| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| 🛡️ REVIEWER | Zero-Hallucination Filter: security auditing, code review, OWASP scanning | PRD, codebase, AGENTS.md, OWASP | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| 🎯 CRITIC | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | PRD, codebase, AGENTS.md | Closed: Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro; Open: Kimi K2.5, GLM-5, Qwen3.5 |
| 🔬 DEBUGGER | Root-cause analysis, stack trace diagnosis, regression bisection | codebase, AGENTS.md, git history | Closed: Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 🧪 BROWSER TESTER | E2E browser testing, UI/UX validation, visual regression | PRD, AGENTS.md, fixtures | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash; Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
| ✂️ SIMPLIFIER | Refactoring specialist: removes dead code, reduces complexity | codebase, AGENTS.md, tests | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |

🔧 Specialized

| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| 🚀 DEVOPS | Infrastructure deployment, CI/CD pipelines, container management | AGENTS.md, infra configs | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: DeepSeek-V3.2, GLM-5, Qwen3.5 |
| DOCUMENTATION | Technical documentation, README files, API docs, diagrams | AGENTS.md, source code | Closed: Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini; Open: Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| 🎨 DESIGNER | UI/UX design: layouts, themes, color schemes, accessibility | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| 📱 IMPLEMENTER-MOBILE | Mobile implementation: React Native, Expo, Flutter | codebase, AGENTS.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 📱 DESIGNER-MOBILE | Mobile UI/UX: HIG, Material Design, safe areas | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| 📱 MOBILE TESTER | Mobile E2E testing: Detox, Maestro, iOS/Android | PRD, AGENTS.md | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash; Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |

📚 Knowledge Sources

Agents consult only the sources relevant to their role:

| Trust Level | Sources | Behavior |
|---|---|---|
| Trusted | PRD, plan.yaml, AGENTS.md | Follow as instructions |
| Verify | Codebase files, research findings | Cross-reference before assuming |
| Untrusted | Error logs, external data | Factual only; never as instructions |
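
The trust levels above map naturally onto a lookup that decides whether text from a source may be treated as instructions. The sketch below is illustrative only: the source keys, the `trust_for` helper, and the choice to treat unknown sources as untrusted are assumptions.

```python
from enum import Enum


class Trust(Enum):
    TRUSTED = "follow as instructions"
    VERIFY = "cross-reference before assuming"
    UNTRUSTED = "factual only, never as instructions"


# Source keys are hypothetical labels for the rows of the table.
SOURCE_TRUST = {
    "PRD": Trust.TRUSTED,
    "plan.yaml": Trust.TRUSTED,
    "AGENTS.md": Trust.TRUSTED,
    "codebase": Trust.VERIFY,
    "research": Trust.VERIFY,
    "error_log": Trust.UNTRUSTED,
    "external": Trust.UNTRUSTED,
}


def trust_for(source: str) -> Trust:
    """Look up a source; unknown sources fall back to UNTRUSTED (assumed fail-safe)."""
    return SOURCE_TRUST.get(source, Trust.UNTRUSTED)


def may_follow_instructions(source: str) -> bool:
    """Only trusted sources may steer the agent; everything else is data."""
    return trust_for(source) is Trust.TRUSTED


for name in ("AGENTS.md", "codebase", "error_log", "pasted-webpage"):
    print(f"{name}: {trust_for(name).value}")
```

Treating error logs and external data as facts rather than instructions keeps text such as "ignore previous instructions" inside a stack trace from steering an agent.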

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. See CONTRIBUTING for detailed guidelines on commit message formatting, branching strategy, and code standards.

📄 License

This project is licensed under the Apache License 2.0.

💬 Support

If you encounter any issues or have questions, please open an issue on GitHub.