A governed skill system for long-horizon human-AI collaboration.
Stop building prompt soup. Govern the tree.
Governed Skill Tree is an organ-based skill system for human-AI collaboration.
It separates routing, phase control, tool choice, risk interception, verification, memory, and communication into explicit governed layers so a skill tree can grow without turning into prompt soup.
If MCP gives AI hands, this project explores what else must exist for AI to work more like a real collaborator: routing, control, memory, judgment, verification, and communication.
This first public batch focuses on the minimum set needed to show the design clearly.
这是一个面向人机协作的“受治理技能树”实验。
它不是简单堆一堆 prompt 或 skill,而是把一套长期会增长的 AI 能力系统,拆成更清楚的职责层:
如果把 AI 比作一个人,MCP 更像是补齐“手”和工具调用能力;而这个项目想补的是路由、控制、记忆、验证、表达这些让 AI 更像真实协作者的其他器官。
brain负责任务归类和能力路由controller负责阶段判断和阶段切换brain-tool-routing负责当前步骤的工具选择balance负责风险拦截、假设审计和影响范围约束leg负责实现与验证memory/collab负责记忆、交接和对人表达
当前公开版先放通用核心,再放一个 Unity showcase pack 作为成熟案例。
也就是说,Unity 不是这套系统的全部,而是目前最完整的链路。
If you're new to this repo, use this reading order:
-
This README
- Understand the core idea, system flow, and public scope.
-
docs/governance.md- Read this to understand authority boundaries, conflict handling, and why the system is governed instead of flattened.
-
docs/intake-pipeline.md- Read this to see how work moves through the system: bootstrap → routing → phase control → tool choice → execution → verification → handoff.
-
docs/evaluation-notes.md- Read this if you want the rationale behind the separation model and the failure modes this system is designed to reduce.
-
SKILL_TREE.md- Use this as the logical map of the current skill structure, migration status, and layering strategy.
-
Cases- Read these if you want to understand why governed structure changes system behavior over time.
-
Examples- Start with the example closest to your use case:
- Ambiguous Task Routing → best entry point for request classification
- Pre-Change Risk And Dependency Scan → best entry point for safe execution
- Unity Debug To Runtime Smoke → best entry point for domain workflow
- Evaluating a Skill Growth Request → best entry point for governed evolution
If you only read one thing after this README, read docs/governance.md first.
These cases show how governed skill systems behave differently from flat prompt or skill collections.
-
Flat Skill Pile vs Governed Flow
Why structure matters before scale. -
Skill Growth Proposal
Why not every repeated problem should create a new skill. -
Handoff and Continuity
Why long-horizon collaboration needs explicit continuity design.
-
docs/release-notes-v0.1.0.md
First public slice of the repository. -
docs/release-scope.md
What is intentionally included in the public slice, and what is intentionally left out. -
docs/release-checklist.md
Release readiness checklist for this repository.
This repo is for people who are trying to move beyond:
- flat skill folders
- giant prompt piles
- loosely coupled agent behaviors
- unclear routing between planning, execution, verification, and handoff
It is especially relevant if you care about:
- long-horizon human-AI collaboration
- governed task routing
- explicit phase control
- safer domain execution
- deliberate skill growth over time
Most skill collections collapse several responsibilities into one fuzzy layer.
This project pushes them apart on purpose:
- routing is not phase control
- phase control is not tool choice
- tool choice is not risk approval
- growth is not "just add another skill"
- domain packs are not the same thing as the generic core
That separation is the main idea.
- explicit routing authority
- phase control
- tool-surface boundaries
- risk interception
- proportional verification
- memory and collaboration rules
- a unified skill authoring template
Most agent skill sets become harder to maintain as they grow:
- boundaries start overlapping
- routing becomes fuzzy
- "just use the right tool" is not enough
- migration gets messy
- the same judgment keeps getting re-invented
This project treats the skill layer as a governed operating surface, not just a registry.
organ-based logical layeringSkills are grouped by what kind of responsibility they own.flat physical storageLive skills still sit in a flatskills/directory for runtime compatibility.hard authority boundariesRouting, phase control, tool choice, risk interception, execution, and human-facing output do not silently collapse into one layer.task intake pipelineA task should enter the system in a predictable order.gradual evolutionThe system should be able to grow without becoming a naming graveyard.
This release focuses on the smallest set that shows the system's shape clearly.
brain-workspace-bootstrapbrain-skill-orchestratorcontroller-work-state-controllerbrain-tool-routingbalance-assumption-auditbalance-dependency-impact-scanbrain-capability-gap-detectear-interaction-intent-parseeye-state-diff-inspectorcollab-tone-regulatormemory-memory-routermemory-layered-memoryleg-verification-runner
balance-unity-codex-guardeye-unity-log-debugeye-unity-scene-auditleg-unity-runtime-smoke
The Unity pack is included as a flagship domain example, not as the whole identity of the system.
The system stays governable by separating the main path of work into distinct layers:
flowchart LR
A[Workspace Bootstrap] --> B[Capability Routing]
B --> C[Phase Control]
C --> D[Tool Routing]
D --> E[Domain Skill Execution]
E --> F[Verification]
F --> G[Compression / Handoff / Storage]
routing is not phase controlbrain-skill-orchestratorchooses the capability family, but not the work phase.phase control is not tool choicecontroller-work-state-controllerowns phase transitions, but not which execution surface to use.tool choice is not risk approvalbrain-tool-routingcan choose the current tool surface, but cannot approve risky work.governance is explicitbalance-*skills can intercept unsafe moves instead of relying on vague "be careful" wording.growth is governedbrain-capability-gap-detecthelps decide whether a new skill is actually needed before the tree expands.
This repository is intentionally not:
- a giant prompt dump
- a replacement for every agent framework
- a Unity-only toolkit
- a claim that one naming scheme solves coordination by itself
The goal is more practical:
build a skill system that can keep growing without losing routing clarity, safety boundaries, or maintainability.
Default intake flow:
- bootstrap the workspace if needed
- classify the task and choose the lead organ
- determine the current work phase
- choose the current execution surface
- run the domain skill
- verify proportionally
- compress, hand off, or store the result
Read:
- SKILL_TREE.md
- docs/governance.md
- docs/intake-pipeline.md
- docs/metadata-schema.md
- docs/release-scope.md
- docs/what-this-is-not.md
. ├─ README.md ├─ SKILL_TREE.md ├─ SKILL_TEMPLATE.md ├─ cases/ ├─ docs/ ├─ examples/ ├─ skills/
The skills/ directory is intentionally flat.
That is part of the runtime compatibility model, not an accident.
If you want to try the structure in a Codex-compatible environment:
- copy the skill folders from
skills/into your$CODEX_HOME/skills - keep the physical layout flat
- use SKILL_TREE.md as the governed index
- use SKILL_TEMPLATE.md for new skill authoring
- Ambiguous Task Routing
- Pre-Change Risk And Dependency Scan
- Unity Debug To Runtime Smoke
- Skill Growth Proposal
If you want to extend the tree, start here:
New public skills should strengthen the governance model, not bypass it.
Near-term priorities:
- refine more public examples
- add clearer domain-pack boundaries
- publish evaluation and comparison notes
- expand beyond the first showcase pack carefully
Before creating the public repository or first tag, use:
This repository is licensed under Apache-2.0.
This repository is a deliberate first slice, not the entire internal system.
It prioritizes:
- the governed core
- the metadata model
- the intake pipeline
- one mature domain showcase
It intentionally does not try to publish every internal skill on day one.