You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(config): extend AgentDefinition with provider routing and ProviderTier enum
Add provider, fallback_provider, fallback_model, and provider_tier fields
to AgentDefinition for subscription-based model routing (ADR-002, ADR-003).
Add ProviderTier enum (Quick/Deep/Implementation/Oracle) with per-tier
timeout values. Add opencode CLI support in spawner arg inference.
All new fields are Optional with serde(default) for backward compatibility.
Fixes#28
Refs #29
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
In the context of the ADF agent fleet dispatching tasks to LLM providers via the opencode CLI, facing the discovery that the `opencode/` (Zen) provider prefix routes through a pay-per-use proxy with significant markup, we decided to ban the `opencode/` prefix entirely and route all agent dispatch through subscription-based providers, accepting that we must maintain multiple provider subscriptions.
13
+
14
+
## Decision Drivers
15
+
16
+
*`opencode/kimi-k2.5` via Zen costs significantly more than `opencode-go/kimi-k2.5` via Go subscription ($10/mo flat)
17
+
* The ADF fleet dispatches hundreds of requests daily -- per-request markup compounds rapidly
18
+
* All required models are available through subscription providers at predictable monthly costs
19
+
* Subscription providers already connected and verified in local `auth.json`
20
+
21
+
## Considered Options
22
+
23
+
***Option A**: Continue using `opencode/` (Zen) prefix for convenience
24
+
***Option B**: Ban `opencode/` prefix, use subscription providers only
25
+
***Option C**: Run local inference to avoid all provider costs
26
+
27
+
## Decision Outcome
28
+
29
+
**Chosen option**: Option B -- Ban `opencode/` prefix, subscription providers only
30
+
31
+
**Reasoning**: All required models (kimi-k2.5, glm-5, minimax-m2.5, k2p5) are available through subscription providers at predictable flat-rate costs. The Go subscription alone ($10/mo) covers 4 models with ~100K requests/mo for minimax. Adding a runtime guard in `terraphim_spawner` prevents accidental use of the expensive Zen proxy.
32
+
33
+
### Positive Consequences
34
+
35
+
* Predictable monthly costs across all providers
36
+
* No risk of unexpected per-request charges
37
+
* Runtime guard catches configuration errors before they incur cost
0 commit comments