Describe the feature or problem you'd like to solve
Gemma 4 like feature
Proposed solution
Looking like Gemma feature, Please see below benefits that will help Github as well
0-Token Intent Classification
Every time you type a prompt, Gemini CLI must decide if it needs the "Big Brain" (Gemini Pro) or the "Fast Brain" (Gemini Flash).
- Without Gemma 3: The CLI sends your prompt to the cloud just to ask "Is this a hard question?" (Costs tokens/latency).
- With Gemma 3: It decides locally on your machine in milliseconds. Cost: 0 tokens.
- Offloading "Boring" Tasks
Gemma 3 identifies tasks that don't need expensive AI:
- SQL Formatting: If you ask to format ERPCRP2.sql, Gemma routes this to a local formatter or a tiny model.
- File Reading: "List all columns in this CSV" is handled by the "Fast Brain" (Flash) because Gemma flagged it as simple.
- Result: You save your Gemini Pro quota for the hard stuff—like tracing that htimObjMgr log or debugging integration errors.
Example prompts or workflows
No response
Additional context
No response
Describe the feature or problem you'd like to solve
Gemma 4 like feature
Proposed solution
Looking like Gemma feature, Please see below benefits that will help Github as well
0-Token Intent Classification
Every time you type a prompt, Gemini CLI must decide if it needs the "Big Brain" (Gemini Pro) or the "Fast Brain" (Gemini Flash).
Gemma 3 identifies tasks that don't need expensive AI:
Example prompts or workflows
No response
Additional context
No response