All AI CLI tools × available models × required subscriptions × Bloom max capability. Reference table for choosing which models to use in multi-agent-shogun. Trigger: "model list", "what models", "model comparison", "which models can I use", "モデル一覧", "モデル比較", "どのモデルが使える"
Displays a complete reference table of all AI CLI tools, models, required subscriptions,
and maximum Bloom cognitive level per model. Use this before configuring capability_tiers
in config/settings.yaml.
Run it before /shogun-bloom-config to understand the landscape.

Output the reference tables below directly to the user. No tool calls required.
| Level | Category | Task Examples |
|---|---|---|
| L1 | Remember | File copy, template apply, data format |
| L2 | Understand | Summarize, explain, translate |
| L3 | Apply | Implement known patterns, generate boilerplate |
| L4 | Analyze | Debug, code review, root cause analysis |
| L5 | Evaluate | Architecture review, design trade-off judgment |
| L6 | Create | Novel architecture, requirements design, strategy |

Claude subscription plans:

| Plan | Monthly | Opus 4.6 | Sonnet 4.6 | Haiku 4.5 | Extended Thinking |
|---|---|---|---|---|---|
| Free | $0 | ✗ | ✓ | ✓ | ✗ |
| Pro | $20 | ✓ | ✓ | ✓ | ✓ |
| Max 5x | $100 | ✓ | ✓ | ✓ | ✓ |
| Max 20x | $200 | ✓ | ✓ | ✓ | ✓ |
Pro, Max 5x, and Max 20x have the same model access. They differ only in usage quota (5x and 20x are multiples of the Pro quota).
| Model | Bloom Max | Best For | Notes |
|---|---|---|---|
| claude-haiku-4-5-20251001 | L3 | High-volume L1-L3 tasks, fast responses | $1/$5/M; SWE-bench 73.3% (4pp below Sonnet 4.5); extended thinking available |
| claude-sonnet-4-6 | L5 | Code review, analysis, orchestration | Best balance; $3/$15/M; SWE-bench 79.6%; 1M context |
| claude-opus-4-6 | L6 | Novel design, strategy, architecture | $5/$25/M; SWE-bench 80.8% (only 1.2pp above Sonnet 4.6); use for true L6 only |
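
To make the price gap in the Notes column concrete, here is a minimal sketch that estimates per-request cost. It assumes the "$X/$Y/M" notation means input and output price in USD per million tokens. The function name `estimate_cost` and the example workload are hypothetical, used only for illustration.

```python
# Prices copied from the Notes column above.
# Assumption: "$X/$Y/M" = (input, output) USD per million tokens.
PRICES_PER_MILLION = {
    "claude-haiku-4-5-20251001": (1.0, 5.0),
    "claude-sonnet-4-6": (3.0, 15.0),
    "claude-opus-4-6": (5.0, 25.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200k input tokens and 20k output tokens.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${estimate_cost(model, 200_000, 20_000):.2f}")
```

Under these assumptions the same workload costs five times more on Opus than on Haiku, which is why the table reserves Opus for true L6 tasks.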
Extended Thinking (available Pro+): Adds ~1 Bloom level of effective capability on complex reasoning tasks.
| Agent | Recommended Model | Bloom Use | Reason |
|---|---|---|---|
| Shogun (You) | claude-opus-4-6 | L6 | Strategic decisions, final review |
| Karo (Manager) | claude-sonnet-4-6 | L4-L5 | Task orchestration; Opus is overkill here |
| Gunshi (Strategist) | claude-opus-4-6 | L5-L6 | Deep QC, architecture evaluation |
| Ashigaru 1–7 | Configured via capability_tiers | L1-L3 | Workers — routed by Bloom level |

Codex CLI (ChatGPT) subscription plans:

| Plan | Monthly | Spark | gpt-5.3-codex | codex-mini | codex-max |
|---|---|---|---|---|---|
| Free / Go ($8) | $0–$8 | ✗ | ✗ (limited) | ✗ | ✗ |
| Plus | $20 | ✗ (Pro only) | ✓ | ✓ | ✓ |
| Pro | $200 | ✓ | ✓ | ✓ | ✓ |
gpt-5.3-codex-spark requires ChatGPT Pro ($200). ChatGPT Plus ($20) does NOT include Spark.
| Model | Bloom Max | Best For | Notes |
|---|---|---|---|
| gpt-5.3-codex-spark | L3 | High-volume L1-L3 tasks at 1000+ tok/sec | Separate quota from gpt-5.3-codex; very fast |
| gpt-5-codex-mini | L2 | Minimal quota usage for trivial tasks | Lightweight alternative to Spark |
| gpt-5.3-codex | L4 | Analysis, debugging, code review | Standard workhorse |
| gpt-5.1-codex-max | L5 | Complex analysis, design evaluation | Highest Codex capability |
L6 gap: No Codex model reliably handles novel creative design (L6). For L6 tasks, Claude Opus is recommended.
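
The routing implied by the Codex table and the L6 gap can be sketched as follows. This is only an illustration: the "lowest sufficient Bloom Max" selection policy and the names `pick_model` and `L6_FALLBACK` are assumptions, not part of multi-agent-shogun.

```python
# Bloom Max values copied from the Codex model table above.
CODEX_BLOOM_MAX = {
    "gpt-5-codex-mini": 2,
    "gpt-5.3-codex-spark": 3,
    "gpt-5.3-codex": 4,
    "gpt-5.1-codex-max": 5,
}

# No Codex model covers L6, so fall back to Claude Opus as recommended above.
L6_FALLBACK = "claude-opus-4-6"


def pick_model(task_level: int) -> str:
    """Pick the Codex model with the lowest sufficient Bloom Max (assumed policy)."""
    if not 1 <= task_level <= 6:
        raise ValueError(f"Bloom level must be 1-6, got {task_level}")
    candidates = [
        (bloom_max, name)
        for name, bloom_max in CODEX_BLOOM_MAX.items()
        if bloom_max >= task_level
    ]
    if not candidates:
        return L6_FALLBACK
    # min() compares the Bloom Max first, so the weakest adequate model wins.
    return min(candidates)[1]


if __name__ == "__main__":
    for level in range(1, 7):
        print(f"L{level} -> {pick_model(level)}")
```

Choosing the weakest adequate model keeps quota on the stronger models free for the tasks that need them. A setup that prioritizes speed over quota might route L1-L3 to Spark instead.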
| Model | CLI | Bloom Max | Min Subscription | Notes |
|---|---|---|---|---|
| gpt-5-codex-mini | Codex CLI | L2 | ChatGPT Plus | Lightweight, minimal quota |
| claude-haiku-4-5-20251001 | Claude Code | L3 | Claude Free | Best Claude cost-efficiency; SWE-bench 73.3% |
| gpt-5.3-codex-spark | Codex CLI | L3 | ChatGPT Pro | 1000+ tok/s; Terminal-Bench 58.4% |
| gpt-5.3-codex | Codex CLI | L4 | ChatGPT Plus | Terminal-Bench 77.3%; 400K+ context |
| claude-sonnet-4-6 | Claude Code | L5 | Claude Free | $3/$15/M; SWE-bench 79.6%; 1M context; math +27pt vs Sonnet 4.5 |
| gpt-5.1-codex-max | Codex CLI | L5 | ChatGPT Plus | Highest Codex capability |
| claude-opus-4-6 | Claude Code | L6 | Claude Pro | $5/$25/M; SWE-bench 80.8%; reserve for true L6 tasks |
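
As a rough illustration of how the Min Subscription column narrows the choice, the sketch below filters the combined table by the plans a user holds. The plan ladders, the `available_models` helper, and the tuple layout are assumptions made for this example. It does not reproduce the real capability_tiers schema, and the ChatGPT Go tier is omitted for simplicity.

```python
from typing import Optional

# Plan ladders, ordered from lowest to highest tier (taken from the plan tables).
CLAUDE_PLANS = ["Free", "Pro", "Max 5x", "Max 20x"]
CHATGPT_PLANS = ["Free", "Plus", "Pro"]

# (model, provider, Bloom Max, minimum plan), copied from the combined table.
MODELS = [
    ("gpt-5-codex-mini", "chatgpt", 2, "Plus"),
    ("claude-haiku-4-5-20251001", "claude", 3, "Free"),
    ("gpt-5.3-codex-spark", "chatgpt", 3, "Pro"),
    ("gpt-5.3-codex", "chatgpt", 4, "Plus"),
    ("claude-sonnet-4-6", "claude", 5, "Free"),
    ("gpt-5.1-codex-max", "chatgpt", 5, "Plus"),
    ("claude-opus-4-6", "claude", 6, "Pro"),
]


def available_models(
    claude_plan: Optional[str], chatgpt_plan: Optional[str]
) -> list[str]:
    """List models usable with the given plans; None means no account."""
    held = {
        "claude": (CLAUDE_PLANS, claude_plan),
        "chatgpt": (CHATGPT_PLANS, chatgpt_plan),
    }
    result = []
    for name, provider, _bloom_max, min_plan in MODELS:
        ladder, plan = held[provider]
        # A model is usable when the held plan ranks at or above its minimum.
        if plan is not None and ladder.index(plan) >= ladder.index(min_plan):
            result.append(name)
    return result


if __name__ == "__main__":
    print(available_models("Pro", "Plus"))
```

With Claude Pro plus ChatGPT Plus, every model except gpt-5.3-codex-spark is returned, which matches the note that Spark requires ChatGPT Pro.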
To generate a ready-to-paste capability_tiers YAML for your subscription, run:

/shogun-bloom-config

Or tell the Shogun: "set up capability tiers for my subscription"