shogun-model-list

All AI CLI tools × available models × required subscriptions × Bloom max capability. Reference table for choosing which models to use in multi-agent-shogun. Trigger: "model list", "what models", "model comparison", "which models can I use", "モデル一覧", "モデル比較", "どのモデルが使える"

/shogun-model-list — Model Capability Reference

Overview

Displays a complete reference table of all AI CLI tools, models, required subscriptions, and maximum Bloom cognitive level per model. Use this before configuring capability_tiers in config/settings.yaml.

When to Use

  • "What models can I use with my subscription?"
  • "Which model handles L5 tasks?"
  • "Compare Claude vs Codex model tiers"
  • "Show me all models" / "モデル一覧"
  • Before running /shogun-bloom-config to understand the landscape

Instructions

Output the reference tables below directly to the user. No tool calls required.


Bloom's Taxonomy — Quick Reference

| Level | Category | Task Examples |
| --- | --- | --- |
| L1 | Remember | File copy, template apply, data format |
| L2 | Understand | Summarize, explain, translate |
| L3 | Apply | Implement known patterns, generate boilerplate |
| L4 | Analyze | Debug, code review, root cause analysis |
| L5 | Evaluate | Architecture review, design trade-off judgment |
| L6 | Create | Novel architecture, requirements design, strategy |

Claude Code (Anthropic)

Subscription Plans

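| Plan | Monthly | Opus 4.6 | Sonnet 4.6 | Haiku 4.5 | Extended Thinking |
| --- | --- | --- | --- | --- | --- |
| Free | $0 | | | | |
| Pro | $20 | | | | |
| Max 5x | $100 | | | | |
| Max 20x | $200 | | | | |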

Pro, Max 5x, and Max 20x have the same model access. They differ only in usage quota: 5x and 20x are multiples of the Pro quota.

Claude Models × Bloom Capability

| Model | Bloom Max | Best For | Notes |
| --- | --- | --- | --- |
| claude-haiku-4-5-20251001 | L3 | High-volume L1-L3 tasks, fast responses | $1/$5/M; SWE-bench 73.3% (4pp below Sonnet 4.5); extended thinking available |
| claude-sonnet-4-6 | L5 | Code review, analysis, orchestration | Best balance: $3/$15/M; SWE-bench 79.6%; 1M context |
| claude-opus-4-6 | L6 | Novel design, strategy, architecture | $5/$25/M; SWE-bench 80.8% (only 1.2pp above Sonnet 4.6); use for true L6 only |
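
To compare the tiers on cost, the prices in the Notes column can be turned into a rough per-job estimate. The sketch below assumes the `$x/$y/M` figures mean USD per million input tokens and per million output tokens respectively, which the table does not state explicitly. `estimate_cost` and the token counts are hypothetical, chosen only for illustration.

```python
# Prices copied from the Notes column of the table above.
# ASSUMPTION: each pair is (USD per 1M input tokens, USD per 1M output tokens).
PRICES_PER_MILLION = {
    "claude-haiku-4-5-20251001": (1.0, 5.0),
    "claude-sonnet-4-6": (3.0, 15.0),
    "claude-opus-4-6": (5.0, 25.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return an approximate USD cost for one job on the given model."""
    in_price, out_price = PRICES_PER_MILLION[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical job size: 200k input tokens, 50k output tokens.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${estimate_cost(model, 200_000, 50_000):.2f}")
```

Under these assumptions the same job costs about $0.45 on Haiku, $1.35 on Sonnet, and $2.25 on Opus, which is the arithmetic behind reserving Opus for true L6 work.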

Extended Thinking (available on Pro and above): adds roughly one Bloom level of effective capability on complex reasoning tasks.
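
As a rough illustration of that rule of thumb, the sketch below treats the "~1 level" boost as exactly one level and caps the result at L6, the top of the taxonomy. Both simplifications are assumptions, and `effective_bloom` is a hypothetical helper, not part of multi-agent-shogun.

```python
# Bloom Max values copied from the Claude model table above.
BLOOM_MAX = {
    "claude-haiku-4-5-20251001": 3,
    "claude-sonnet-4-6": 5,
    "claude-opus-4-6": 6,
}

TOP_LEVEL = 6  # L6 (Create) is the highest Bloom level


def effective_bloom(model: str, extended_thinking: bool = False) -> int:
    """Return the model's Bloom level, with an assumed +1 for extended thinking."""
    bonus = 1 if extended_thinking else 0  # ASSUMPTION: "~1 level" treated as exactly 1
    return min(TOP_LEVEL, BLOOM_MAX[model] + bonus)


if __name__ == "__main__":
    for name in BLOOM_MAX:
        print(name, "->", f"L{effective_bloom(name, extended_thinking=True)}")
```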

Fixed Agent Assignments (Recommended)

| Agent | Recommended Model | Bloom Use | Reason |
| --- | --- | --- | --- |
| Shogun (You) | claude-opus-4-6 | L6 | Strategic decisions, final review |
| Karo (Manager) | claude-sonnet-4-6 | L4-L5 | Task orchestration; Opus is overkill here |
| Gunshi (Strategist) | claude-opus-4-6 | L5-L6 | Deep QC, architecture evaluation |
| Ashigaru 1–7 | Configured via capability_tiers | L1-L3 | Workers, routed by Bloom level |

OpenAI Codex CLI

Subscription Plans

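| Plan | Monthly | Spark | gpt-5.3-codex | codex-mini | codex-max |
| --- | --- | --- | --- | --- | --- |
| Free / Go ($8) | $0–$8 | ✗ (limited) | | | |
| Plus | $20 | ✗ (Pro only) | | | |
| Pro | $200 | | | | |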

gpt-5.3-codex-spark requires ChatGPT Pro ($200). ChatGPT Plus ($20) does NOT include Spark.

Codex Models × Bloom Capability

| Model | Bloom Max | Best For | Notes |
| --- | --- | --- | --- |
| gpt-5.3-codex-spark | L3 | High-volume L1-L3 tasks at 1000+ tok/sec | Separate quota from gpt-5.3-codex; blazing fast |
| gpt-5-codex-mini | L2 | Minimal quota usage for trivial tasks | Lightweight alternative to Spark |
| gpt-5.3-codex | L4 | Analysis, debugging, code review | Standard workhorse |
| gpt-5.1-codex-max | L5 | Complex analysis, design evaluation | Highest Codex capability |

L6 gap: No Codex model reliably handles novel creative design (L6). For L6 tasks, Claude Opus is recommended.


Capability Summary (All Models, Cross-CLI)

| Model | CLI | Bloom Max | Min Subscription | Notes |
| --- | --- | --- | --- | --- |
| gpt-5-codex-mini | Codex CLI | L2 | ChatGPT Plus | Lightweight, minimal quota |
| claude-haiku-4-5-20251001 | Claude Code | L3 | Claude Free | Best Claude cost-efficiency; SWE-bench 73.3% |
| gpt-5.3-codex-spark | Codex CLI | L3 | ChatGPT Pro | 1000+ tok/s; Terminal-Bench 58.4% |
| gpt-5.3-codex | Codex CLI | L4 | ChatGPT Plus | Terminal-Bench 77.3%; 400K+ context |
| claude-sonnet-4-6 | Claude Code | L5 | Claude Free | $3/$15/M; SWE-bench 79.6%; 1M context; math +27pt vs Sonnet 4.5 |
| gpt-5.1-codex-max | Codex CLI | L5 | ChatGPT Plus | Highest Codex capability |
| claude-opus-4-6 | Claude Code | L6 | Claude Pro | $5/$25/M; SWE-bench 80.8%; reserve for true L6 tasks |
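
One way to read the summary table is as a lookup: given a task's Bloom level and the subscriptions you hold, list the models that qualify. The sketch below is a minimal illustration of that lookup, not how multi-agent-shogun routes tasks. The plan ordering in `TIERS` (a higher plan includes everything a lower one does) and the helper names `covers` and `candidates` are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cli: str
    bloom_max: int
    min_subscription: str


# Rows copied from the summary table above, kept in ascending Bloom Max order.
MODELS = [
    Model("gpt-5-codex-mini", "Codex CLI", 2, "ChatGPT Plus"),
    Model("claude-haiku-4-5-20251001", "Claude Code", 3, "Claude Free"),
    Model("gpt-5.3-codex-spark", "Codex CLI", 3, "ChatGPT Pro"),
    Model("gpt-5.3-codex", "Codex CLI", 4, "ChatGPT Plus"),
    Model("claude-sonnet-4-6", "Claude Code", 5, "Claude Free"),
    Model("gpt-5.1-codex-max", "Codex CLI", 5, "ChatGPT Plus"),
    Model("claude-opus-4-6", "Claude Code", 6, "Claude Pro"),
]

# ASSUMPTION: within a vendor, each plan includes the access of the plans before it.
TIERS = {
    "Claude": ["Free", "Pro", "Max 5x", "Max 20x"],
    "ChatGPT": ["Free", "Plus", "Pro"],
}


def covers(held: str, required: str) -> bool:
    """True if the held plan (e.g. 'Claude Pro') satisfies the required plan."""
    held_vendor, held_tier = held.split(" ", 1)
    req_vendor, req_tier = required.split(" ", 1)
    if held_vendor != req_vendor:
        return False
    order = TIERS[held_vendor]
    return order.index(held_tier) >= order.index(req_tier)


def candidates(bloom_level: int, subscriptions: list[str]) -> list[str]:
    """Names of models that reach bloom_level and are unlocked by any held plan."""
    return [
        m.name
        for m in MODELS
        if m.bloom_max >= bloom_level
        and any(covers(s, m.min_subscription) for s in subscriptions)
    ]


if __name__ == "__main__":
    print(candidates(4, ["Claude Pro"]))    # Claude models rated L4 or higher
    print(candidates(3, ["ChatGPT Plus"]))  # Spark is excluded: it needs ChatGPT Pro
    print(candidates(6, ["ChatGPT Pro"]))   # empty list: the L6 gap on the Codex side
```

Because `MODELS` is sorted by Bloom Max, the first entry of the result is the least capable model that still meets the requested level, which is usually the one to try first for quota reasons.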

Next Step

To generate a ready-to-paste capability_tiers YAML for your subscription:

/shogun-bloom-config

Or tell the Shogun: "set up capability tiers for my subscription"

Repository
yohey-w/multi-agent-shogun