Optimize token usage and context management. Use when sessions feel slow, context is degraded, or you're running out of budget.
63
53%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/context-optimizer/SKILL.md
Quality
Discovery
57%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has good structural completeness with explicit 'Use when' triggers, but suffers from vague capability language that doesn't specify concrete actions the skill teaches. The trigger terms are reasonable but could be expanded with more natural user language variations. The lack of specific actions makes it hard to distinguish from other optimization-related skills.
Suggestions
Replace 'Optimize token usage and context management' with specific concrete actions, e.g., 'Summarize conversation history, prune unused context, compress prompts, and manage context window limits to reduce token consumption.'
Expand trigger terms to include common user phrases like 'token limit', 'context window full', 'too expensive', 'long conversation', 'rate limit', or 'running out of context'.
Clarify what 'context is degraded' means with more specific symptoms users would recognize, such as 'Claude is forgetting earlier instructions' or 'responses are losing coherence'.
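The trigger-term suggestion above can be checked mechanically. The following is a minimal sketch, not part of the Tessl tooling: the helper name `missing_trigger_terms` and the `SUGGESTED_TERMS` list are hypothetical, and the terms are simply the phrases quoted in the suggestion. It reports which of those phrases a description does not yet contain.

```python
# Hypothetical helper (not a Tessl API): list the suggested trigger
# phrases that a skill description does not yet contain.
SUGGESTED_TERMS = [
    "token limit",
    "context window full",
    "too expensive",
    "long conversation",
    "rate limit",
    "running out of context",
]


def missing_trigger_terms(description, terms=SUGGESTED_TERMS):
    """Return the terms absent from the description (case-insensitive substring match)."""
    lowered = description.lower()
    return [term for term in terms if term not in lowered]


# The description as it currently appears at the top of this page.
current_description = (
    "Optimize token usage and context management. Use when sessions feel slow, "
    "context is degraded, or you're running out of budget."
)

for term in missing_trigger_terms(current_description):
    print("missing:", term)
```

A plain substring match is deliberately crude; it would not credit paraphrases such as "hit the limit on tokens", so treat the result as a prompt for review rather than a score.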
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | The description uses vague language like 'optimize token usage and context management' without listing any concrete actions. What specific steps does this skill teach? There are no actionable capabilities described (e.g., 'summarize conversation history', 'prune unused context', 'compress prompts'). | 1 / 3 |
| Completeness | The description does answer both 'what' (optimize token usage and context management) and 'when' (sessions feel slow, context is degraded, or running out of budget) with explicit trigger conditions. Despite the vagueness of the 'what', the structure is complete. | 3 / 3 |
| Trigger Term Quality | Includes some relevant terms like 'token usage', 'context', 'budget', and 'slow', which users might naturally mention. However, it misses common variations like 'rate limit', 'token limit', 'context window', 'long conversation', 'expensive', 'cost', or 'truncated'. | 2 / 3 |
| Distinctiveness Conflict Risk | 'Context management' and 'optimize' are somewhat broad terms that could overlap with performance tuning, prompt engineering, or cost optimization skills. The triggers 'sessions feel slow' and 'running out of budget' add some specificity but 'context is degraded' is vague enough to conflict with other skills. | 2 / 3 |
| Total | | 8 / 12 Passed |
Implementation
50%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides a solid reference card for context management with good use of tables and structured formatting. Its main weaknesses are that it mixes genuinely useful project-specific guidance (auto-compaction config, MCP limits, CLAUDE.md sizing) with behavioral advice Claude likely already knows (don't be sycophantic, read before write), and several sections lack the concrete executable detail needed for top actionability scores. The document would benefit from tighter focus on the non-obvious, actionable items.
Suggestions
Remove or drastically shorten the 'Output Reduction' and 'Behavioral Efficiency' subsections—these describe behaviors Claude should already know or that belong in system-level configuration, not a skill file.
Add concrete syntax/examples for subagent delegation (e.g., how to actually invoke a subagent with a specific task) rather than just listing use cases.
Add a verification step to the 'When Context Is Degraded' fix workflow—e.g., 'Run /context again to confirm usage dropped below 70%' after compaction.
Consider splitting 'Token Efficiency' and 'CLAUDE.md Optimization' into separate reference files and linking to them from the main skill, keeping SKILL.md as a concise overview.
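The verification step suggested above amounts to a simple before/after comparison. As a rough sketch only: the function name `compaction_verified` is hypothetical, the usage fractions are assumed to be read by hand from `/context` before and after compaction (the output format of that command is not shown on this page, so nothing is parsed here), and the 70% threshold is the example figure from the suggestion.

```python
def compaction_verified(usage_before, usage_after, threshold=0.70):
    """Return True when compaction both reduced context usage and brought it under the threshold.

    usage_before / usage_after are fractions of the context window (0.0 to 1.0),
    assumed to be noted manually from /context before and after compacting.
    """
    if not (0.0 <= usage_before <= 1.0 and 0.0 <= usage_after <= 1.0):
        raise ValueError("usage values must be fractions between 0.0 and 1.0")
    return usage_after < usage_before and usage_after < threshold


# Example: usage fell from 88% to 41%, so the fix can be considered confirmed.
print(compaction_verified(0.88, 0.41))
# Example: usage only fell to 75%, so the next fix step is still needed.
print(compaction_verified(0.88, 0.75))
```

Requiring both conditions keeps a small drop (say 88% to 85%) from being counted as a successful fix.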
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | Generally efficient with good use of tables and bullets, but some sections like 'Prompt Engineering for Token Efficiency' and 'Subagent Delegation' explain concepts Claude likely already understands. The 'Token Efficiency > Output Reduction' section is somewhat ironic: telling Claude not to be sycophantic is something it should already know from system prompts. Some items like 'ASCII only: -- not em dashes' are useful project-specific conventions though. | 2 / 3 |
| Actionability | Provides concrete commands (/compact, /context, /mcp, /resume) and a specific JSON config snippet, which is good. However, much of the content is advisory rather than executable: 'use subagents for exploration,' 'scope your prompts,' 'one-pass coding' are behavioral guidelines rather than concrete, copy-paste-ready instructions. The subagent delegation section lacks any actual syntax for how to invoke a subagent. | 2 / 3 |
| Workflow Clarity | The 'Quick Diagnosis' section provides a clear 3-step triage workflow, and the 'Context Budget Planning' table gives phase-based guidance. However, the 'When Context Is Degraded' fix steps lack validation checkpoints: there's no way to verify that compaction actually helped before moving to the next step. The overall document reads more as a reference card than a sequenced workflow. | 2 / 3 |
| Progressive Disclosure | Content is well-organized with clear headers and sections, making it easy to scan. However, the document is fairly long (~100 lines of content) and could benefit from splitting detailed sections (Token Efficiency, CLAUDE.md Optimization) into separate reference files. There are no cross-references to external files for deeper dives on any topic. | 2 / 3 |
| Total | | 8 / 12 Passed |
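The totals in the two rubric tables are plain sums of the per-dimension scores, which the short sketch below reproduces. The helper name `tally` is hypothetical; the score strings are copied from the tables. Note that both sections total 8 / 12 while their headline percentages differ (57% and 50%), so those percentages evidently come from some weighting that this page does not state, and the sketch does not attempt to reproduce them.

```python
def tally(scores):
    """Sum 'earned / possible' score strings into an (earned, possible) pair."""
    earned = 0
    possible = 0
    for value in scores.values():
        got, out_of = value.split("/")
        earned += int(got)       # int() tolerates the surrounding spaces
        possible += int(out_of)
    return earned, possible


# Per-dimension scores as listed in the Discovery and Implementation tables.
discovery = {
    "Specificity": "1 / 3",
    "Completeness": "3 / 3",
    "Trigger Term Quality": "2 / 3",
    "Distinctiveness Conflict Risk": "2 / 3",
}
implementation = {
    "Conciseness": "2 / 3",
    "Actionability": "2 / 3",
    "Workflow Clarity": "2 / 3",
    "Progressive Disclosure": "2 / 3",
}

print(tally(discovery))       # (8, 12)
print(tally(implementation))  # (8, 12)
```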
Validation
100%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
9df6af8