Iterative skill tuning via execute-evaluate-improve feedback loop. Uses ccw cli Claude to execute skill, Gemini to evaluate quality, and Agent to apply improvements. Iterates until quality threshold or max iterations. Triggers on "skill iter tune", "iterative skill tuning", "tune skill".
62
73%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Critical
Do not install without reviewing
Optimize this skill with Tessl
npx tessl skill review --optimize ./.claude/skills/skill-iter-tune/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly communicates a specific, well-defined capability with explicit trigger terms. It names the concrete tools and workflow steps involved (execute via ccw cli Claude, evaluate via Gemini, improve via Agent) and provides clear 'when to use' guidance. The description is concise yet comprehensive, making it easy for Claude to distinguish this skill from others.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: execute skill via ccw cli Claude, evaluate quality via Gemini, apply improvements via Agent, and iterate until quality threshold or max iterations. The execute-evaluate-improve feedback loop is clearly described. | 3 / 3 |
Completeness | Clearly answers both 'what' (iterative skill tuning via execute-evaluate-improve feedback loop using specific tools) and 'when' (explicit triggers listed: 'skill iter tune', 'iterative skill tuning', 'tune skill'). | 3 / 3 |
Trigger Term Quality | Includes explicit trigger phrases users would say: 'skill iter tune', 'iterative skill tuning', 'tune skill'. These are natural terms a user would use when wanting this specific functionality. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a very specific niche: iterative skill tuning using a particular multi-agent pipeline (ccw cli Claude, Gemini, Agent). The trigger terms are unique and unlikely to conflict with other skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
47%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill has excellent workflow clarity with well-defined phases, termination conditions, and error recovery, making the complex iterative process navigable. However, it is severely over-verbose — the orchestrator document contains extensive implementation pseudocode, ASCII diagrams, and detailed patterns that inflate token cost without proportional value. The content would benefit from aggressive trimming, moving implementation details into the referenced phase files and keeping SKILL.md as a lean coordination overview.
Suggestions
Reduce the SKILL.md to a lean orchestrator overview (~80-100 lines) by moving termination logic, TodoWrite patterns, error handling tables, and coordinator checklists into the phase reference files or a separate REFERENCE.md.
Remove or drastically simplify the ASCII architecture diagrams — replace with a 3-4 line textual description of the execute→evaluate→improve loop.
Move the full Interactive Preference Collection JavaScript block into phases/01-setup.md since it's setup-phase implementation detail, and summarize it in one line in SKILL.md.
Consolidate the two separate data flow representations (Architecture Overview diagram + Data Flow section) into a single concise representation to eliminate redundancy.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Extremely verbose at ~300+ lines. Contains extensive architecture diagrams, JavaScript pseudocode for orchestration logic, TodoWrite patterns, and coordinator checklists that could be dramatically condensed. Much of this (termination logic, error handling tables, data flow diagrams) is implementation detail that bloats the orchestrator document. The ASCII art diagrams, while visually appealing, consume significant tokens for information that could be stated in a few lines. | 1 / 3 |
Actionability | Provides concrete pseudocode for the iteration loop, termination logic, and preference collection, plus specific CLI commands (ccw cli flags). However, most code is illustrative JavaScript pseudocode rather than executable code — the actual execution details are deferred to phase files (phases/01-setup.md through phases/05-report.md) which are not provided. Key details like exact ccw cli invocation syntax and artifact handling are incomplete. | 2 / 3 |
Workflow Clarity | The multi-step workflow is exceptionally well-sequenced with clear phase ordering (1→2→3→4→5), explicit termination conditions (quality threshold, max iterations, convergence, error limit), error recovery strategies per phase with retry/rollback, and a coordinator checklist with validation checkpoints. The iteration loop has clear feedback loops (evaluate → improve → re-execute). | 3 / 3 |
Progressive Disclosure | References five phase documents (phases/01-05) with a clear table mapping phase to document and purpose, which is good structure. However, since no bundle files are provided, we cannot verify these references exist. More importantly, the SKILL.md itself contains too much inline detail (full termination logic, TodoWrite patterns, error handling tables, coordinator checklists) that should arguably live in the phase files or a separate reference, making the overview document bloated rather than being a lean orchestrator overview. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
Total | 10 / 11 Passed | |
5ff5e86
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.