Iterative skill tuning via execute-evaluate-improve feedback loop. Uses ccw cli Claude to execute skill, Gemini to evaluate quality, and Agent to apply improvements. Iterates until quality threshold or max iterations. Triggers on "skill iter tune", "iterative skill tuning", "tune skill".
78
73%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Critical
Do not install without reviewing
Optimize this skill with Tessl
npx tessl skill review --optimize ./.claude/skills/skill-iter-tune/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description that clearly communicates a specific multi-agent workflow for iterative skill tuning. It names concrete tools and roles (ccw cli Claude for execution, Gemini for evaluation, Agent for improvements), provides explicit trigger phrases, and describes the termination conditions. The description is concise yet comprehensive, making it easy for Claude to distinguish this skill from others.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: execute skill via ccw cli Claude, evaluate quality via Gemini, apply improvements via Agent, and iterate until quality threshold or max iterations. The execute-evaluate-improve feedback loop is clearly described. | 3 / 3 |
Completeness | Clearly answers both 'what' (iterative skill tuning via execute-evaluate-improve feedback loop using specific tools) and 'when' (explicit triggers listed: 'skill iter tune', 'iterative skill tuning', 'tune skill'). | 3 / 3 |
Trigger Term Quality | Includes explicit trigger phrases users would say: 'skill iter tune', 'iterative skill tuning', 'tune skill'. These are natural terms a user would use when wanting this specific workflow. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a very specific niche: iterative skill tuning using a particular multi-agent pipeline (ccw cli Claude, Gemini, Agent). The trigger terms are unique and unlikely to conflict with other skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
47%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill demonstrates excellent workflow design with clear phase sequencing, termination logic, and error recovery, but suffers significantly from verbosity — it includes extensive inline implementation detail (architecture diagrams, pseudocode, TodoWrite patterns) that should either be in referenced phase files or dramatically condensed. The actionability is moderate since the code is illustrative rather than executable, and the actual execution logic is deferred to phase files that aren't provided in the bundle.
Suggestions
Reduce the SKILL.md to a concise orchestrator overview (~80-100 lines) by moving the TodoWrite pattern, termination logic pseudocode, and coordinator checklist into the referenced phase files or a separate implementation-details.md
Replace the two large ASCII architecture diagrams with a brief 3-4 line textual description of the flow (execute→evaluate→improve loop until threshold/max/convergence)
Include the actual bundle phase files (phases/01-setup.md through phases/05-report.md) so the progressive disclosure structure is complete and the skill is fully actionable
Make the ccw cli invocations concrete with actual executable command examples rather than pseudocode placeholders like `ccw cli -p "..." --tool claude --mode write`
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Extremely verbose at ~300+ lines. Contains extensive architecture diagrams, JavaScript pseudocode for orchestration logic, TodoWrite patterns, and coordinator checklists that could be dramatically condensed. Much of this (termination logic, error handling tables, data flow diagrams) is implementation detail that bloats the orchestrator document. The ASCII art diagrams, while visually appealing, consume significant tokens for information that could be stated in a few lines. | 1 / 3 |
Actionability | Provides concrete pseudocode for the iteration loop, termination logic, and preference collection, but none of it is truly executable — it's illustrative JavaScript that references undefined functions (AskUserQuestion, TaskCreate, TaskUpdate) and tools (ccw cli) without complete invocation syntax. The actual execution details are deferred to phase files that aren't provided, making the skill partially actionable but incomplete on its own. | 2 / 3 |
Workflow Clarity | The multi-step workflow is exceptionally well-sequenced with clear phase ordering (1→2→3→4→5), explicit termination conditions (threshold, max iterations, convergence, error limit), error recovery strategies per phase with retry/rollback, and a coordinator checklist. Validation checkpoints are built into the loop (Phase 3 evaluation before proceeding, snapshot before execute, convergence detection). | 3 / 3 |
Progressive Disclosure | The skill correctly references phase files (phases/01-setup.md through phases/05-report.md) with a clear table and one-level-deep references, which is good structure. However, no bundle files are provided, so the referenced phase documents don't exist for evaluation. Additionally, the SKILL.md itself contains too much inline detail (full pseudocode for termination, TodoWrite patterns, error handling) that should live in the phase files, undermining the progressive disclosure principle. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
Total | 10 / 11 Passed | |
227244f
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.