skill-iter-tune

Iterative skill tuning via execute-evaluate-improve feedback loop. Uses ccw cli Claude to execute skill, Gemini to evaluate quality, and Agent to apply improvements. Iterates until quality threshold or max iterations. Triggers on "skill iter tune", "iterative skill tuning", "tune skill".

Quality

73%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Critical

Do not install without reviewing

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/skill-iter-tune/SKILL.md

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that clearly communicates a specific, well-defined capability with explicit trigger terms. It names the concrete tools and workflow steps involved (execute via ccw cli Claude, evaluate via Gemini, improve via Agent) and provides clear 'when to use' guidance. The description is concise yet comprehensive, making it easy for Claude to distinguish this skill from others.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: execute skill via ccw cli Claude, evaluate quality via Gemini, apply improvements via Agent, and iterate until quality threshold or max iterations. The execute-evaluate-improve feedback loop is clearly described.	3 / 3
Completeness	Clearly answers both 'what' (iterative skill tuning via execute-evaluate-improve feedback loop using specific tools) and 'when' (explicit triggers listed: 'skill iter tune', 'iterative skill tuning', 'tune skill').	3 / 3
Trigger Term Quality	Includes explicit trigger phrases users would say: 'skill iter tune', 'iterative skill tuning', 'tune skill'. These are natural terms a user would use when wanting this specific functionality.	3 / 3
Distinctiveness Conflict Risk	Highly distinctive with a very specific niche: iterative skill tuning using a particular multi-agent pipeline (ccw cli Claude, Gemini, Agent). The trigger terms are unique and unlikely to conflict with other skills.	3 / 3
	Total	12 / 12 Passed

Implementation

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill has excellent workflow clarity with well-defined phases, termination conditions, and error recovery, making the complex iterative process navigable. However, it is severely over-verbose — the orchestrator document contains extensive implementation pseudocode, ASCII diagrams, and detailed patterns that inflate token cost without proportional value. The content would benefit from aggressive trimming, moving implementation details into the referenced phase files and keeping SKILL.md as a lean coordination overview.

Suggestions

Reduce the SKILL.md to a lean orchestrator overview (~80-100 lines) by moving termination logic, TodoWrite patterns, error handling tables, and coordinator checklists into the phase reference files or a separate REFERENCE.md.

Remove or drastically simplify the ASCII architecture diagrams — replace with a 3-4 line textual description of the execute→evaluate→improve loop.

Move the full Interactive Preference Collection JavaScript block into phases/01-setup.md since it's setup-phase implementation detail, and summarize it in one line in SKILL.md.

Consolidate the two separate data flow representations (Architecture Overview diagram + Data Flow section) into a single concise representation to eliminate redundancy.

Dimension	Reasoning	Score
Conciseness	Extremely verbose at ~300+ lines. Contains extensive architecture diagrams, JavaScript pseudocode for orchestration logic, TodoWrite patterns, and coordinator checklists that could be dramatically condensed. Much of this (termination logic, error handling tables, data flow diagrams) is implementation detail that bloats the orchestrator document. The ASCII art diagrams, while visually appealing, consume significant tokens for information that could be stated in a few lines.	1 / 3
Actionability	Provides concrete pseudocode for the iteration loop, termination logic, and preference collection, plus specific CLI commands (ccw cli flags). However, most code is illustrative JavaScript pseudocode rather than executable code — the actual execution details are deferred to phase files (phases/01-setup.md through phases/05-report.md) which are not provided. Key details like exact ccw cli invocation syntax and artifact handling are incomplete.	2 / 3
Workflow Clarity	The multi-step workflow is exceptionally well-sequenced with clear phase ordering (1→2→3→4→5), explicit termination conditions (quality threshold, max iterations, convergence, error limit), error recovery strategies per phase with retry/rollback, and a coordinator checklist with validation checkpoints. The iteration loop has clear feedback loops (evaluate → improve → re-execute).	3 / 3
Progressive Disclosure	References five phase documents (phases/01-05) with a clear table mapping phase to document and purpose, which is good structure. However, since no bundle files are provided, we cannot verify these references exist. More importantly, the SKILL.md itself contains too much inline detail (full termination logic, TodoWrite patterns, error handling tables, coordinator checklists) that should arguably live in the phase files or a separate reference, making the overview document bloated rather than being a lean orchestrator overview.	2 / 3
	Total	8 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning

	Total	10 / 11 Passed

Repository: catlog22/Claude-Code-Workflow
Commit: 5ff5e86

Reviewed: 16 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.