workflow-test-fix

Unified test-fix pipeline combining test generation (session, context, analysis, task gen) with iterative test-cycle execution (adaptive strategy, progressive testing, CLI fallback). Triggers on "workflow-test-fix", "test fix workflow".

Quality

35%

Does it follow best practices?

Run evals on this skill

Adds up to 20 points to the overall score

View guide

Securityby

Critical

Do not install without reviewing

Fix and improve this skill with Tessl

tessl review fix ./.claude/skills/workflow-test-fix/SKILL.md

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill attempts to be a comprehensive orchestrator document but suffers from severe verbosity and redundancy—the same 5-phase pipeline is described in at least 4 different representations (architecture diagram, execution flow, data flow, coordinator checklist). While the workflow structure is well-conceived with progressive phase loading and adaptive strategies, the SKILL.md contradicts its own 'pure coordinator' principle by inlining extensive implementation details that belong in phase files. The lack of truly executable code and the absence of bundle files further weaken its practical utility.

Suggestions

Reduce redundancy by consolidating the architecture diagram, execution flow, data flow, and coordinator checklist into a single authoritative representation—pick one format and remove the others.

Move implementation details (TodoWrite JSON examples, session file structure, error handling tables, strategy engine specifics) into the respective phase files, keeping SKILL.md as the lean coordinator it claims to be.

Replace the conceptual JavaScript snippet in Section 4 with actual tool invocation syntax that Claude can execute, or remove it entirely if the phase files handle preference collection.

Add concrete validation commands in the coordinator checklist (e.g., specific file existence checks, JSON schema validation) rather than vague 'Verify X created' instructions.

Dimension	Reasoning	Score
Conciseness	Extremely verbose at ~400+ lines with massive redundancy. The architecture diagram, data flow, execution flow, and coordinator checklist all describe the same pipeline multiple times. The TodoWrite pattern section shows three full JSON examples that repeat similar information. Much content (e.g., explaining what PDF-like concepts are to Claude, the detailed ASCII diagrams) could be drastically condensed. The summary output template in Section 10 is repeated conceptually in multiple other sections.	1 / 3
Actionability	The skill provides some concrete guidance—file paths, phase document references, JSON structures, and CLI flags—but most 'code' is pseudocode or illustrative rather than executable. The JavaScript snippet in Section 4 is conceptual (AskUserQuestion isn't real JS), and the actual execution relies entirely on phase files that aren't provided. The skill describes what to do at a high level but delegates all real execution detail to external phase files.	2 / 3
Workflow Clarity	The multi-step pipeline is clearly sequenced (Phase 1→5) with explicit phase transitions and the coordinator checklist provides verification steps. However, validation checkpoints within phases are vague ('Verify TEST_ANALYSIS_RESULTS.md created'), error recovery is listed in tables but lacks concrete feedback loops (e.g., what exactly happens on retry), and the fix loop's validate→fix→retry cycle is described abstractly rather than with explicit validation commands. The redundant presentation of the same flow in 4+ different formats actually hurts clarity.	2 / 3
Progressive Disclosure	The skill correctly references 5 phase files (phases/01-05) for progressive loading, and the phase reference table is well-structured. However, no bundle files are provided, so we can't verify the references resolve. More critically, the SKILL.md itself is a monolithic wall of text that inlines enormous amounts of detail (TodoWrite patterns, session file structure, full error handling tables, strategy engine details) that should be in the phase files or separate references, undermining the stated 'pure coordinator' design principle.	2 / 3
	Total	7 / 12 Passed

Description

35%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description attempts to convey a complex multi-stage testing workflow but relies heavily on internal jargon and parenthetical feature lists rather than plain, concrete actions. The trigger terms are artificial command strings rather than natural user language, significantly reducing discoverability. The description would benefit from plain-language actions and realistic usage scenarios.

Suggestions

Replace artificial trigger terms ('workflow-test-fix') with natural user phrases like 'fix failing tests', 'generate and run tests', 'debug test failures', 'iteratively fix broken tests'.

Add a clear 'Use when...' clause describing real scenarios, e.g., 'Use when the user wants to automatically generate tests, run them, and iteratively fix failures until tests pass.'

Replace jargon-heavy parentheticals ('session, context, analysis, task gen', 'adaptive strategy, progressive testing, CLI fallback') with concrete plain-language actions like 'analyzes code context, generates test cases, runs tests, and iteratively fixes failures'.

Dimension	Reasoning	Score
Specificity	It names the domain (test generation and execution) and mentions some actions like 'session, context, analysis, task gen' and 'adaptive strategy, progressive testing, CLI fallback', but these read more like internal jargon/feature labels than concrete user-facing actions. It doesn't clearly list what specific things it does in plain language.	2 / 3
Completeness	It describes 'what' (a unified test-fix pipeline combining generation and execution) and has explicit trigger terms, but the 'when' guidance is limited to artificial trigger phrases rather than describing real scenarios when Claude should select this skill. The trigger clause exists but is not meaningful.	2 / 3
Trigger Term Quality	The trigger terms 'workflow-test-fix' and 'test fix workflow' are artificial command-like phrases, not natural language a user would say. Missing natural terms like 'fix failing tests', 'run tests', 'debug test failures', 'generate tests', etc.	1 / 3
Distinctiveness Conflict Risk	The description is somewhat specific to a test-fix pipeline workflow, which narrows the domain, but the jargon-heavy language and vague sub-components (e.g., 'adaptive strategy', 'progressive testing') could overlap with other testing-related skills without clear differentiation.	2 / 3
	Total	7 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning

	Total	10 / 11 Passed

Repository: catlog22/Claude-Code-Workflow
Path: .claude/skills/workflow-test-fix/SKILL.md
Commit: 07491b0

Reviewed: about 12 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.