CtrlK
BlogDocsLog inGet started
Tessl Logo

workflow-test-fix

Unified test-fix pipeline combining test generation (session, context, analysis, task gen) with iterative test-cycle execution (adaptive strategy, progressive testing, CLI fallback). Triggers on "workflow-test-fix", "test fix workflow".

47

Quality

35%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Critical

Do not install without reviewing

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/workflow-test-fix/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

35%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description attempts to cover a complex workflow but relies heavily on internal implementation terminology (session, context, analysis, task gen, adaptive strategy, progressive testing, CLI fallback) rather than describing concrete user-facing capabilities in plain language. The trigger terms are overly specific command phrases rather than natural language a user would employ, and the description lacks clear guidance on when Claude should choose this skill beyond exact phrase matching.

Suggestions

Replace jargon-heavy parenthetical lists with concrete user-facing actions, e.g., 'Generates unit tests, runs them, identifies failures, and iteratively fixes code until tests pass'.

Add natural trigger terms users would actually say, such as 'fix failing tests', 'debug test errors', 'generate and run tests', 'make tests pass', 'test suite failures'.

Expand the 'when' clause beyond exact command phrases to describe scenarios, e.g., 'Use when the user wants to generate tests for their code and iteratively fix failures until all tests pass'.

DimensionReasoningScore

Specificity

It names the domain (test generation and execution) and mentions some actions like 'session, context, analysis, task gen' and 'adaptive strategy, progressive testing, CLI fallback', but these are more like internal implementation details than concrete user-facing actions. The parenthetical lists read as jargon rather than clear capabilities.

2 / 3

Completeness

The 'what' is partially addressed (test generation and iterative test-cycle execution), and there is a 'Triggers on' clause that serves as a 'when', but the triggers are narrow command phrases rather than meaningful contextual guidance about when Claude should select this skill.

2 / 3

Trigger Term Quality

The trigger terms 'workflow-test-fix' and 'test fix workflow' are very specific command-like phrases that users would not naturally say. Natural terms like 'fix failing tests', 'run tests', 'debug test failures', or 'generate tests' are absent. The description relies on technical jargon like 'adaptive strategy' and 'progressive testing' rather than natural user language.

1 / 3

Distinctiveness Conflict Risk

The combination of test generation and test-fix pipeline is somewhat specific, but the broad terms 'test generation' and 'test execution' could overlap with other testing-related skills. The narrow trigger phrases help reduce conflict but at the cost of discoverability.

2 / 3

Total

7

/

12

Passed

Implementation

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill is an ambitious orchestration document that suffers severely from verbosity and redundancy—the same 5-phase pipeline is described in at least 4 different representations (ASCII diagram, execution flow, data flow, coordinator checklist). While the workflow structure is well-thought-out with quality gates and adaptive strategies, the SKILL.md tries to be both a high-level coordinator and a detailed reference document, resulting in a ~400-line file that violates its own 'pure orchestrator' principle. Much of the inline detail (error tables, TodoWrite JSON examples, session file structure) should live in the referenced phase files.

Suggestions

Eliminate redundant pipeline descriptions: keep only the Execution Flow (Section 6) and Coordinator Checklist (Section 20), removing the Architecture Overview diagram, Data Flow section, and Summary Output template which all repeat the same information.

Move detailed content to phase files: error handling tables (Section 16), TodoWrite JSON examples (Section 14), session file structure (Section 15), and strategy engine details (Section 12) should live in their respective phase documents, with SKILL.md containing only one-line summaries.

Reduce the SKILL.md to under 100 lines by keeping only: entry point routing, phase sequence with one-line descriptions, quality thresholds, core rules, and the coordinator checklist.

Provide the actual bundle phase files so the progressive disclosure model works—currently the skill references 5 phase files that don't exist in the bundle, making the orchestrator non-functional on its own.

DimensionReasoningScore

Conciseness

Extremely verbose at ~400+ lines with massive redundancy. The architecture diagram, data flow, execution flow, and coordinator checklist all describe the same pipeline multiple times. The TodoWrite pattern section shows three full JSON examples that repeat similar information. Much content (e.g., explaining what PDF-like concepts are to Claude, the detailed ASCII diagrams) could be dramatically condensed. The summary output template, session file structure, and error handling tables all add bulk that could live in phase files.

1 / 3

Actionability

Provides some concrete guidance (file paths, JSON structures, CLI flags, quality thresholds) but most content is architectural description rather than executable instructions. The JavaScript code block for preference collection is useful but is pseudocode-like (AskUserQuestion isn't a real API call shown with proper context). The actual execution is delegated to phase files that aren't provided, making the skill itself more of a design document than actionable instructions.

2 / 3

Workflow Clarity

The multi-phase sequence is clearly defined with Phase 1→5 progression, and there are validation checkpoints (quality gates, pass rate thresholds). However, the same workflow is described in at least 4 different ways (architecture diagram, execution flow, data flow, coordinator checklist) creating confusion rather than clarity. The fix loop has validation steps but the feedback loop details are deferred to Phase 5 docs. Missing explicit validation between phases (e.g., what happens if Phase 2 output is malformed before Phase 3).

2 / 3

Progressive Disclosure

The skill correctly references phase files (phases/01-session-start.md through phases/05-test-cycle-execute.md) with a clear one-level-deep structure and a phase reference table. However, no bundle files are provided, so we can't verify these references resolve. More critically, the SKILL.md itself contains enormous amounts of detail that should live in those phase files (error handling per phase, TodoWrite patterns, session file structure), undermining the progressive disclosure model it claims to follow.

2 / 3

Total

7

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

Total

10

/

11

Passed

Repository
catlog22/Claude-Code-Workflow
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.