CtrlK
BlogDocsLog inGet started
Tessl Logo

tdd-guide

Test-driven development skill for writing unit tests, generating test fixtures and mocks, analyzing coverage gaps, and guiding red-green-refactor workflows across Jest, Pytest, JUnit, Vitest, and Mocha. Use when the user asks to write tests, improve test coverage, practice TDD, generate mocks or stubs, or mentions testing frameworks like Jest, pytest, or JUnit.

79

Quality

73%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./engineering-team/tdd-guide/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that hits all the marks. It provides specific concrete actions, comprehensive trigger terms that users would naturally use, a clear 'Use when...' clause, and a well-defined niche around testing and TDD. The description is concise yet thorough, using proper third-person voice throughout.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: writing unit tests, generating test fixtures and mocks, analyzing coverage gaps, and guiding red-green-refactor workflows. Also names specific frameworks (Jest, Pytest, JUnit, Vitest, Mocha).

3 / 3

Completeness

Clearly answers both 'what' (writing unit tests, generating fixtures/mocks, analyzing coverage gaps, guiding TDD workflows) and 'when' (explicit 'Use when...' clause listing multiple trigger scenarios including writing tests, improving coverage, practicing TDD, generating mocks, or mentioning specific frameworks).

3 / 3

Trigger Term Quality

Excellent coverage of natural terms users would say: 'write tests', 'test coverage', 'TDD', 'mocks', 'stubs', 'Jest', 'pytest', 'JUnit', 'testing frameworks'. These are all terms users would naturally use when seeking testing help.

3 / 3

Distinctiveness Conflict Risk

Clearly carved out niche around test-driven development and testing specifically. The mention of specific frameworks, TDD workflows, mocks/stubs, and coverage analysis makes it highly distinct and unlikely to conflict with general coding or other development skills.

3 / 3

Total

12

/

12

Passed

Implementation

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill has excellent workflow clarity with well-defined steps and validation checkpoints, and the bounded autonomy rules are a strong addition. However, it suffers significantly from verbosity — it tries to be a comprehensive TDD encyclopedia rather than a focused skill file, including tutorial-level explanations of concepts Claude already knows (mutation testing, property-based testing) and redundant examples across 4+ languages. The referenced tooling scripts appear fictional, reducing practical actionability.

Suggestions

Cut the property-based testing and mutation testing sections into separate reference files (e.g., PROPERTY_TESTING.md, MUTATION_TESTING.md) and link to them from the main skill — this alone would halve the file length.

Remove explanatory content Claude already knows (e.g., 'Why Mutation Testing Matters', 'When to Use Property-Based Over Example-Based') and replace with terse decision rules.

Clarify the tooling story — either explain where test_generator.py/coverage_analyzer.py come from (are they in the repo? need installation?) or reframe the skill to work without custom scripts.

Reduce language examples to 1-2 representative cases in the main file and move additional language examples to a LANGUAGE_EXAMPLES.md reference file.

DimensionReasoningScore

Conciseness

At ~300+ lines, this skill is extremely verbose. It includes extensive code examples across multiple languages (Go, TypeScript, Python), sections on property-based testing, mutation testing, and cross-references that bloat the content well beyond what's needed. Much of this (e.g., explaining what mutation testing is, why it matters, what property-based testing is) is knowledge Claude already has. The mutation testing and property-based testing sections are essentially tutorials rather than actionable skill instructions.

1 / 3

Actionability

The skill references specific scripts (test_generator.py, coverage_analyzer.py, tdd_workflow.py) with concrete CLI commands, and provides executable code examples. However, these scripts appear to be fictional/internal tools with no indication of where they come from or how to install them, making the commands not truly copy-paste ready. The code examples themselves are good but serve more as illustration than actionable instruction.

2 / 3

Workflow Clarity

The workflows are clearly sequenced with explicit validation checkpoints at each stage. The Generate Tests, Analyze Coverage, and TDD New Feature workflows all have numbered steps with bold **Validation** steps. The spec-first workflow includes a clear progression from spec → tests → implementation → refactor. The bounded autonomy rules provide clear decision points for when to stop and ask.

3 / 3

Progressive Disclosure

The skill references external skills via cross-references (e.g., engineering/spec-driven-workflow) which is good, but the main file itself is monolithic — property-based testing, mutation testing, and multi-language examples could easily be split into separate reference files. The content that should be in supplementary files is all inline, making the main skill file excessively long.

2 / 3

Total

8

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
alirezarezvani/claude-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.