Cross-language testing strategies and patterns. Triggers on: test pyramid, unit test, integration test, e2e test, TDD, BDD, test coverage, mocking strategy, test doubles, test isolation.
67
51%
Does it follow best practices?
Impact
92%
1.17xAverage score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./data/skills-md/0xdarkmatter/claude-mods/testing-patterns/SKILL.mdQuality
Discovery
64%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has strong trigger term coverage with many natural keywords users would use, but it lacks specificity about what concrete actions the skill performs. The 'what' is too abstract ('strategies and patterns') and the 'when' relies on a trigger list rather than explicit scenario guidance. Adding concrete actions and a 'Use when...' clause would significantly improve it.
Suggestions
Replace the vague 'strategies and patterns' with specific actions like 'Recommends test architectures, generates test templates, advises on mocking strategies, and designs test pyramids across multiple programming languages.'
Add an explicit 'Use when...' clause describing user scenarios, e.g., 'Use when the user asks for help structuring tests, choosing between testing approaches, or improving test coverage in any programming language.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain ('cross-language testing strategies and patterns') and lists relevant concepts, but does not describe concrete actions the skill performs (e.g., 'generates test suites', 'recommends mocking strategies', 'creates test plans'). It reads more like a topic list than a capability description. | 2 / 3 |
Completeness | The 'when' is partially addressed via the 'Triggers on:' clause listing trigger terms, but there is no explicit 'Use when...' guidance explaining the scenarios or user intents. The 'what' is weak—it says 'strategies and patterns' without specifying what actions the skill performs. The trigger list partially compensates but doesn't fully replace an explicit 'when' clause. | 2 / 3 |
Trigger Term Quality | The description includes a strong set of natural keywords users would actually say: 'unit test', 'integration test', 'e2e test', 'TDD', 'BDD', 'test coverage', 'mocking strategy', 'test doubles', 'test isolation', and 'test pyramid'. These cover common variations well. | 3 / 3 |
Distinctiveness Conflict Risk | While the testing domain is reasonably specific, 'cross-language testing strategies' is broad enough to potentially overlap with language-specific testing skills or general software development skills. The trigger terms help narrow it but terms like 'unit test' or 'test coverage' could easily match other testing-related skills. | 2 / 3 |
Total | 9 / 12 Passed |
Implementation
37%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill reads more like a textbook summary of testing concepts than an actionable skill for Claude. Most of the content (test pyramid, test types, AAA pattern, test doubles) is knowledge Claude already possesses, making the token expenditure inefficient. The lack of concrete workflows, executable examples beyond one Python snippet, and missing bundle files significantly limit its practical value.
Suggestions
Remove or drastically condense sections covering concepts Claude already knows (test pyramid, test types definitions, AAA pattern, test doubles taxonomy) and focus on project-specific conventions or non-obvious guidance.
Add a concrete TDD workflow with explicit steps, validation checkpoints, and decision points (e.g., when to write which type of test, how to iterate on failing tests).
Replace pseudocode in database isolation and external service isolation sections with executable, copy-paste-ready examples in at least one language.
Either provide the referenced bundle files (tdd-workflow.md, mocking-strategies.md, etc.) or remove the references to avoid broken links.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is reasonably organized but includes several things Claude already knows well—the test pyramid concept, what unit/integration/E2E tests are, the AAA pattern, and the test doubles taxonomy are all standard knowledge. The ASCII art pyramid and the descriptive tables add tokens without adding novel insight. However, it's not egregiously verbose and serves as a quick reference. | 2 / 3 |
Actionability | The skill provides some concrete guidance (naming conventions, AAA example in Python, test doubles table) but much of it is descriptive rather than executable. The database isolation and external service isolation sections use pseudocode/descriptions rather than concrete commands. The Python AAA example is the only truly executable snippet. | 2 / 3 |
Workflow Clarity | There is no clear multi-step workflow or sequenced process. The content is a reference catalog of testing concepts rather than a guide for how to approach testing a codebase. There are no validation checkpoints, feedback loops, or decision trees for choosing between strategies. For a skill covering TDD and testing strategies, a workflow showing the TDD cycle or how to decide test type would be expected. | 1 / 3 |
Progressive Disclosure | The skill references four files under 'Additional Resources' and one script, which is good structure. However, no bundle files are provided, so these references are broken/unverifiable. The main content itself could benefit from moving some of the reference material (test doubles table, isolation strategies) into separate files to keep the overview leaner. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
25d5403
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.