Cross-language testing strategies and patterns. Triggers on: test pyramid, unit test, integration test, e2e test, TDD, BDD, test coverage, mocking strategy, test doubles, test isolation.
73
61%
Does it follow best practices?
Impact
92%
1.17xAverage score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./data/skills-md/0xdarkmatter/claude-mods/testing-patterns/SKILL.mdQuality
Discovery
64%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has strong trigger term coverage with a comprehensive list of testing-related keywords users would naturally use. However, it lacks specificity about what concrete actions the skill performs (advise on strategies? generate tests? refactor test code?) and the 'when' guidance is implicit through keyword listing rather than explicit scenario descriptions. Adding concrete actions and a 'Use when...' clause would significantly improve it.
Suggestions
Replace the vague 'strategies and patterns' with specific concrete actions, e.g., 'Advises on test architecture, generates unit/integration/e2e tests, designs mocking strategies, and recommends coverage improvements across languages.'
Add an explicit 'Use when...' clause describing scenarios, e.g., 'Use when the user asks about structuring tests, improving test coverage, choosing between testing approaches, or implementing test doubles.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain ('cross-language testing strategies and patterns') and implies actions around testing, but does not list specific concrete actions like 'generate unit tests', 'create mock objects', or 'analyze test coverage'. It reads more like a topic label than a list of capabilities. | 2 / 3 |
Completeness | The 'what' is weakly stated ('cross-language testing strategies and patterns') and the 'when' is expressed via 'Triggers on:' with a keyword list rather than an explicit 'Use when...' clause describing scenarios. While the trigger list partially serves as 'when' guidance, it lacks explicit situational framing, capping this at 2. | 2 / 3 |
Trigger Term Quality | The description includes a strong set of natural trigger terms that users would actually say: 'test pyramid', 'unit test', 'integration test', 'e2e test', 'TDD', 'BDD', 'test coverage', 'mocking strategy', 'test doubles', 'test isolation'. These cover common variations well. | 3 / 3 |
Distinctiveness Conflict Risk | While the testing focus is clear, 'cross-language testing strategies' is broad enough to potentially overlap with language-specific testing skills or general code quality skills. The trigger terms help narrow it, but the lack of specificity about what it actually does (advise? generate? refactor?) creates some ambiguity. | 2 / 3 |
Total | 9 / 12 Passed |
Implementation
57%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides a reasonable reference overview of testing patterns with good structure and progressive disclosure to external resources. However, it spends significant tokens on concepts Claude already knows (test pyramid basics, test double definitions) and lacks executable, copy-paste-ready code for most sections. The content reads more like a reference card than actionable guidance for performing specific testing tasks.
Suggestions
Remove or drastically condense sections that explain concepts Claude already knows well (test pyramid, basic test type definitions, test double definitions) and focus tokens on project-specific conventions or non-obvious patterns.
Replace pseudocode blocks (database isolation, external service isolation) with executable code examples in at least one language, or provide concrete command-line examples.
Add a workflow sequence for implementing testing in a project—e.g., 'Start with unit tests → check coverage with ./scripts/coverage-check.sh → add integration tests for failing boundaries → validate CI passes' with explicit checkpoints.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is reasonably structured but includes several things Claude already knows well—the test pyramid ASCII art, basic definitions of unit/integration/E2E tests, and explanations of test doubles are all foundational knowledge. The 'What to Test' and 'Test Quality Checklist' sections are somewhat generic. However, it avoids excessive prose and uses tables/lists efficiently. | 2 / 3 |
Actionability | The AAA example is executable Python, and the naming convention is concrete. However, most other sections use pseudocode or plain-text descriptions rather than executable code (database isolation strategies, external service isolation, test doubles table uses pseudo-syntax). The skill describes patterns more than it instructs on how to implement them. | 2 / 3 |
Workflow Clarity | The content presents categories and options clearly but lacks a sequenced workflow for actually implementing a testing strategy. There are no validation checkpoints or feedback loops—for example, no guidance on 'run tests, check coverage, fix gaps, re-run.' The test quality checklist is helpful but is a static list rather than a process. | 2 / 3 |
Progressive Disclosure | The skill provides a clear overview with well-signaled one-level-deep references to additional resources (tdd-workflow.md, mocking-strategies.md, test-data-patterns.md, ci-testing.md) and a script reference. Content is appropriately split between the overview and referenced files. | 3 / 3 |
Total | 9 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
9f4534c
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.