Integration and E2E test design principles, ROI calculation, test skeleton specification, and review criteria. Use when designing integration tests, E2E tests, or reviewing test quality.
53
58%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/integration-e2e-testing/SKILL.mdQuality
Discovery
67%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description is functional and covers the basics with an explicit 'Use when' clause, which is good. However, the capability descriptions lean toward abstract concepts ('design principles', 'review criteria') rather than concrete actions, and the trigger terms could be expanded to capture more natural user phrasings. The scope is reasonably distinct but could be tightened to avoid overlap with general testing or code review skills.
Suggestions
Replace abstract nouns with concrete actions, e.g., 'Generates integration and E2E test skeletons, calculates test ROI, defines test boundaries and review checklists' instead of 'design principles' and 'review criteria'.
Expand trigger terms in the 'Use when' clause to include natural variations like 'end-to-end tests', 'testing strategy', 'test plan', 'API testing', or 'system tests'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Names the domain (integration/E2E testing) and lists some actions (design principles, ROI calculation, test skeleton specification, review criteria), but these are somewhat abstract rather than concrete actions like 'generate test skeletons' or 'calculate test ROI metrics'. | 2 / 3 |
Completeness | Clearly answers both 'what' (integration/E2E test design principles, ROI calculation, test skeleton specification, review criteria) and 'when' with an explicit 'Use when' clause specifying designing integration tests, E2E tests, or reviewing test quality. | 3 / 3 |
Trigger Term Quality | Includes relevant terms like 'integration tests', 'E2E tests', and 'test quality', but misses common variations users might say such as 'end-to-end tests', 'testing strategy', 'test coverage', 'test plan', or 'acceptance tests'. | 2 / 3 |
Distinctiveness Conflict Risk | Focuses on integration and E2E testing specifically, which helps distinguish it from unit testing skills, but 'reviewing test quality' is broad enough to potentially overlap with general code review or unit testing skills. | 2 / 3 |
Total | 9 / 12 Passed |
Implementation
50%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured policy/specification document for integration and E2E test design with strong ROI calculation examples and clear budget constraints. Its main weaknesses are the lack of executable test skeleton examples in any concrete language, the absence of an explicit step-by-step workflow for the test design process, and some redundancy across sections. It reads more as a reference specification than an actionable skill that guides Claude through a specific task.
Suggestions
Add at least one complete, executable test skeleton example (e.g., a fixture-e2e test in Playwright or an integration test in Jest/RTL) showing all required comment annotations in context, rather than just the annotation template.
Add an explicit numbered workflow section (e.g., '## Test Design Workflow') that sequences the steps: identify candidates → classify lane → calculate ROI → rank and select within budget → write skeleton → validate annotations → implement.
Consider splitting ROI calculation details and review criteria into separate reference files, keeping SKILL.md as a concise overview with clear pointers to each reference.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is mostly efficient and provides genuinely useful tables and formulas, but includes some redundancy (e.g., the lane selection rationale is repeated across multiple sections, and the multi-step user journey classification table partially restates earlier lane selection guidance). Some sections like EARS Format Mapping and Quality Standards state things Claude already knows (AAA pattern, test independence). | 2 / 3 |
Actionability | The skill provides concrete annotation patterns, naming conventions, ROI formulas with worked examples, and clear selection criteria — all actionable. However, it lacks executable code examples (no actual test skeleton in any language, no real test file example), making it more of a specification/policy document than a copy-paste-ready guide. The comment pattern block is pseudocode-like rather than a complete test skeleton. | 2 / 3 |
Workflow Clarity | The skill clearly defines test types, budgets, ROI thresholds, and selection criteria, which implicitly form a workflow (identify candidates → calculate ROI → rank → select within budget → write skeleton → review). However, this workflow is never explicitly sequenced as steps. There are no validation checkpoints for the test design process itself — e.g., no 'verify deduplication before finalizing selection' or 'validate skeleton annotations before implementation' steps. | 2 / 3 |
Progressive Disclosure | The skill references one external file (references/e2e-design.md) which is well-signaled, but no bundle files were provided to verify it exists. The document itself is quite long (~180 lines) and could benefit from splitting the ROI calculation details, review criteria, and skeleton specification into separate reference files, keeping SKILL.md as a concise overview with pointers. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
68ecb4a
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.