Integration and E2E test design principles, ROI calculation, test skeleton specification, and review criteria. Use when designing integration tests, E2E tests, or reviewing test quality.
53
58%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/integration-e2e-testing/SKILL.mdQuality
Discovery
67%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description is functional and follows good structure with an explicit 'Use when' clause, clearly separating what the skill does from when to use it. However, the capability descriptions lean toward abstract concepts ('design principles', 'review criteria') rather than concrete actions, and the trigger terms could be expanded to cover more natural user phrasings like 'end-to-end' or 'testing strategy'.
Suggestions
Replace abstract nouns with concrete actions, e.g., 'Generates integration and E2E test skeletons, calculates test ROI, and evaluates test quality against review criteria'.
Expand trigger terms to include common variations: 'end-to-end tests', 'testing strategy', 'test plan', 'test coverage', 'acceptance testing', 'smoke tests'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Names the domain (integration/E2E testing) and lists some actions (design principles, ROI calculation, test skeleton specification, review criteria), but these are somewhat abstract rather than concrete actions like 'generate test skeletons' or 'calculate test ROI metrics'. | 2 / 3 |
Completeness | Clearly answers both what ('Integration and E2E test design principles, ROI calculation, test skeleton specification, and review criteria') and when ('Use when designing integration tests, E2E tests, or reviewing test quality') with an explicit 'Use when' clause. | 3 / 3 |
Trigger Term Quality | Includes relevant terms like 'integration tests', 'E2E tests', and 'test quality', but misses common variations users might say such as 'end-to-end tests', 'testing strategy', 'test coverage', 'test plan', or 'acceptance tests'. | 2 / 3 |
Distinctiveness Conflict Risk | Focuses on integration/E2E testing which is somewhat specific, but could overlap with general testing skills, unit testing skills, or test automation skills. The 'review criteria' aspect could also conflict with code review skills. | 2 / 3 |
Total | 9 / 12 Passed |
Implementation
50%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides a comprehensive framework for integration and E2E test design with useful domain-specific knowledge (ROI formulas, lane budgets, selection criteria). Its main weaknesses are the lack of executable test code examples and the absence of a clear step-by-step workflow for the test design process. The content is well-organized with tables and headers but could be more concise and better split across reference files.
Suggestions
Add at least one complete, executable test skeleton example in a common language (e.g., TypeScript with Playwright for fixture-e2e, or a Jest integration test) showing the required comment annotations in context with actual test code.
Add an explicit numbered workflow section (e.g., 'How to Design Tests for a Feature') that sequences the steps: identify candidates → classify lanes → calculate ROI → rank and select within budget → write skeletons → review against criteria.
Move the detailed ROI calculation examples table and the multi-step user journey classification details into a reference file (e.g., references/roi-examples.md) to reduce the main document's token footprint.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is mostly efficient and provides genuinely useful information Claude wouldn't inherently know (ROI formulas, lane selection rules, budget limits). However, some sections are verbose — the ROI calculation examples table is extensive, and the multi-step user journey definition section could be tightened. The EARS format mapping and some table structures add moderate overhead. | 2 / 3 |
Actionability | The skill provides concrete annotation patterns, naming conventions, and ROI formulas, which are actionable. However, it lacks executable code examples — the test skeleton specification shows comment patterns but no actual test code skeleton in any language. The guidance is specific but stops short of copy-paste ready test templates. | 2 / 3 |
Workflow Clarity | The skill defines clear selection criteria (ROI thresholds, lane selection rules, budget limits) and review checklists, but lacks an explicit step-by-step workflow for designing tests. There's no sequenced process like '1. Identify candidates → 2. Calculate ROI → 3. Select within budget → 4. Write skeletons → 5. Review.' The information is organized by topic rather than by workflow, and there are no validation checkpoints for the test design process itself. | 2 / 3 |
Progressive Disclosure | There is one reference to an external file (references/e2e-design.md) which is well-signaled, but the bundle shows no files were provided, making it impossible to verify. The main document itself is quite long and monolithic — sections like the detailed ROI examples table and multi-step journey classification could be split into reference files. The structure has clear headings but the content density suggests it would benefit from more aggressive splitting. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
adf2e4d
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.