integration-e2e-testing

Integration and E2E test design principles, ROI calculation, test skeleton specification, and review criteria. Use when designing integration tests, E2E tests, or reviewing test quality.

Quality

58%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/integration-e2e-testing/SKILL.md

Quality

Discovery

67%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is functional and covers the basics with an explicit 'Use when' clause, which is good. However, the capability descriptions lean toward abstract concepts ('design principles', 'review criteria') rather than concrete actions, and the trigger terms could be expanded to capture more natural user phrasings. The scope is reasonably distinct but could be tightened to avoid overlap with general testing or code review skills.

Suggestions

Replace abstract nouns with concrete actions, e.g., 'Generates integration and E2E test skeletons, calculates test ROI, defines test boundaries and review checklists' instead of 'design principles' and 'review criteria'.

Expand trigger terms in the 'Use when' clause to include natural variations like 'end-to-end tests', 'testing strategy', 'test plan', 'API testing', or 'system tests'.

Dimension	Reasoning	Score
Specificity	Names the domain (integration/E2E testing) and lists some actions (design principles, ROI calculation, test skeleton specification, review criteria), but these are somewhat abstract rather than concrete actions like 'generate test skeletons' or 'calculate test ROI metrics'.	2 / 3
Completeness	Clearly answers both 'what' (integration/E2E test design principles, ROI calculation, test skeleton specification, review criteria) and 'when' with an explicit 'Use when' clause specifying designing integration tests, E2E tests, or reviewing test quality.	3 / 3
Trigger Term Quality	Includes relevant terms like 'integration tests', 'E2E tests', and 'test quality', but misses common variations users might say such as 'end-to-end tests', 'testing strategy', 'test coverage', 'test plan', or 'acceptance tests'.	2 / 3
Distinctiveness Conflict Risk	Focuses on integration and E2E testing specifically, which helps distinguish it from unit testing skills, but 'reviewing test quality' is broad enough to potentially overlap with general code review or unit testing skills.	2 / 3
	Total	9 / 12 Passed

Implementation

50%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured policy/specification document for integration and E2E test design with strong ROI calculation examples and clear budget constraints. Its main weaknesses are the lack of executable test skeleton examples in any concrete language, the absence of an explicit step-by-step workflow for the test design process, and some redundancy across sections. It reads more as a reference specification than an actionable skill that guides Claude through a specific task.

Suggestions

Add at least one complete, executable test skeleton example (e.g., a fixture-e2e test in Playwright or an integration test in Jest/RTL) showing all required comment annotations in context, rather than just the annotation template.

Add an explicit numbered workflow section (e.g., '## Test Design Workflow') that sequences the steps: identify candidates → classify lane → calculate ROI → rank and select within budget → write skeleton → validate annotations → implement.

Consider splitting ROI calculation details and review criteria into separate reference files, keeping SKILL.md as a concise overview with clear pointers to each reference.

Dimension	Reasoning	Score
Conciseness	The content is mostly efficient and provides genuinely useful tables and formulas, but includes some redundancy (e.g., the lane selection rationale is repeated across multiple sections, and the multi-step user journey classification table partially restates earlier lane selection guidance). Some sections like EARS Format Mapping and Quality Standards state things Claude already knows (AAA pattern, test independence).	2 / 3
Actionability	The skill provides concrete annotation patterns, naming conventions, ROI formulas with worked examples, and clear selection criteria — all actionable. However, it lacks executable code examples (no actual test skeleton in any language, no real test file example), making it more of a specification/policy document than a copy-paste-ready guide. The comment pattern block is pseudocode-like rather than a complete test skeleton.	2 / 3
Workflow Clarity	The skill clearly defines test types, budgets, ROI thresholds, and selection criteria, which implicitly form a workflow (identify candidates → calculate ROI → rank → select within budget → write skeleton → review). However, this workflow is never explicitly sequenced as steps. There are no validation checkpoints for the test design process itself — e.g., no 'verify deduplication before finalizing selection' or 'validate skeleton annotations before implementation' steps.	2 / 3
Progressive Disclosure	The skill references one external file (references/e2e-design.md) which is well-signaled, but no bundle files were provided to verify it exists. The document itself is quite long (~180 lines) and could benefit from splitting the ROI calculation details, review criteria, and skeleton specification into separate reference files, keeping SKILL.md as a concise overview with pointers.	2 / 3
	Total	8 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: shinpr/claude-code-workflows
Commit: 68ecb4a

Reviewed: about 13 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.