Run conversion rate optimization through hypothesis-driven testing including audit, hypothesis generation, test design, statistical analysis, and rollout decisions. Use this skill whenever the user wants to optimize conversion, run A/B tests, audit a funnel, generate test hypotheses, design experiments, or analyze test results. Triggers on conversion optimization, CRO, A/B test, split test, multivariate test, hypothesis, conversion funnel, funnel audit, experiment design, statistical significance, lift, optimization. Also triggers when the user has a conversion problem and isn't sure where to start, or when test results are ambiguous and need interpretation.
72
88%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that covers all key dimensions thoroughly. It provides specific concrete actions, comprehensive trigger terms including both technical and natural language variations, explicit 'Use when' guidance, and occupies a distinct niche. The inclusion of edge cases (ambiguous results, not knowing where to start) is a particularly strong touch.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: audit, hypothesis generation, test design, statistical analysis, and rollout decisions. These are clearly defined stages of a CRO workflow. | 3 / 3 |
Completeness | Clearly answers both 'what' (hypothesis-driven testing including audit, hypothesis generation, test design, statistical analysis, rollout decisions) and 'when' (explicit 'Use this skill whenever...' clause plus detailed trigger list and edge-case scenarios). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'conversion optimization', 'CRO', 'A/B test', 'split test', 'multivariate test', 'funnel audit', 'statistical significance', 'lift'. Also covers ambiguous scenarios like 'conversion problem and isn't sure where to start'. | 3 / 3 |
Distinctiveness Conflict Risk | Occupies a clear niche around conversion rate optimization and A/B testing. The specific domain terminology (CRO, funnel audit, split test, statistical significance, lift) makes it highly unlikely to conflict with general analytics or marketing skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, well-structured CRO skill that provides genuinely actionable frameworks—the hypothesis template, decision matrix, sample size table, and output format are all immediately usable. The main weakness is length: some sections (statistical foundations, failure patterns) could be more concise or moved to reference files, and a few explanations cover ground Claude already knows. Overall, it's a high-quality process skill that would effectively guide CRO work.
Suggestions
Move the 'Statistical foundations' section to a reference file (e.g., references/statistical-foundations.md) and link to it from the main skill—Claude knows basic statistics and this section is primarily reference material.
Trim the 'Failure patterns' section to the top 5 most critical patterns; several overlap with anti-patterns already covered in the test design section.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is generally well-structured but includes some content Claude already knows (e.g., explaining what 95% significance means, basic statistical concepts). The statistical foundations section and some of the anti-patterns/failure patterns sections could be tightened. However, the domain-specific frameworks (hypothesis structure, ICE/PIE, decision matrix) earn their tokens. | 2 / 3 |
Actionability | The skill provides highly concrete, actionable guidance: a specific hypothesis template with fill-in-the-blank structure, a complete sample size reference table, a decision framework table with clear conditions, a full output file template with exact markdown structure, and specific criteria for when to ship/kill/extend. While there's no executable code (appropriate for a strategy/process skill), every section gives specific, copy-paste-ready frameworks. | 3 / 3 |
Workflow Clarity | The 10-step workflow is clearly sequenced with explicit validation checkpoints (e.g., 'Don't peek. Don't stop early,' decision criteria defined before launch, QA step before running). The decision framework table provides clear feedback loops for different outcomes, and the test design section explicitly addresses common mistakes that would invalidate results. The audit→hypothesize→test→decide framework is well-structured with clear phase gates. | 3 / 3 |
Progressive Disclosure | The skill references one external file (references/hypothesis-library.md) and cross-references two other skills (landing-page-copy, analytics-strategy), which is good. However, the content is quite long (~300 lines) and some sections like the statistical foundations and the extensive failure patterns list could be split into reference files. The inline content is well-organized with clear headers but could benefit from more aggressive splitting. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
8e70d03
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.