tessl i github:coreyhaines31/marketingskills --skill ab-test-setupWhen the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking.
Validation
81%| Criteria | Description | Result |
|---|---|---|
metadata_version | 'metadata' field is not a dictionary | Warning |
license_field | 'license' field is missing | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 13 / 16 Passed | |
Implementation
85%This is a well-structured, actionable skill that provides comprehensive A/B testing guidance with clear workflows and good progressive disclosure. The main weakness is some verbosity in explaining concepts Claude already understands (statistical significance, the peeking problem) and unnecessary persona framing. The concrete tables, checklists, and frameworks make this highly usable.
Suggestions
Remove the opening persona statement ('You are an expert...') and explanatory content about basic concepts like statistical significance that Claude already knows
Condense the 'Common Mistakes' section into a brief checklist rather than categorized explanations
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is generally efficient but includes some unnecessary framing ('You are an expert...') and explanatory text that Claude already knows (e.g., explaining what statistical significance means, what peeking is). The tables and frameworks are well-structured but could be tighter. | 2 / 3 |
Actionability | Provides concrete, actionable guidance throughout: specific hypothesis templates, sample size tables with exact numbers, implementation checklists, and clear decision frameworks. The examples are specific and the tables provide copy-paste ready reference material. | 3 / 3 |
Workflow Clarity | Clear sequential workflow from hypothesis through analysis with explicit checkpoints: pre-launch checklist, during-test DO/DON'T lists, and analysis checklist. The process is well-sequenced with validation steps (QA, tracking verification) before proceeding. | 3 / 3 |
Progressive Disclosure | Well-organized with clear sections, appropriate use of tables for reference data, and explicit links to deeper content (sample-size-guide.md, test-templates.md). The skill serves as an effective overview with one-level-deep references to detailed materials. | 3 / 3 |
Total | 11 / 12 Passed |
Activation
90%This is a well-structured description with excellent trigger term coverage and clear disambiguation from related skills. The main weakness is that the 'what' portion could be more specific about concrete capabilities beyond the general 'plan, design, or implement.' The description effectively prioritizes when-to-use guidance, which is valuable for skill selection.
Suggestions
Add 2-3 specific concrete actions to strengthen specificity, e.g., 'create test variants, define success metrics, calculate required sample sizes, analyze experiment results'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Names the domain (A/B testing/experiments) and mentions actions like 'plan, design, or implement,' but doesn't list specific concrete capabilities like 'create test variants, calculate sample sizes, analyze results.' | 2 / 3 |
Completeness | Clearly answers both what (plan, design, implement A/B tests/experiments) and when (explicit 'Use when' equivalent at the start plus extensive trigger terms). Also includes helpful disambiguation pointing to analytics-tracking for related but distinct needs. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'A/B test,' 'split test,' 'experiment,' 'test this change,' 'variant copy,' 'multivariate test,' 'hypothesis' - these are all terms users naturally use when needing this skill. | 3 / 3 |
Distinctiveness Conflict Risk | Clear niche with distinct triggers specific to experimentation. The explicit disambiguation ('For tracking implementation, see analytics-tracking') actively reduces conflict risk with related skills. | 3 / 3 |
Total | 11 / 12 Passed |
Reviewed
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.