
ab-test-setup

tessl i github:coreyhaines31/marketingskills --skill ab-test-setup

When the user wants to plan, design, or implement an A/B test or experiment. Also use when the user mentions "A/B test," "split test," "experiment," "test this change," "variant copy," "multivariate test," or "hypothesis." For tracking implementation, see analytics-tracking.

Overall: 88%


Validation

81%
| Criteria | Description | Result |
| --- | --- | --- |
| metadata_version | 'metadata' field is not a dictionary | Warning |
| license_field | 'license' field is missing | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |

Total: 13 / 16 (Passed)
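
The three warnings in this section all concern the SKILL.md frontmatter. As a rough illustration only, checks of this kind could be sketched as below. The function name `check_frontmatter`, the `KNOWN_KEYS` allow-list, the sample frontmatter values, and the exact rule logic are all assumptions made for this example, not the validator's actual implementation.

```python
# Hypothetical sketch of frontmatter checks; NOT the real validator.
# KNOWN_KEYS is an assumed allow-list chosen purely for illustration.
KNOWN_KEYS = {"name", "description", "license", "metadata"}


def check_frontmatter(frontmatter: dict) -> list[tuple[str, str]]:
    """Return (criterion, message) pairs for each warning found."""
    warnings = []

    # metadata_version: if 'metadata' is present, assume it must be a mapping.
    if "metadata" in frontmatter and not isinstance(frontmatter["metadata"], dict):
        warnings.append(("metadata_version", "'metadata' field is not a dictionary"))

    # license_field: warn when no license is declared.
    if "license" not in frontmatter:
        warnings.append(("license_field", "'license' field is missing"))

    # frontmatter_unknown_keys: anything outside the assumed allow-list.
    unknown = sorted(set(frontmatter) - KNOWN_KEYS)
    if unknown:
        warnings.append(
            ("frontmatter_unknown_keys",
             "Unknown frontmatter key(s) found: " + ", ".join(unknown))
        )
    return warnings


# Hypothetical frontmatter that would trigger all three warnings.
example = {
    "name": "ab-test-setup",
    "description": "placeholder description",
    "metadata": "1.0",   # a string, not a dictionary
    "version": "1.0",    # not in the assumed allow-list
}
for criterion, message in check_frontmatter(example):
    print(f"{criterion}: {message}")
```

Under these assumptions, moving stray keys such as `version` into a `metadata` dictionary and adding a `license` entry would clear all three warnings.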

Implementation

85%

This is a well-structured, actionable skill that provides comprehensive A/B testing guidance with clear workflows and good progressive disclosure. The main weakness is some verbosity in explaining concepts Claude already understands (statistical significance, the peeking problem) and unnecessary persona framing. The concrete tables, checklists, and frameworks make this highly usable.

Suggestions

Remove the opening persona statement ('You are an expert...') and explanatory content about basic concepts like statistical significance that Claude already knows

Condense the 'Common Mistakes' section into a brief checklist rather than categorized explanations

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The content is generally efficient but includes some unnecessary framing ('You are an expert...') and explanatory text that Claude already knows (e.g., explaining what statistical significance means, what peeking is). The tables and frameworks are well-structured but could be tighter. | 2 / 3 |
| Actionability | Provides concrete, actionable guidance throughout: specific hypothesis templates, sample size tables with exact numbers, implementation checklists, and clear decision frameworks. The examples are specific and the tables provide copy-paste ready reference material. | 3 / 3 |
| Workflow Clarity | Clear sequential workflow from hypothesis through analysis with explicit checkpoints: pre-launch checklist, during-test DO/DON'T lists, and analysis checklist. The process is well-sequenced with validation steps (QA, tracking verification) before proceeding. | 3 / 3 |
| Progressive Disclosure | Well-organized with clear sections, appropriate use of tables for reference data, and explicit links to deeper content (sample-size-guide.md, test-templates.md). The skill serves as an effective overview with one-level-deep references to detailed materials. | 3 / 3 |

Total: 11 / 12 (Passed)

Activation

90%

This is a well-structured description with excellent trigger term coverage and clear disambiguation from related skills. The main weakness is that the 'what' portion could be more specific about concrete capabilities beyond the general 'plan, design, or implement.' The description effectively prioritizes when-to-use guidance, which is valuable for skill selection.

Suggestions

Add 2-3 specific concrete actions to strengthen specificity, e.g., 'create test variants, define success metrics, calculate required sample sizes, analyze experiment results'

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Names the domain (A/B testing/experiments) and mentions actions like 'plan, design, or implement,' but doesn't list specific concrete capabilities like 'create test variants, calculate sample sizes, analyze results.' | 2 / 3 |
| Completeness | Clearly answers both what (plan, design, implement A/B tests/experiments) and when (explicit 'Use when' equivalent at the start plus extensive trigger terms). Also includes helpful disambiguation pointing to analytics-tracking for related but distinct needs. | 3 / 3 |
| Trigger Term Quality | Excellent coverage of natural terms users would say: 'A/B test,' 'split test,' 'experiment,' 'test this change,' 'variant copy,' 'multivariate test,' 'hypothesis' - these are all terms users naturally use when needing this skill. | 3 / 3 |
| Distinctiveness Conflict Risk | Clear niche with distinct triggers specific to experimentation. The explicit disambiguation ('For tracking implementation, see analytics-tracking') actively reduces conflict risk with related skills. | 3 / 3 |

Total: 11 / 12 (Passed)

Reviewed

