ab-test-setup

Structured guide for setting up A/B tests with mandatory gates for hypothesis, metrics, and execution readiness.

Quality: 47% (Does it follow best practices?)

Impact: 100%, 1.01x (Average score across 3 eval scenarios)

Security: Passed (No known issues)

Fix and improve this skill with Tessl

tessl review fix ./docs/v19.7/configuration/agent/skills_external/antigravity-awesome-skills-main/skills/ab-test-setup/SKILL.md

Evaluation results

100%

Checkout Flow Test — Results Review

Test results analysis and record

Criteria (Baseline / With context)

No-ship recommendation: 100%
Guardrail failure cited: 100%
Record: hypothesis: 100%
Record: variants: 100%
Record: metrics: 100%
Record: sample size vs achieved: 100%
Record: results: 100%
Record: decision: 100%
Record: learnings: 66% / 100%
Record: follow-up ideas: 100%
No over-generalization: 100%
Stat vs business separated: 100%
External factors documented: 100%
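Several of the criteria above ("No-ship recommendation", "Guardrail failure cited", "Stat vs business separated") describe a decision rule. A minimal sketch of one way such a rule could look is given here; the `Verdict` type, the `review_results` function, its thresholds, and the "refund rate" guardrail are all hypothetical names chosen for illustration and are not taken from the skill itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Verdict:
    statistically_significant: bool
    business_significant: bool
    guardrail_failures: List[str]
    decision: str  # "ship" or "no-ship"


def review_results(p_value: float, observed_lift: float,
                   min_worthwhile_lift: float,
                   guardrail_failures: List[str],
                   alpha: float = 0.05) -> Verdict:
    """Answer the statistical and business questions separately, then decide."""
    # Statistical question: is the effect distinguishable from noise?
    stat_sig = p_value < alpha
    # Business question: is the effect large enough to be worth shipping?
    biz_sig = observed_lift >= min_worthwhile_lift
    # Assumption: any guardrail failure blocks shipping, even on a primary-metric win.
    ship = stat_sig and biz_sig and not guardrail_failures
    return Verdict(stat_sig, biz_sig, list(guardrail_failures),
                   "ship" if ship else "no-ship")


# Hypothetical checkout test: the primary metric wins but a guardrail failed.
verdict = review_results(p_value=0.01, observed_lift=0.04,
                         min_worthwhile_lift=0.02,
                         guardrail_failures=["refund rate"])
print(verdict.decision, verdict.guardrail_failures)
```

Keeping the two significance flags as separate fields makes it harder for a written record to blur "the effect is real" with "the effect matters".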

Growth Team Test Proposal Assessment

Refusal conditions and design flaw identification

Criteria (Baseline / With context)

Refuses to proceed: 100%
Multiple variables flagged: 100%
Unknown baseline flagged: 100%
Traffic insufficiency flagged: 100%
Undefined primary metric flagged: 100%
Peeking risk flagged: 100%
Hypothesis quality issues: 87% / 100%
Next steps recommended: 100%
Single test recommendation: 100%
No false approval: 100%
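The refusal criteria in this scenario (multiple variables, unknown baseline, insufficient traffic, undefined primary metric, peeking risk) read like a pre-launch gate. The sketch below shows one possible shape for such a gate; the `Proposal` fields and the `blockers` function are hypothetical and only illustrate the idea of refusing to proceed while any blocker remains.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Proposal:
    variables_changed: int
    primary_metric: Optional[str]
    baseline_rate: Optional[float]
    required_sample_per_variant: Optional[int]
    expected_traffic_per_variant: Optional[int]
    stopping_rule_fixed: bool  # False means results may be peeked at and acted on early


def blockers(p: Proposal) -> List[str]:
    """Return every design flaw that should stop the test from launching."""
    found = []
    if p.variables_changed > 1:
        found.append("multiple variables")
    if p.primary_metric is None:
        found.append("undefined primary metric")
    if p.baseline_rate is None:
        found.append("unknown baseline")
    if (p.required_sample_per_variant is None
            or p.expected_traffic_per_variant is None
            or p.expected_traffic_per_variant < p.required_sample_per_variant):
        found.append("insufficient traffic")
    if not p.stopping_rule_fixed:
        found.append("peeking risk")
    return found


# Hypothetical flawed proposal: every gate fails, so the test should not proceed.
flawed = Proposal(variables_changed=3, primary_metric=None, baseline_rate=None,
                  required_sample_per_variant=None,
                  expected_traffic_per_variant=500, stopping_rule_fixed=False)
flawed_blockers = blockers(flawed)
print("proceed" if not flawed_blockers else f"refuse: {flawed_blockers}")
```

Returning the full list rather than stopping at the first failure lets the reviewer recommend next steps for every flaw at once.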

Onboarding Completion Test Plan

Full A/B test design document

Criteria (Baseline / With context)

Hypothesis: observation: 100%
Hypothesis: single change: 100%
Hypothesis: directional expectation: 100%
Hypothesis: defined audience: 100%
Hypothesis: MDE specified: 100%
Test type A/B: 100%
Single primary metric: 100%
Guardrail metrics defined: 100%
Significance level 95%: 100%
Statistical power 80%: 100%
Sample size per variant: 100%
Assumptions listed: 100%
Execution readiness checklist: 100%
No implementation steps: 100%
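The criteria above mention a 95% significance level, 80% statistical power, a minimum detectable effect (MDE), and a sample size per variant. As a rough illustration of how these quantities relate, the sketch below uses the standard normal-approximation formula for comparing two proportions. The function name, the choice of a relative MDE, and the example baseline of 10% are assumptions made for illustration; the skill may compute sample size differently.

```python
from math import ceil, sqrt
from statistics import NormalDist


def sample_size_per_variant(baseline_rate: float, relative_mde: float,
                            alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate users needed per variant for a two-sided two-proportion test."""
    p1 = baseline_rate
    p2 = baseline_rate * (1 + relative_mde)  # rate the variant must reach
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for 95% significance
    z_beta = NormalDist().inv_cdf(power)           # about 0.84 for 80% power
    p_bar = (p1 + p2) / 2
    numerator = (z_alpha * sqrt(2 * p_bar * (1 - p_bar))
                 + z_beta * sqrt(p1 * (1 - p1) + p2 * (1 - p2))) ** 2
    return ceil(numerator / (p2 - p1) ** 2)


# Hypothetical inputs: 10% baseline completion rate, 10% relative MDE.
n = sample_size_per_variant(baseline_rate=0.10, relative_mde=0.10)
print(f"users needed per variant: {n}")
```

Halving the MDE roughly quadruples the required sample, which is why a test plan needs the MDE stated before traffic sufficiency can be judged.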

Repository: duclm1x1/Dive-Ai
Commit: 20ba150

Evaluated: 4 months ago
Agent: Claude Code
Model: Claude Sonnet 4.6

