CtrlK
BlogDocsLog inGet started
Tessl Logo

ab-test-analyzer

Ab Test Analyzer - Auto-activating skill for Data Analytics. Triggers on: ab test analyzer, ab test analyzer Part of the Data Analytics skill category.

36

0.98x

Quality

3%

Does it follow best practices?

Impact

98%

0.98x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./planned-skills/generated/12-data-analytics/ab-test-analyzer/SKILL.md
SKILL.md
Quality
Evals
Security

Evaluation results

96%

-4%

Checkout Experiment Analysis

Statistical significance testing

Criteria
Without context
With context

Statistical test used

100%

100%

P-value computed

100%

100%

Confidence interval reported

100%

60%

Conversion rates calculated

100%

100%

Sample sizes reported

100%

100%

Step-by-step structure

100%

100%

Go/no-go recommendation

100%

100%

Script is runnable

100%

100%

Output captured

100%

100%

Significance threshold stated

100%

100%

Methodology explained

100%

100%

Without context: $0.3832 · 1m 46s · 23 turns · 24 in / 5,765 out tokens

With context: $0.5528 · 2m 3s · 29 turns · 28 in / 7,354 out tokens

100%

Feature Adoption Experiment — Query and Visualize Results

SQL querying and data visualization

Criteria
Without context
With context

SQL conversion rate query

100%

100%

SQL time-to-adoption query

100%

100%

SQL daily trend query

100%

100%

Conversion rate chart produced

100%

100%

Daily trend chart produced

100%

100%

Script reads SQLite database

100%

100%

Tabular results printed

100%

100%

Step-by-step structure

100%

100%

Findings written

100%

100%

Both variants compared

100%

100%

Script is runnable

100%

100%

Without context: $0.5110 · 1m 56s · 27 turns · 27 in / 7,239 out tokens

With context: $0.5614 · 2m 13s · 31 turns · 62 in / 7,120 out tokens

99%

Pricing Experiment — Executive Summary Report

Business intelligence reporting

Criteria
Without context
With context

Executive summary present

100%

100%

Key metrics reported

100%

100%

Statistical significance addressed

100%

100%

Business lift quantified

100%

100%

Recommendation stated

100%

100%

Risks and caveats section

100%

100%

Methodology described

100%

100%

Validation script checks non-zero samples

100%

100%

Validation script checks rate bounds

87%

87%

Validation script checks lift consistency

100%

100%

Validation output captured

100%

100%

Both business metrics covered

100%

100%

Without context: $0.3496 · 1m 47s · 14 turns · 15 in / 6,509 out tokens

With context: $0.5972 · 2m 33s · 27 turns · 59 in / 9,335 out tokens

Repository
jeremylongshore/claude-code-plugins-plus-skills
Evaluated
Agent
Claude Code
Model
Claude Sonnet 4.6

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.