CtrlK
BlogDocsLog inGet started
Tessl Logo

pantheon-ai/skill-quality-auditor

Audit and improve skill collections with a 9-dimension scoring framework (Knowledge Delta, Mindset, Anti-Patterns, Specification Compliance, Progressive Disclosure, Freedom Calibration, Pattern Recognition, Practical Usability, Eval Validation), duplication detection, remediation planning, baseline comparison, and CI quality gates; use when evaluating skill quality, generating remediation plans, detecting duplicates, validating artifact conventions, or enforcing publication thresholds.

93

1.26x
Quality

89%

Does it follow best practices?

Impact

99%

1.26x

Average score across 5 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

eval-format.mdreferences/

Eval Scenario Format

All skills must use evals/scenario-NN.md — one Markdown file per scenario, numbered from 01.

Required Structure

# Scenario NN: Title

## User Prompt

"Exact trigger phrase the user would type."

## Expected Behavior

1. Step the agent takes
2. Next step
3. ...

## Success Criteria

- Measurable outcome 1
- Measurable outcome 2

## Failure Conditions

- What a bad agent response looks like
- Another failure mode

All four sections are required. Success criteria must be measurable (files created, commands run, specific output verified) — never vague ("agent does well").

Quantity

Minimum 5 scenarios per skill. Target 7–9 for skills with broad trigger surfaces.

Cover:

  • Primary happy path
  • Edge cases and partial inputs
  • Failure / anti-pattern detection
  • At least one scenario where the skill should refuse or warn

File Naming

evals/scenario-01.md, evals/scenario-02.md, … evals/scenario-09.md

Zero-padded two digits. No gaps in numbering.

Tile.json

List each scenario file in the files array:

{
  "files": [
    "evals/scenario-01.md",
    "evals/scenario-02.md"
  ]
}

Non-Standard Formats (do not use)

FormatProblem
evals/*.yamlNot linkable from tile.json files; diverges from markdown-first convention
evals.md (single file)Cannot reference individual scenarios; does not scale beyond 3–4 scenarios
evals/instructions.jsonMeta-artifact from a retired eval framework; remove if present
evals/summary.jsonRetired; remove if present

SKILL.md

tile.json