Audit and improve skill collections with a 9-dimension scoring framework (Knowledge Delta, Mindset, Anti-Patterns, Specification Compliance, Progressive Disclosure, Freedom Calibration, Pattern Recognition, Practical Usability, Eval Validation), duplication detection, remediation planning, baseline comparison, and CI quality gates; use when evaluating skill quality, generating remediation plans, detecting duplicates, validating artifact conventions, or enforcing publication thresholds.
93
89%
Does it follow best practices?
Impact
99%
1.26xAverage score across 5 eval scenarios
Passed
No known issues
Navigation hub for evaluating, maintaining, and improving skill quality with 9-dimension framework scoring.
Build once, then audit:
Build once:
bun run build:skill-auditorRun audits:
# Single skill
skill-auditor evaluate <domain>/<skill-name> --json --store
# Batch with grade gate
skill-auditor batch <skill1> <skill2> --fail-below B --storeskill-auditor evaluate <skill> --json --storeremediation-plan.md and focus on the lowest-scoring dimensionEnsure you review Detailed Anti-Patterns for all WHY/BAD/GOOD failure modes including agent name references and D4 heading rules.
Remediation workflow:
skill-auditor evaluate documentation/markdown-authoring --json --store
# Score: 98/140 (C+) -> review remediation-plan.md -> fix -> re-audit -> 128/140 (A)PR-scoped triage:
skills=$(git diff --name-only origin/main | grep "skills/.*/SKILL.md" | sed 's|skills/||;s|/SKILL.md||' | tr '\n' ' ')
skill-auditor batch $skills --fail-below B --storeAudit all skills:
skill-auditor batch $(find skills -name "SKILL.md" | sed 's|skills/||;s|/SKILL.md||' | tr '\n' ' ')See Audit Workflow Examples for input/output pairs and CI quality gate examples.
skill-auditor evaluate agentic-harness/skill-quality-auditor --json
# Expected: A grade, total >= 126/140| Topic | Reference | When to Use |
|---|---|---|
| Per-dimension criteria and bonus rules | Dimensions | Evaluating any dimension or understanding the rubric |
| Score thresholds and grade bands | Scoring Rubric | Calculating a total score or assigning a grade |
| A-grade checklist and red flags | Quality Standards | Targeting A-grade or reviewing blockers |
| Trigger pattern density and keyword analysis | Pattern Recognition | Scoring D7 or improving description keywords |
| Canonical SKILL.md structure and References table standard | SKILL Template | Authoring or refactoring a skill |
| Topic | Reference | When to Use |
|---|---|---|
| CI gate configuration and batch pass/fail logic | Quality Thresholds | Setting up CI quality gates |
| NEVER/WHY/BAD/GOOD failure modes per dimension | Anti-Patterns | Explaining low scores or writing remediation guidance |
| T-shirt sizing and remediation roadmaps | Remediation Planning | Writing a remediation plan for a C/D-grade skill |
| Deduplication workflow and aggregation guidance | Duplication Detection | Detecting skill overlap or planning aggregations |
skill-auditor evaluate/batch usage and output formats | Scripts Workflow | Running audits from the command line |
| Registry publication gates and tessl compliance checks | Tessl Compliance | Preparing a skill for public registry submission |