Audit and improve skill collections with a 9-dimension scoring framework (Knowledge Delta, Mindset, Anti-Patterns, Specification Compliance, Progressive Disclosure, Freedom Calibration, Pattern Recognition, Practical Usability, Eval Validation), duplication detection, remediation planning, baseline comparison, and CI quality gates; use when evaluating skill quality, generating remediation plans, detecting duplicates, validating artifact conventions, or enforcing publication thresholds.
93
89%
Does it follow best practices?
Impact
99%
1.26x
Average score across 5 eval scenarios
Passed
No known issues
skill-auditor evaluate infrastructure/terraform-generator --json --store

Output:
{
  "skill": "infrastructure/terraform-generator",
  "grade": "B+",
  "total": 122,
  "dimensions": {
    "D1_knowledge_delta": 18,
    "D2_mindset_procedures": 13,
    "D3_anti_pattern_quality": 12,
    "D4_specification_compliance": 14,
    "D5_progressive_disclosure": 13,
    "D6_freedom_calibration": 12,
    "D7_pattern_recognition": 8,
    "D8_practical_usability": 14,
    "D9_eval_validation": 18
  }
}

Next steps based on output:
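One quick check on output like this is to confirm that the nine dimension scores add up to the reported `total` (here 122). The following is a minimal sketch, not part of skill-auditor itself: the function names `sum_dimensions`, `reported_total`, and `check_audit_total` are hypothetical, and the parsing assumes the flat `"D<n>_<name>": <integer>` layout shown in the sample output. It uses only `grep` and `awk`; a JSON-aware tool would be more robust against layout changes.

```shell
#!/usr/bin/env bash
# Sketch: verify that the per-dimension scores in an audit JSON sum to "total".
# Assumption: keys look like "D1_knowledge_delta": 18, as in the sample output.

sum_dimensions() {
  # Extract every "D<n>_<name>": <number> pair and add up the numbers.
  grep -oE '"D[0-9]+_[a-z_]+": *[0-9]+' "$1" \
    | awk -F': *' '{ sum += $2 } END { print sum + 0 }'
}

reported_total() {
  # Pull the integer that follows "total":
  grep -oE '"total": *[0-9]+' "$1" | grep -oE '[0-9]+$'
}

check_audit_total() {
  # Compare the computed sum with the reported total; return 1 on mismatch.
  local file=$1 sum total
  sum=$(sum_dimensions "$file")
  total=$(reported_total "$file")
  if [ "$sum" -eq "$total" ]; then
    echo "ok: dimensions sum to $total"
  else
    echo "mismatch: dimensions sum to $sum, total says $total" >&2
    return 1
  fi
}
```

Calling `check_audit_total path/to/audit.json` on a stored audit file would print the `ok` line when the scores are consistent and return a non-zero status otherwise.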
skill-auditor batch \
  infrastructure/terraform-generator \
  ci-cd/github-actions-generator \
  development/bash-script-generator \
  --store
# Compare against previous baseline
diff <(cat .context/audits/infrastructure/terraform-generator/2025-12-01/audit.json) \
     <(cat .context/audits/infrastructure/terraform-generator/$(date +%Y-%m-%d)/audit.json)

skill-auditor evaluate documentation/markdown-authoring --json --store
# Score: 98/140 (C+) -> blocked from publishing
cat .context/audits/documentation/markdown-authoring/$(date +%Y-%m-%d)/remediation-plan.md
# Outputs prioritized fixes:
# 1. [HIGH] D3 Anti-Patterns: Add 5 anti-patterns with BAD/GOOD examples (S effort)
# 2. [HIGH] D5 Progressive Disclosure: Restructure Quick Start -> Guide -> Advanced (M effort)
# 3. [MED] D7 Pattern Recognition: Expand description keywords (S effort)
# After applying fixes:
skill-auditor evaluate documentation/markdown-authoring --json --store
# Score: 128/140 (A) -> publication-ready

# .github/workflows/skill-quality.yml
- name: Build skill-auditor
  run: bun run build:skill-auditor
- name: Audit changed skills
  run: |
    skills=$(git diff --name-only origin/main | grep "skills/.*/SKILL.md" | sed 's|skills/||;s|/SKILL.md||' | tr '\n' ' ')
    skill-auditor batch $skills --fail-below B --store
    # Exits 1 if any skill scores below B-grade (112/140)

assets
evals
  scenario-1
  scenario-2
  scenario-3
  scenario-4
  scenario-5
references
scripts
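
The "Audit changed skills" CI step earlier in this section derives skill names from changed `SKILL.md` paths. The sketch below shows that path handling as a reusable function and adds a guard for the case where no skill changed. Some assumptions: `changed_skills` and `audit_changed` are hypothetical names, the grep pattern is anchored on the assumption that skills live under a top-level `skills/` directory, and how `skill-auditor batch` behaves when given no skills is not stated in the source, so the guard simply skips the call.

```shell
#!/usr/bin/env bash
# Sketch: turn changed file paths into skill identifiers, mirroring the
# grep/sed pipeline in the workflow step. Reads one path per line on stdin.
changed_skills() {
  grep -E '^skills/.+/SKILL\.md$' \
    | sed -e 's|^skills/||' -e 's|/SKILL\.md$||' \
    | sort -u
}

# Hypothetical wrapper: run the batch audit only when at least one skill changed.
audit_changed() {
  local skills
  skills=$(git diff --name-only origin/main | changed_skills)
  if [ -z "$skills" ]; then
    echo "no changed skills; skipping audit"
    return 0
  fi
  # Word splitting is intentional here: one argument per skill.
  # shellcheck disable=SC2086
  skill-auditor batch $skills --fail-below B --store
}
```

For example, piping the two paths `skills/ci-cd/github-actions-generator/SKILL.md` and `README.md` into `changed_skills` yields only `ci-cd/github-actions-generator`, since files that are not a `SKILL.md` under `skills/` are filtered out.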