Audit and improve skill collections with an 8-dimension scoring framework, duplication detection, remediation planning, and CI quality gates; use when evaluating skill quality, generating remediation plans, validating report format, or enforcing repository-wide skill artifact conventions.
97
100%
Does it follow best practices?
Impact
93%
1.32xAverage score across 5 eval scenarios
Passed
No known issues
8-dimension skill evaluation
8-dimension framework
16%
100%
Knowledge Delta assessment
70%
100%
120-point scoring system
0%
100%
A-grade target
0%
100%
Expert vs redundant classification
80%
100%
Missing expert knowledge
83%
100%
Specification compliance check
50%
100%
Progressive disclosure evaluation
42%
100%
Numerical scoring breakdown
90%
100%
Mindset principles
40%
60%
Self-audit awareness
0%
0%
Actionable recommendations
100%
100%
Audit workflow automation
Audit script execution
50%
83%
JSON output format
62%
87%
Baseline comparison
91%
91%
Directory structure organization
20%
60%
NEVER skip baseline rule
71%
100%
Score threshold application
12%
100%
Grade assignment
100%
100%
Trend analysis
88%
100%
PR workflow consideration
75%
100%
Skill artifact validation
0%
33%
Consistency checks
0%
83%
Reproducible audit process
100%
100%
Remediation plan creation
Executive summary format
80%
100%
Critical issues table
90%
100%
Phase-based organization
100%
100%
Remediation script usage
70%
50%
Schema validation
50%
80%
NEVER validation rule
83%
91%
Specific file modifications
100%
100%
Success criteria metrics
100%
100%
T-shirt sizing effort
50%
100%
Code block escaping
50%
25%
Honest quality rating
0%
100%
Anti-pattern documentation
NEVER statements
93%
100%
WHY explanations
91%
100%
Concrete examples
93%
100%
Side-by-side comparisons
66%
100%
Consequence descriptions
93%
100%
Strong language usage
87%
100%
Real-world scenarios
90%
100%
Security vulnerability focus
100%
100%
Anti-pattern organization
100%
100%
Skill consolidation analysis
Similarity percentage calculation
100%
100%
20% threshold application
16%
100%
35% critical threshold
0%
80%
Text similarity analysis
100%
100%
NEVER aggregate low-similarity
100%
100%
Domain fit evaluation
100%
100%
Consolidation recommendations
100%
100%
Pairwise comparison
100%
100%
Structural analysis
66%
100%