Audit and improve skill collections with a 9-dimension scoring framework (Knowledge Delta, Mindset, Anti-Patterns, Specification Compliance, Progressive Disclosure, Freedom Calibration, Pattern Recognition, Practical Usability, Eval Validation), duplication detection, remediation planning, baseline comparison, and CI quality gates; use when evaluating skill quality, generating remediation plans, detecting duplicates, validating artifact conventions, or enforcing publication thresholds.
93
89%
Does it follow best practices?
Impact
99%
1.26xAverage score across 5 eval scenarios
Passed
No known issues
Detailed scoring methodology for the 9-dimension quality framework. Use this to understand how scores are calculated and ensure consistent evaluation.
Total Possible Score: 140 points
Passing Grade: 105 points (75%)
A-Grade Target: 126 points (90%)
Perfect Score: 140 points (100%)
| Score | Criteria | Redundancy Level |
|---|---|---|
| 18-20 | Pure expert knowledge | <5% |
| 15-17 | Mostly expert | 5-15% |
| 12-14 | Acceptable balance | 15-30% |
| 9-11 | Needs improvement | 30-50% |
| 0-8 | Failing | >50% |
Evaluation Method:
| Score | Criteria |
|---|---|
| 13-15 | Clear mindset + detailed procedures + when/when-not |
| 10-12 | Has most elements, minor gaps |
| 7-9 | Missing key element |
| 0-6 | Generic or absent |
Component Breakdown:
| Score | Criteria |
|---|---|
| 13-15 | NEVER lists + concrete examples + consequences |
| 10-12 | Has most elements |
| 7-9 | Generic warnings |
| 0-6 | Missing or weak |
Component Breakdown:
| Score | Criteria |
|---|---|
| 13-15 | Perfect spec compliance |
| 10-12 | Minor issues |
| 7-9 | Missing key elements |
| 0-6 | Non-compliant |
Component Breakdown:
Portability Requirements:
.opencode/, .claude/, .cursor/): 1 pointscripts/, references/): 1 pointBonus Points (each independent, up to +2 total):
scripts/)## References, last H2, Markdown table with Topic | Reference | When to Use columns, Reference column cells are markdown links)
| Score | Criteria |
|---|---|
| 13-15 | Navigation hub + references/ + categories |
| 10-12 | Some organization, could improve |
| 7-9 | Everything frontloaded, >300 lines |
| 0-6 | No structure, >500 lines |
Component Breakdown:
| Score | Criteria |
|---|---|
| 13-15 | Appropriate for skill type |
| 10-12 | Slightly too rigid or loose |
| 7-9 | Mismatched calibration |
| 0-6 | Completely wrong |
Calibration Types:
| Score | Criteria |
|---|---|
| 9-10 | Rich keywords, comprehensive triggers |
| 7-8 | Good keywords, could expand |
| 5-6 | Basic keywords |
| 0-4 | Missing or poor |
Evaluation Method:
| Score | Criteria |
|---|---|
| 13-15 | Concrete + runnable + clear |
| 10-12 | Most examples good |
| 7-9 | Some weak examples |
| 0-6 | Abstract or missing |
Component Breakdown:
| Score | Criteria |
|---|---|
| 17-20 | Complete evals, >=80% coverage, >=3 valid scenarios |
| 13-16 | Evals present, partial coverage |
| 7-12 | Evals directory exists, missing key files |
| 0-6 | Minimal or no eval structure |
Component Breakdown:
Enrichment: When instructions.json exists, D1 and D3 scores are enriched with instruction classification data (why_given distribution for D1, anti-pattern instruction count for D3).
| Grade | Score Range | Interpretation |
|---|---|---|
| A+ | 133-140 | Exceptional quality |
| A | 126-132 | Meets all standards |
| B+ | 119-125 | Strong, minor improvements |
| B | 112-118 | Good, some gaps |
| C+ | 105-111 | Acceptable, needs work |
| C | 98-104 | Below standard |
| D | 91-97 | Significant issues |
| F | 0-90 | Failing |
Read the entire skill, including all references if present.
Apply rubric to each of 9 dimensions independently.
Sum all 9 dimension scores for total out of 140.
Map total score to grade using grade assignment table.
For scores below A-grade, identify specific improvements needed.
High Knowledge Delta, Low Usability (18, 10): Expert content but lacks examples
Low Knowledge Delta, High Usability (10, 14): Tutorial-heavy, needs expert focus
Perfect Spec, Poor Content (15, 8): Great frontmatter, weak body
Balanced Scores (12-13 each): Consistent but not exceptional
framework-dimensions.md - Dimension definitionsframework-quality-standards.md - A-grade requirementsassets
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
references
scripts