CtrlK
BlogDocsLog inGet started
Tessl Logo

pantheon-ai/skill-quality-auditor

Audit and improve skill collections with a 9-dimension scoring framework (Knowledge Delta, Mindset, Anti-Patterns, Specification Compliance, Progressive Disclosure, Freedom Calibration, Pattern Recognition, Practical Usability, Eval Validation), duplication detection, remediation planning, baseline comparison, and CI quality gates; use when evaluating skill quality, generating remediation plans, detecting duplicates, validating artifact conventions, or enforcing publication thresholds.

93

1.26x
Quality

89%

Does it follow best practices?

Impact

99%

1.26x

Average score across 5 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Evaluation results

100%

38%

Task: Skill Quality Assessment

Criteria
Without context
With context

9-dimension framework applied

16%

100%

Redundant content identified

93%

100%

Knowledge Delta scored low

50%

100%

Numerical scores per dimension

100%

100%

A-grade threshold referenced

0%

100%

Actionable remediation steps

85%

100%

Specification compliance issues noted

62%

100%

Progressive disclosure gap noted

37%

100%

100%

45%

Task: Skills Collection Quality Audit

Criteria
Without context
With context

skill-auditor batch used

0%

100%

--store flag used

0%

100%

--json flag used

0%

100%

Baseline comparison performed

70%

100%

Grade thresholds applied

100%

100%

New skills handled

100%

100%

Trend analysis present

100%

100%

Reproducible commands documented

40%

100%

99%

15%

Task: Skill Improvement Planning

Criteria
Without context
With context

Executive summary present

100%

100%

Critical issues table

100%

100%

Phase-based organisation

93%

100%

Specific file changes

94%

100%

Measurable success criteria

100%

100%

S/M/L effort sizing

60%

100%

Verification commands

66%

91%

A-grade target achievable

25%

100%

98%

-2%

Task: Anti-Pattern Documentation Enhancement

Criteria
Without context
With context

NEVER statements used

100%

86%

WHY: explanations present

100%

100%

BAD code examples

100%

100%

GOOD code examples

100%

100%

All 3 original issues covered

100%

100%

At least 4 anti-patterns total

100%

100%

Score impact explained

100%

100%

100%

11%

Task: Skills Portfolio Duplication Analysis

Criteria
Without context
With context

Pairwise comparison performed

100%

100%

Similarity percentages calculated

100%

100%

20% threshold applied

100%

100%

35% threshold applied

100%

100%

NEVER wrong-domain aggregation

60%

100%

Navigation Hub pattern referenced

100%

100%

duplication-report.json valid

100%

100%

Specific keep-separate justification

37%

100%

Evaluated
Agent
Claude
Model
Claude Sonnet 4.6

Table of Contents