Cross-validate CLI flags, docs, tests, and targets for consistency across the codebase. Use this skill whenever the user asks to: audit the codebase, check for consistency issues, find undocumented flags, verify test coverage, validate targets.yaml, check handler split conventions, or verify oplog instrumentation. This is a read-only audit — it reports issues but never modifies files. Use after large refactors, before releases, or whenever you suspect docs/code/tests have drifted out of sync.
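A minimal sketch of one check the description names, finding undocumented flags. It assumes flags appear as `--long-name` tokens in both the source and the docs. The function name `undocumented_flags` and the regular expression are hypothetical illustrations, not the skill's actual commands. Like the audit itself, it only reports and never modifies anything.

```python
import re

# Assumption: CLI flags are written as --lowercase-with-dashes tokens.
FLAG_RE = re.compile(r"(?<![\w-])--[a-z][a-z0-9-]*")


def undocumented_flags(source_text: str, docs_text: str) -> list[str]:
    """Return flags that occur in source_text but never in docs_text, sorted."""
    in_code = set(FLAG_RE.findall(source_text))
    in_docs = set(FLAG_RE.findall(docs_text))
    return sorted(in_code - in_docs)


if __name__ == "__main__":
    code = 'parser.add_argument("--dry-run")\nparser.add_argument("--verbose")'
    docs = "Use `--verbose` for more output."
    print(undocumented_flags(code, docs))
```

Swapping the two arguments gives the reverse check: flags that are documented but no longer present in the code.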
85
82%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
100%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly articulates specific capabilities (cross-validating CLI flags, docs, tests, targets), provides comprehensive trigger terms that developers would naturally use, and explicitly states both what it does and when to use it. The description also helpfully clarifies its read-only nature and provides contextual usage scenarios (after refactors, before releases).
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific concrete actions: 'Cross-validate CLI flags, docs, tests, and targets', 'audit the codebase', 'check for consistency issues', 'find undocumented flags', 'verify test coverage', 'validate targets.yaml', 'check handler split conventions', 'verify oplog instrumentation'. | 3 / 3 |
| Completeness | Clearly answers both what ('Cross-validate CLI flags, docs, tests, and targets for consistency') AND when ('Use this skill whenever the user asks to: audit the codebase...', 'Use after large refactors, before releases, or whenever you suspect docs/code/tests have drifted out of sync'). | 3 / 3 |
| Trigger Term Quality | Excellent coverage of natural terms users would say: 'audit', 'consistency issues', 'undocumented flags', 'test coverage', 'targets.yaml', 'refactors', 'releases', 'docs/code/tests have drifted out of sync'. These are terms developers naturally use. | 3 / 3 |
| Distinctiveness Conflict Risk | Very specific niche focused on CLI flag/docs/tests consistency validation with distinct triggers like 'targets.yaml', 'handler split conventions', 'oplog instrumentation'. The read-only audit nature and specific use cases make it unlikely to conflict with other skills. | 3 / 3 |
| Total | | 12 / 12 Passed |
Implementation
64%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a solid audit skill with highly actionable bash commands and clear reporting formats. The main weaknesses are the duplicated Output Format section, the inconsistency between '4 dimensions' and the actual 7 dimensions present, and the length that could benefit from progressive disclosure to separate files.
Suggestions
- Remove the duplicate Output Format section (it appears twice, with the second version being more complete).
- Fix the inconsistency: change 'Run all 4 dimensions in parallel' to 'Run all 7 dimensions in parallel' to match the actual content.
- Consider splitting detailed dimension instructions into separate files (e.g., AUDIT-FLAGS.md, AUDIT-TESTS.md) with SKILL.md as an overview.
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is mostly efficient with clear tables and bash commands, but has some redundancy: the Output Format section appears twice with overlapping content, and some explanatory text could be tightened. | 2 / 3 |
| Actionability | Provides fully executable bash commands for each audit dimension, specific file paths to check, clear status categories with definitions, and concrete grep patterns that are copy-paste ready. | 3 / 3 |
| Workflow Clarity | The 7 audit dimensions are clearly defined with specific checks, but the skill lacks explicit sequencing guidance and validation checkpoints. The instruction to 'run all 4 dimensions in parallel' contradicts having 7 dimensions, creating confusion. | 2 / 3 |
| Progressive Disclosure | Content is well-organized with clear section headers and tables, but everything is inline in one file. For a skill this comprehensive (7 audit dimensions), some content could be split into separate reference files for each dimension. | 2 / 3 |
| Total | | 9 / 12 Passed |
Validation
90%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | 10 / 11 Passed | |
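The `frontmatter_unknown_keys` warning can be reproduced locally with a check along these lines. This is a rough sketch, not the validator's implementation: the allowed-key set below is an assumption, the function name is hypothetical, and the parser only reads top-level `key:` lines rather than full YAML.

```python
# Assumption: this allowed-key set is illustrative; the real validator's list may differ.
ALLOWED_KEYS = frozenset({"name", "description", "license", "allowed-tools", "metadata"})


def unknown_frontmatter_keys(skill_md: str, allowed=ALLOWED_KEYS) -> list[str]:
    """Return top-level frontmatter keys of a SKILL.md text that are not in `allowed`."""
    lines = skill_md.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no frontmatter block at the top of the file
    keys = []
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter of the frontmatter block
        # Skip blank lines, nested (indented) entries, comments, and list items.
        if not line or line[0].isspace() or line.startswith(("#", "-")):
            continue
        if ":" in line:
            keys.append(line.split(":", 1)[0].strip())
    return sorted(k for k in keys if k not in allowed)


if __name__ == "__main__":
    doc = (
        "---\n"
        "name: audit\n"
        "description: Cross-validate flags\n"
        "owner: platform-team\n"
        "metadata:\n"
        "  version: 1\n"
        "---\n"
        "# Audit\n"
    )
    print(unknown_frontmatter_keys(doc))
```

Following the warning's own advice, a reported key such as the hypothetical `owner` above would either be removed or nested under `metadata`.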
053ecb4