A comprehensive verification system for Claude Code sessions.
40
51%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
0%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description is critically underspecified. It fails to explain what specific actions the skill performs, what 'verification' means in this context, and when Claude should select it. The buzzword 'comprehensive' adds no informational value, and the lack of concrete triggers or use cases makes it nearly impossible to distinguish from other skills.
Suggestions
Replace 'comprehensive verification system' with specific concrete actions, e.g., 'Validates tool outputs, checks code execution results, and confirms task completion in Claude Code sessions.'
Add an explicit 'Use when...' clause with natural trigger terms, e.g., 'Use when the user asks to verify, validate, or double-check results from a Claude Code session.'
Clarify what 'verification' entails to distinguish this skill from testing, linting, or code review skills—specify the exact scope and unique niche.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague, abstract language ('comprehensive verification system') without naming any concrete actions. It does not specify what is being verified or how. | 1 / 3 |
Completeness | The description weakly addresses 'what' (a verification system) but provides no 'when' clause or explicit trigger guidance. Both dimensions are very weak. | 1 / 3 |
Trigger Term Quality | The only potentially relevant terms are 'verification' and 'Claude Code sessions,' which are technical jargon unlikely to match natural user queries. No common user-facing trigger terms are present. | 1 / 3 |
Distinctiveness Conflict Risk | The description is extremely generic—'comprehensive verification system' could overlap with testing, linting, code review, validation, or many other skills. There is nothing to distinguish it clearly. | 1 / 3 |
Total | 4 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a solid, actionable verification workflow skill with clear sequencing and concrete commands. Its main strengths are the executable bash commands for each phase and the explicit stop-gates between phases. Weaknesses include some unnecessary sections (Continuous Mode, Integration with Hooks) that add little actionable value, and the content could be slightly more concise by removing obvious guidance.
Suggestions
Remove or significantly trim the 'Continuous Mode' and 'Integration with Hooks' sections, which add padding without actionable value.
Consider extracting language-specific command variants (JS vs Python) into a reference table or separate file to reduce inline branching.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably structured but includes some unnecessary padding like the 'Continuous Mode' section with vague advice ('set a mental checkpoint'), the 'Integration with Hooks' section that adds little value, and the 'When to Use' section which is somewhat obvious. The core verification phases are fairly lean though. | 2 / 3 |
Actionability | Each phase provides concrete, executable bash commands with specific tools (npm, tsc, pyright, ruff, grep). The output format template is copy-paste ready, and the commands include practical touches like piping to tail/head for manageable output. | 3 / 3 |
Workflow Clarity | The six phases are clearly sequenced with explicit stop-gates ('If build fails, STOP and fix before continuing', 'Fix critical ones before continuing'). The workflow follows a logical dependency chain (build → types → lint → tests → security → diff review) with a final verification report that aggregates pass/fail status and determines PR readiness. | 3 / 3 |
Progressive Disclosure | The content is entirely inline in a single file with no references to supporting files, which is acceptable for a skill of this size. However, the security scan patterns, language-specific commands, and output format template could benefit from being split out or organized with clearer navigation for a skill this long (~100 lines of substantive content). | 2 / 3 |
Total | 10 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
Reviewed
Table of Contents