A comprehensive verification system for Claude Code sessions.
51
38%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./.agents/skills/verification-loop/SKILL.mdQuality
Discovery
0%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description is critically underspecified. It reads as a vague label rather than a functional description, providing no concrete actions, no trigger terms, and no guidance on when to use the skill. It would be nearly impossible for Claude to correctly select this skill from a pool of alternatives.
Suggestions
Specify the concrete actions this skill performs (e.g., 'Validates code output, checks for regressions, verifies test results') instead of the vague 'comprehensive verification system'.
Add an explicit 'Use when...' clause with natural trigger terms (e.g., 'Use when the user asks to verify, validate, or check the correctness of code changes, test outputs, or session results').
Clarify what 'Claude Code sessions' means in practical terms and what distinguishes this verification from general testing or code review skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague, abstract language ('comprehensive verification system') without naming any concrete actions. It does not specify what is being verified or how. | 1 / 3 |
Completeness | The description fails to clearly answer 'what does this do' (what specifically is verified?) and completely lacks any 'when should Claude use it' guidance. There is no 'Use when...' clause or equivalent. | 1 / 3 |
Trigger Term Quality | The only potentially relevant term is 'verification' and 'Claude Code sessions,' but these are not natural keywords a user would say. A user would more likely say 'check,' 'validate,' 'test,' or describe a specific verification task. | 1 / 3 |
Distinctiveness Conflict Risk | 'Verification system' is extremely generic and could overlap with testing skills, linting skills, code review skills, or any quality assurance-related skill. There are no distinct triggers to differentiate it. | 1 / 3 |
Total | 4 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a solid, actionable verification workflow skill with clear sequencing and executable commands across six well-defined phases. Its main weaknesses are some unnecessary sections (Continuous Mode, Integration with Hooks) that add little value, and the lack of any bundle structure to offload language-specific variations. The stop-gates and structured output format are strong points that make this practically useful.
Suggestions
Remove or significantly trim the 'Continuous Mode' and 'Integration with Hooks' sections, which add padding without actionable value.
Consider splitting language-specific commands (JS/TS vs Python) into separate referenced files to reduce inline noise and improve progressive disclosure.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably structured but includes some unnecessary padding like the 'Continuous Mode' section with vague advice ('set a mental checkpoint'), the 'Integration with Hooks' section that adds little value, and the 'When to Use' section which is somewhat obvious. The core verification phases are fairly lean though. | 2 / 3 |
Actionability | Each phase provides concrete, executable bash commands with specific tools (npm, tsc, pyright, ruff, grep). The output format template is copy-paste ready, and the commands include practical touches like piping to tail/head for manageable output. | 3 / 3 |
Workflow Clarity | The six phases are clearly sequenced with explicit stop-gates ('If build fails, STOP and fix before continuing'). The workflow follows a logical dependency chain (build → types → lint → tests → security → diff review) with a structured output report that serves as a verification checkpoint. | 3 / 3 |
Progressive Disclosure | The content is entirely inline in a single file with no references to supporting files, which is acceptable for a skill of this size. However, the security scan patterns, language-specific commands, and the output template could benefit from being split out or better organized with clear navigation for different project types. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
841beea
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.