CtrlK
BlogDocsLog inGet started
Tessl Logo

jbvc/verification-loop

A comprehensive verification system for Claude Code sessions.

40

Quality

51%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Discovery

0%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This description is critically underspecified. It fails to explain what specific actions the skill performs, what 'verification' means in this context, and when Claude should select it. The buzzword 'comprehensive' adds no informational value, and the lack of concrete triggers or use cases makes it nearly impossible to distinguish from other skills.

Suggestions

Replace 'comprehensive verification system' with specific concrete actions, e.g., 'Validates tool outputs, checks code execution results, and confirms task completion in Claude Code sessions.'

Add an explicit 'Use when...' clause with natural trigger terms, e.g., 'Use when the user asks to verify, validate, or double-check results from a Claude Code session.'

Clarify what 'verification' entails to distinguish this skill from testing, linting, or code review skills—specify the exact scope and unique niche.

DimensionReasoningScore

Specificity

The description uses vague, abstract language ('comprehensive verification system') without naming any concrete actions. It does not specify what is being verified or how.

1 / 3

Completeness

The description weakly addresses 'what' (a verification system) but provides no 'when' clause or explicit trigger guidance. Both dimensions are very weak.

1 / 3

Trigger Term Quality

The only potentially relevant terms are 'verification' and 'Claude Code sessions,' which are technical jargon unlikely to match natural user queries. No common user-facing trigger terms are present.

1 / 3

Distinctiveness Conflict Risk

The description is extremely generic—'comprehensive verification system' could overlap with testing, linting, code review, validation, or many other skills. There is nothing to distinguish it clearly.

1 / 3

Total

4

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid, actionable verification workflow skill with clear sequencing and concrete commands. Its main strengths are the executable bash commands for each phase and the explicit stop-gates between phases. Weaknesses include some unnecessary sections (Continuous Mode, Integration with Hooks) that add little actionable value, and the content could be slightly more concise by removing obvious guidance.

Suggestions

Remove or significantly trim the 'Continuous Mode' and 'Integration with Hooks' sections, which add padding without actionable value.

Consider extracting language-specific command variants (JS vs Python) into a reference table or separate file to reduce inline branching.

DimensionReasoningScore

Conciseness

The skill is reasonably structured but includes some unnecessary padding like the 'Continuous Mode' section with vague advice ('set a mental checkpoint'), the 'Integration with Hooks' section that adds little value, and the 'When to Use' section which is somewhat obvious. The core verification phases are fairly lean though.

2 / 3

Actionability

Each phase provides concrete, executable bash commands with specific tools (npm, tsc, pyright, ruff, grep). The output format template is copy-paste ready, and the commands include practical touches like piping to tail/head for manageable output.

3 / 3

Workflow Clarity

The six phases are clearly sequenced with explicit stop-gates ('If build fails, STOP and fix before continuing', 'Fix critical ones before continuing'). The workflow follows a logical dependency chain (build → types → lint → tests → security → diff review) with a final verification report that aggregates pass/fail status and determines PR readiness.

3 / 3

Progressive Disclosure

The content is entirely inline in a single file with no references to supporting files, which is acceptable for a skill of this size. However, the security scan patterns, language-specific commands, and output format template could benefit from being split out or organized with clearer navigation for a skill this long (~100 lines of substantive content).

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents