CtrlK
BlogDocsLog inGet started
Tessl Logo

jbvc/verification-loop

A comprehensive verification system for Claude Code sessions.

40

Quality

51%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid, actionable verification workflow skill with clear sequencing and executable commands across six well-defined phases. Its main weaknesses are minor verbosity in framing sections (When to Use, Continuous Mode, Integration with Hooks) and the lack of bundle files to offload language-specific variations or extended security patterns. The core workflow is strong with appropriate stop-gates and a structured output template.

Suggestions

Remove or significantly trim the 'When to Use', 'Continuous Mode', and 'Integration with Hooks' sections — they add little actionable value and consume tokens.

Consider extracting language-specific command variants (Python vs JS/TS) into a separate reference file to keep the main skill leaner and more focused.

DimensionReasoningScore

Conciseness

The skill is reasonably structured but includes some unnecessary content like the 'Continuous Mode' section with vague advice ('set a mental checkpoint'), the 'Integration with Hooks' section that adds little value, and the 'When to Use' section which is somewhat obvious. The commands themselves are lean, but the surrounding prose could be tightened.

2 / 3

Actionability

Each phase provides concrete, executable bash commands that are copy-paste ready. The output format template is specific and actionable. Commands include practical touches like piping to tail/head for manageable output.

3 / 3

Workflow Clarity

The six phases are clearly sequenced with explicit stop-gates (e.g., 'If build fails, STOP and fix before continuing'). The workflow progresses logically from build → types → lint → tests → security → diff review, with a structured output report that serves as a final validation checkpoint.

3 / 3

Progressive Disclosure

The content is well-organized with clear section headers, but everything is in a single monolithic file with no references to supporting files. The security scan patterns, output format template, and per-language variations could be split into separate reference files for better organization, though for a skill of this size it's borderline acceptable.

2 / 3

Total

10

/

12

Passed

Description

0%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This description is critically underspecified. It reads as a vague tagline rather than a functional skill description, providing no concrete actions, no trigger terms, and no guidance on when to use it. Without knowing what is being verified, how, or in what context, Claude would have no reliable basis for selecting this skill over any other.

Suggestions

Specify the concrete actions this skill performs (e.g., 'Validates code output, checks for regressions, verifies test results') instead of the vague 'comprehensive verification system.'

Add an explicit 'Use when...' clause with natural trigger terms (e.g., 'Use when the user asks to verify, validate, or check the correctness of code changes, test outputs, or session results').

Clarify the distinct niche this skill occupies—what kind of verification, for what artifacts, and how it differs from testing or linting skills.

DimensionReasoningScore

Specificity

The description uses vague, abstract language ('comprehensive verification system') without naming any concrete actions. It does not specify what is being verified or how.

1 / 3

Completeness

The description fails to clearly answer 'what does this do' (what specifically is verified?) and completely lacks any 'when should Claude use it' guidance. There is no 'Use when...' clause or equivalent.

1 / 3

Trigger Term Quality

The only potentially relevant terms are 'verification' and 'Claude Code sessions,' which are not natural keywords a user would say. There are no actionable trigger terms like 'check,' 'validate,' 'test,' or specific task-related words.

1 / 3

Distinctiveness Conflict Risk

'Verification system' is extremely generic and could overlap with testing, linting, code review, validation, or any number of quality-assurance-related skills. There is nothing to distinguish it from other skills.

1 / 3

Total

4

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents