CtrlK
BlogDocsLog inGet started
Tessl Logo

uinaf/verify

Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. Do not use when the repo is not verifiable yet or when reviewing someone else's code.

93

1.01x
Quality

94%

Does it follow best practices?

Impact

92%

1.01x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted skill with strong workflow clarity, excellent actionability through concrete commands and examples, and good progressive disclosure via referenced supporting files. The main weakness is mild verbosity — the Principles, Handoffs, and Before You Start sections contain some redundancy that could be consolidated without losing clarity. Overall, it's a high-quality skill that effectively guides Claude through a pre-review verification process.

Suggestions

Consolidate the Principles and Handoffs sections — several points (e.g., 'verify is not a ship decision', 'independent review is separate') are stated multiple times across the intro, principles, handoffs, and workflow synthesis.

DimensionReasoningScore

Conciseness

The skill is mostly efficient and well-structured, but some principles and workflow steps could be tightened. Phrases like 'Verify proves the change boots, passes guardrails, and survives the real surface; it is not a ship decision' repeat what's already conveyed elsewhere. The Principles and Handoffs sections have some redundancy with each other and with the workflow steps.

2 / 3

Actionability

The skill provides concrete, executable guidance throughout: specific commands like `make verify`, `curl http://127.0.0.1:3000/health`, `node dist/cli.js --help`; clear examples of self-corrections (typos, unused imports, `any` types); a concrete output template with an example. The workflow steps are specific and copy-paste ready where appropriate.

3 / 3

Workflow Clarity

The 5-step workflow is clearly sequenced with explicit validation checkpoints: run guardrails first, exercise the real surface, self-correct, probe adjacent risk, then synthesize verdict. The feedback loop is present (fix and re-run), and the verdict system provides clear decision gates. The 'Before You Start' section adds pre-flight checks, and the handoffs section clarifies boundaries.

3 / 3

Progressive Disclosure

The skill provides a clear overview with well-signaled one-level-deep references to three supporting files (verification.md, evidence-rules.md, simplification.md). Content is appropriately split between the main skill and references. The inline references within workflow steps are contextually placed where they're needed.

3 / 3

Total

11

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong, well-crafted skill description that clearly defines its purpose, provides abundant natural trigger terms, and explicitly delineates both when to use and when not to use the skill. The inclusion of specific outputs (verdict types) and explicit exclusions makes it highly distinctive and unlikely to conflict with related skills like code review or deployment skills.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: run repo guardrails (lint, typecheck, tests, build), exercise the real surface with evidence, catch self-correctable issues, and produces a specific verdict output (ready for review / needs more work / blocked).

3 / 3

Completeness

Clearly answers both 'what' (self-check completed changes, run guardrails, exercise real surface, produce a verdict) and 'when' (explicit 'Use when' clause with multiple triggers, plus a 'Do not use when' clause that further clarifies scope and boundaries).

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms users would say: 'check your work', 'run checks', 'validate changes', 'ready', 'test it end-to-end', 'lint', 'typecheck', 'tests', 'build'. These are phrases a user would naturally use when wanting a pre-review sanity check.

3 / 3

Distinctiveness Conflict Risk

Clearly carved niche as a pre-review self-check skill with distinct scope boundaries. The 'Do not use when' clause explicitly excludes auditing others' diffs/PRs, and the 'never a ship decision' constraint further distinguishes it from deployment or review approval skills.

3 / 3

Total

12

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents