Name: uinaf/verify
Rating: 93.6 (1 reviews)
Author: uinaf

uinaf/verify

Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. Do not use when the repo is not verifiable yet or when reviewing someone else's code.

1.01x

Quality

94%

Does it follow best practices?

Impact

92%

1.01x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted skill with strong workflow clarity, excellent actionability through concrete commands and examples, and good progressive disclosure via referenced supporting files. The main weakness is mild verbosity — the Principles, Handoffs, and Before You Start sections contain some redundancy that could be consolidated without losing clarity. Overall, it's a high-quality skill that effectively guides Claude through a pre-review verification process.

Suggestions

Consolidate the Principles and Handoffs sections — several points (e.g., 'verify is not a ship decision', 'independent review is separate') are stated multiple times across the intro, principles, handoffs, and workflow synthesis.

Dimension	Reasoning	Score
Conciseness	The skill is mostly efficient and well-structured, but some principles and workflow steps could be tightened. Phrases like 'Verify proves the change boots, passes guardrails, and survives the real surface; it is not a ship decision' repeat what's already conveyed elsewhere. The Principles and Handoffs sections have some redundancy with each other and with the workflow steps.	2 / 3
Actionability	The skill provides concrete, executable guidance throughout: specific commands like `make verify`, `curl http://127.0.0.1:3000/health`, `node dist/cli.js --help`; clear examples of self-corrections (typos, unused imports, `any` types); a concrete output template with an example. The workflow steps are specific and copy-paste ready where appropriate.	3 / 3
Workflow Clarity	The 5-step workflow is clearly sequenced with explicit validation checkpoints: run guardrails first, exercise the real surface, self-correct, probe adjacent risk, then synthesize verdict. The feedback loop is present (fix and re-run), and the verdict system provides clear decision gates. The 'Before You Start' section adds pre-flight checks, and the handoffs section clarifies boundaries.	3 / 3
Progressive Disclosure	The skill provides a clear overview with well-signaled one-level-deep references to three supporting files (verification.md, evidence-rules.md, simplification.md). Content is appropriately split between the main skill and references. The inline references within workflow steps are contextually placed where they're needed.	3 / 3
	Total	11 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong, well-crafted skill description that clearly defines its purpose, provides abundant natural trigger terms, and explicitly delineates both when to use and when not to use the skill. The inclusion of specific outputs (verdict types) and explicit exclusions makes it highly distinctive and unlikely to conflict with related skills like code review or deployment skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: run repo guardrails (lint, typecheck, tests, build), exercise the real surface with evidence, catch self-correctable issues, and produces a specific verdict output (ready for review / needs more work / blocked).	3 / 3
Completeness	Clearly answers both 'what' (self-check completed changes, run guardrails, exercise real surface, produce a verdict) and 'when' (explicit 'Use when' clause with multiple triggers, plus a 'Do not use when' clause that further clarifies scope and boundaries).	3 / 3
Trigger Term Quality	Excellent coverage of natural trigger terms users would say: 'check your work', 'run checks', 'validate changes', 'ready', 'test it end-to-end', 'lint', 'typecheck', 'tests', 'build'. These are phrases a user would naturally use when wanting a pre-review sanity check.	3 / 3
Distinctiveness Conflict Risk	Clearly carved niche as a pre-review self-check skill with distinct scope boundaries. The 'Do not use when' clause explicitly excludes auditing others' diffs/PRs, and the 'never a ship decision' constraint further distinguishes it from deployment or review approval skills.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

4 days ago

Table of Contents

Discovery Implementation Validation