uinaf/verify

Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. If the repo is not verifiable yet, hand off to `agent-readiness`; if you are reviewing someone else's code, use `review`.

1.02x

Quality

98%

Does it follow best practices?

Impact

94%

1.02x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Evidence Rules

Name: uinaf/verify
Rating: 97.2 (1 reviews)
Author: uinaf

Verification without proof is just flattering yourself.

What Counts as Evidence

Prefer reproducible proof:

commands or runtime surfaces exercised, recorded exactly in the work log when needed for reproduction
screenshots tied to a named flow
HTTP requests and responses
representative CLI output for failures or meaningful status lines
structured logs, traces, or health checks
concrete error messages, codes, and recovery hints from an exercised failure path
file references with line numbers for static findings

Rules

Screenshots are evidence, not verdicts
- A nice screenshot does not prove the flow works end to end
Swallow boring success output
- Keep success terse; surface failures and anomalies
Tie every finding to impact
- say what breaks, who it affects, or why the risk matters
Name the exact surface exercised
- page, endpoint, command, state transition, config path
For error-handling findings, capture the surfaced failure
- include the actual error text, code/classification, and whether the message helps the user recover
Flag unverified claims honestly
- if you could not hit the real surface, say unverified and why
Do not flood context with giant logs
- quote only the relevant lines or summarize with exact pointers
Keep final evidence human-readable
- in the final footer, summarize passing checks by intent and result; include full commands only for failures, reproduction, or when asked

Minimum Output Shape

For each meaningful finding:

severity
summary
evidence
impact
suggested fix or handoff

For the overall verification:

verdict: ready for review / needs more work / blocked (verify never issues ship it — that's review's call)
change verified
surfaces exercised
evidence summary: what passed or failed, not a raw command list unless reproduction requires it