Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. If the repo is not verifiable yet, hand off to `agent-readiness`; if you are reviewing someone else's code, use `review`.
97
100%
Does it follow best practices?
Impact
89%
1.02xAverage score across 3 eval scenarios
Passed
No known issues
Use the existing infrastructure to prove your own change works before calling it done.
AGENTS.md, CLAUDE.md, or repo rules before judging the resultagent-readinessagent-readinessreviewdocsAGENTS.md, CLAUDE.md, or repo rules, when presentmake verify, just verify, pnpm test, cargo test, or the nearest targeted equivalentcurl http://127.0.0.1:3000/healthnode dist/cli.js --help or the repo's packaged entrypointFollow references/evidence-rules.md when collecting proof.
any, unsafe as, boundary-leaking unknown, and non-null assertions as safety failures unless the repo explicitly allows themUse references/simplification.md for the exact simplification questions.
Produce one clear outcome:
ship itneeds reviewblockedIf blocked because the infrastructure is weak, say so explicitly and hand off to agent-readiness.
After verification, report:
agent-readiness, docs, or implementationExample:
verdict: needs review
change verified: retry banner after transient API failure
surfaces exercised: pnpm test test/retry.spec.ts, curl http://127.0.0.1:3000/api/retry
code-shape finding: low — retry counter update is split across two helpers with identical branching; merge into one explicit path
finding: medium — the UI recovers, but the retry count is not persisted across refresh
evidence: local API returned 200 after retry; browser screenshot after refresh shows count reset to 0
recommended follow-up: implementation