Review existing code, diffs, branches, or pull requests using concern-specific reviewer personas and evidence. Use when auditing someone else's work, triaging risk in a PR, or producing a ship-it / needs-review / blocked verdict. Do not use to verify your own completed change; use `verify` for that.
98
100%
Does it follow best practices?
Impact
96%
1.20xAverage score across 4 eval scenarios
Passed
No known issues
any type flagged
100%
100%
Unsafe cast flagged
0%
100%
Non-null assertion flagged
100%
100%
Dead code identified
100%
100%
Duplicate logic identified
100%
100%
Catch-all error flagged
100%
100%
Error classification recommended
100%
100%
Narrating comments flagged
100%
100%
Findings tied to impact
100%
100%
Valid verdict
0%
100%
Compact verdict block
62%
100%
CLAUDE.md loaded
100%
100%
File references in findings
100%
100%
Line-level evidence
100%
100%
Verdict present
62%
100%
Findings by severity
100%
100%
Error classification finding
100%
100%
Silent failure finding
75%
100%
Mock-heavy test concern
33%
100%
Dead code identified
100%
100%
Unverified surfaces marked
12%
100%
Recommended follow-up
100%
100%
No nit inflation
83%
100%
Personas listed
87%
100%
Compact verdict block
50%
100%
Default personas used
75%
87%
Types persona included
70%
100%
Cleanup persona included
75%
100%
Comments persona omitted
100%
100%
Repo guidance loaded
100%
100%
Personas listed in output
100%
83%
Verdict present
37%
100%
Scope stated
83%
83%
Findings ordered by severity
100%
100%
Evidence for findings
100%
100%
Silent failures finding
100%
100%
Unverified areas acknowledged
0%
100%
Recommended follow-up
100%
100%
Compact verdict block
66%
50%
Minimal personas selected
91%
83%
General persona used
87%
87%
Comments persona used
90%
90%
Tests persona omitted
100%
100%
Verdict present
60%
100%
Scope stated
87%
87%
Personas listed in output
87%
87%
No nit inflation
100%
100%
Docstring accuracy noted
100%
100%
Unverified areas or residual risk
100%
100%
Recommended follow-up
20%
100%
Compact verdict block
87%
62%