Review existing code, diffs, branches, or pull requests by spawning mandatory concern-specific reviewer subagents, then synthesize a ship-it / needs-review / blocked verdict.
92
97%
Does it follow best practices?
Impact
81%
1.22xAverage score across 4 eval scenarios
Passed
No known issues
Independently audit existing code by spawning concern-specific reviewer subagents, then synthesize one evidence-backed ship decision.
blocked unless the user explicitly allows a sequential fallbackgeneral, tests, and silent-failuresAGENTS.md, CLAUDE.md, or repo rules, when presentAdd conditional personas only when they add a distinct concern; use references/reviewer-selection.md for shortcuts and criteria.
Review the requested code, but inspect adjacent behavior when the risk leaks past the named diff.
Refresh the source of truth before judging branches or PRs: base branch, head SHA, diff, checks, and linked issue or specification when present.
Spawn one subagent per selected persona. Run them in parallel when the environment supports it, and keep each persona concern-focused and independent. Do not collapse the gang into one blended self-review pass.
Use this prompt shape for each subagent, filling in the persona and scope:
You are the <persona> reviewer. Read the target repo guidance, then review only <scope> against your persona concerns from skills/review-gang/reviewers/<persona>.md.
Return only material findings with file/line evidence, severity, confidence, and the proof or missing proof that changes the verdict. If you find nothing material, say "none" and name any residual unverified surface.Concrete starting points:
git diff --stat <base>...HEAD to size the changegit diff <base>...HEAD -- <path> to inspect risky filespnpm test path/to/spec when behavior claims need proofOrder findings by severity. If no findings are discovered, say that explicitly and mention any residual risk or testing gap. Choose exactly one verdict: ship it, needs review, or blocked.
After review, report in this compact bullet shape:
- findings: first, only when present; otherwise - findings: none- verdict: exactly one of ship it, needs review, or blocked- evidence: concise explanations of what checks proved, not full commands- unverified: residual risk, readiness gaps, or none- next: one of implementation, runtime verification, readiness setup, documentation cleanup, or none- notes: only for out-of-scope repo state the user must act onUse those labels explicitly. Keep the verdict label exact and omit opener, closer, apology, status preface, or conversational recap.
Prefer the active harness's best native review representation instead of a prose-heavy wall of text.
Keep detailed issue text in native findings or fallback finding bullets. Keep the verdict footer to 4 labeled lines or fewer after findings.
See references/reviewing.md for stale evidence handling and presentation details.
Example:
- finding: high — src/auth/session.ts:42 fallback returns an anonymous session when token parsing fails
- verdict: needs review
- evidence: session tests exercised token parsing failures
- unverified: malformed OAuth callback runtime behavior
- next: implementation