Evidence-first pull request review with independent critique, selective challenger review, and human handoff.
89
92%
Does it follow best practices?
Impact
89%
Average score across 43 eval scenarios
Risky
Do not use without reviewing