Evidence-first pull request review with independent critique, selective challenger review, and human handoff.
87
92%
Does it follow best practices?
Impact
87%
Average score across 43 eval scenarios
Risky
Do not use without reviewing