Evidence-first pull request review with independent critique, selective challenger review, and human handoff.
93
Does it follow best practices? 94%
Impact: 93% (average score across 43 eval scenarios)
Passed: no known issues