Run structured Codex/Claude autoreview closeout: choose the target, collect schema-validated findings, and rerun tests plus review until clean.
80
100%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Loading evals