"A comprehensive verification system for Claude Code sessions."
40
50%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Loading evals