Challenge AI output with structured devil's-advocate protocols: anchor, verify, framing, and deep sub-commands.
68
86%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Loading evals