Structured self-debugging workflow for AI agent failures using capture, diagnosis, contained recovery, and introspection reports.
45
56%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Loading evals