Operate as an agentic engineer using eval-first execution, decomposition, and cost-aware model routing.
45
45%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Scanned 5 days ago