General-purpose coding policy for Baruch's AI agents
89
97%
Does it follow best practices?
Impact
89%
Average score across 18 eval scenarios
Advisory
Suggest reviewing before use