General-purpose coding policy for Baruch's AI agents
92
91%
Does it follow best practices?
Impact
93%
Average score across 10 eval scenarios
Advisory
Suggest reviewing before use