General-purpose coding policy for Baruch's AI agents
91
93%
Does it follow best practices?
Impact
91%
Average score across 12 eval scenarios
Advisory
Suggest reviewing before use