General-purpose coding policy for Baruch's AI agents
88
91%
Does it follow best practices?
Impact
88%
Average score across 18 eval scenarios
Advisory
Suggest reviewing before use