General-purpose coding policy for Baruch's AI agents
90
91%
Does it follow best practices?
Impact
90%
Average score across 18 eval scenarios
Advisory
Suggest reviewing before use