General-purpose coding policy for Baruch's AI agents
93
97%
Does it follow best practices?
Impact
93%
1.82xAverage score across 18 eval scenarios
Advisory
Suggest reviewing before use
APPROVED, CHANGES_REQUESTED, COMMENTED) classifies whether the review gates the merge — it never classifies whether the review's content must be readCOMMENTED and "non-blocking" describe the merge gate, never permission to skip the review bodyCOMMENTED review with zero inline comments still carries a body that must be read.tessl-plugin
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
scenario-16
scenario-17
scenario-18
rules
skills
adopt-fork-pr
eval-curation
install-reviewer
migrate-to-plugin