General-purpose coding policy for Baruch's AI agents
90
91%
Does it follow best practices?
Impact
90%
1.30xAverage score across 18 eval scenarios
Advisory
Suggest reviewing before use
A platform team manages a monorepo where several developers merge PRs throughout the day. After a merge, developers frequently forget to sync their local main branch, leave stale local branches around, and walk away without confirming the release actually published. This leads to subtle issues: developers push new work on top of outdated local main, accidentally re-create deleted branches, and only notice the release never went out hours later when a downstream consumer reports it missing.
The team wants a merge-and-cleanup.sh bash script that a developer can run once a PR is ready to land. The script should handle the merge itself, clean up the local and remote state, and confirm the release actually published — giving the developer a clear final status rather than leaving them to check manually.
Produce a merge-and-cleanup.sh bash script that:
The script does not need to run successfully (no GitHub credentials available) — focus on correctness of the approach and commands used.
.tessl-plugin
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
scenario-16
scenario-17
scenario-18
rules
skills
adopt-fork-pr
eval-curation
install-reviewer