General-purpose coding policy for Baruch's AI agents
96
90%
Does it follow best practices?
Impact
97%
1.24xAverage score across 14 eval scenarios
Passed
No known issues
A mid-sized engineering team has recently adopted GitHub Copilot for automated code review across all their repositories. The team's release process currently requires developers to manually open PRs and then navigate the GitHub UI to add reviewers — a tedious process that often results in PRs sitting unreviewed because developers forget the extra step. The team lead wants a reusable release.sh bash script that every developer can run locally to open a PR and immediately trigger a Copilot review, so nothing slips through.
The team has tried using gh pr edit --add-reviewer in the past but found bot accounts were silently ignored. They need an approach that reliably works with bot reviewers. The script should also handle the case where the bot's identifier becomes outdated over time, and it should confirm that the review request was actually registered before returning.
Produce a release.sh bash script that:
The script does not need to actually run successfully in this environment (no GitHub credentials available) — focus on correctness of the implementation.
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
rules
skills
eval-curation
install-reviewer