Auto-generated tile from GitHub (10 skills)
92
94%
Does it follow best practices?
Impact
92%
1.16xAverage score across 44 eval scenarios
Advisory
Suggest reviewing before use
{
"context": "Tests whether the agent uses the gh CLI correctly for all GitHub operations, uses --body-file for PR body content, sets explicit --base and --head flags when opening a PR, waits for CI after opening the PR, and never includes AI co-authorship attributions. The scenario evaluates the octocat skill's specific workflow preferences.",
"type": "weighted_checklist",
"checklist": [
{
"name": "gh CLI only",
"description": "All GitHub operations are performed using the `gh` CLI. No instructions suggest using the GitHub web interface, REST API via curl, or other GitHub clients.",
"max_score": 10
},
{
"name": "--body-file flag",
"description": "The `gh pr create` command uses `--body-file <path>` instead of `--body \"...\"` for the PR description. The PR body content is written to a temporary file first.",
"max_score": 12
},
{
"name": "Explicit --base and --head",
"description": "The `gh pr create` command includes explicit `--base <branch>` and `--head <branch>` flags, not relying on defaults.",
"max_score": 10
},
{
"name": "CI check wait command",
"description": "After opening the PR, the script or instructions include `gh pr checks <num> --watch` (or equivalent with `2>&1`) to wait for CI to complete.",
"max_score": 12
},
{
"name": "No AI co-authorship",
"description": "No commit message, PR body, or script output contains 'Co-Authored-By: Claude', 'Co-Authored-By: Codex', 'Generated by ChatGPT', or any similar AI attribution string.",
"max_score": 14
},
{
"name": "No signing config changes",
"description": "The script does NOT modify git signing configuration (`user.signingkey`, `gpg.program`, `commit.gpgsign`) in any git config command.",
"max_score": 10
},
{
"name": "Structured PR body content",
"description": "The PR body written to the temporary file is structured markdown with at least a summary/description section (not a single line of plain text).",
"max_score": 8
},
{
"name": "Shell script artifact",
"description": "The output includes a shell script (`.sh`) that automates the described PR workflow, so the process is reproducible and inspectable.",
"max_score": 8
},
{
"name": "Rebase validation step",
"description": "If interactive rebase is performed, the script or instructions include `git log --oneline -n 10` (or similar) to verify history after the rebase.",
"max_score": 8
},
{
"name": "Temp file cleanup",
"description": "Any temporary files created for the PR body (e.g., `/tmp/pr-body.md`) are removed after use, or the script is structured so they don't persist.",
"max_score": 8
}
]
}evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
scenario-16
scenario-17
scenario-18
scenario-19
scenario-20
scenario-21
scenario-22
scenario-23
scenario-24
scenario-25
scenario-26
scenario-27
scenario-28
scenario-29
scenario-30
scenario-31
scenario-32
scenario-33
scenario-34
scenario-35
scenario-36
scenario-37
scenario-38
scenario-39
scenario-40
scenario-41
scenario-42
scenario-43
scenario-44
skills
documentation
fastify
init
linting-neostandard-eslint9
node
nodejs-core
rules
oauth
octocat
snipgrapher