Audit and build the infrastructure a repo needs so agents can work autonomously — boot scripts, smoke tests, CI/CD gates, dev environment setup, observability, and isolation. Use when a repo can't boot, tests are broken or missing, there's no dev environment, agents can't verify their work, or agents need human help to get anything done. Do not use for reviewing an existing diff or for documentation-only cleanup.
94
94%
Does it follow best practices?
Impact
95%
1.13xAverage score across 3 eval scenarios
Passed
No known issues
Make a repo ready for autonomous agent work by adding mechanical proof: boot scripts, smoke checks, CI/hooks, observable signals, and isolation where needed. Add the smallest useful layer first; stop once the repo is reliably verifiable.
Concrete examples:
pnpm dev, cargo run, or docker compose upcurl http://127.0.0.1:3000/healthpnpm exec playwright testGrade the repo across these dimensions:
For each, report:
pass / partial / failUse references/grading.md. Lowest dimension sets the overall grade.
Also scan unattended-run constraints: session independence, explicit artifact paths, resource bounds, infrastructure-enforced permissions, and direct CLI/HTTP/file interfaces for dashboard-only flows. If these are not needed for the current task, keep them as remaining gaps instead of expanding the scope.
Example output:
bootable: partial — `pnpm dev` starts the app after manual env setup
testable: fail — only mocked tests under test/
observable: partial — health endpoint exists, structured logs missing
verifiable: fail — no stable smoke or interaction script
overall grade: DBuild missing layers in this order:
Boot → Smoke → Interact → E2e → Enforce → Observe → Isolate
Each step should be independently useful. Stop once the repo is reliably verifiable.
Prioritize one canonical local gate (make verify, just verify, ./scripts/verify.sh, or equivalent) that agents can run before push. It should mirror meaningful CI checks enough to catch routine failures without opening a dashboard.
When readiness work includes agent entrypoints, keep AGENTS.md as the canonical authored guide and place CLAUDE.md beside it as a symlink to AGENTS.md rather than maintaining two separate guidance files.
See references/setup-patterns.md for local gates, boot scripts, e2e, observability, isolation, containerized stacks, and tooling-version ownership.
Tighten weak or flaky layers:
When the repo reaches C+ and can be judged honestly, stop readiness work and report the next natural phase. If changes created doc drift, report the documentation gap instead of expanding the scope.
After readiness work, report in this compact bullet shape:
- grade: before → after- evidence: concise explanations of what readiness checks proved- files changed: changed readiness files- remaining gaps: highest-impact gaps only, or none- next: runtime proof, independent review, documentation cleanup, human review, or noneKeep details compact: