Reviews repositories, pull requests, diffs, and agent-generated code for reward hacking, fake completion, defensive theater, architectural bypasses, weakened guarantees, hidden fallbacks, and misleading abstractions.
98
97%
Does it follow best practices?
Impact
100%
1.09xAverage score across 6 eval scenarios
Passed
No known issues
Review code for deceptive implementation behavior, reward hacking, architectural shortcuts, defensive theater, fake completion, and false confidence. Prioritize evidence that the implementation does not honestly deliver the behavior it claims.
Before starting the review, read the relevant bundled references provided with this skill:
Apply the integrity-first rule before finalizing findings: report issues that materially affect correctness, maintainability, architectural consistency, or operational transparency. Drop or demote observations that are only stylistic, generic lint noise, or superficial cleanup unless they directly weaken the implementation contract.
Actively search for:
Use bundled scripts when helpful:
scripts/run-python-scans.sh for Python repositoriesscripts/run-ts-scans.sh for TypeScript or JavaScript repositoriesscripts/collect-signals.py <path> for lightweight pattern-based leadsEvaluation scenarios are indexed in evals/README.md.
The scripts skip missing tools and continue running available checks. Do not present missing optional tools as implementation findings.
Recommended tools: Ruff, Semgrep, basedpyright, vulture, and lint-imports for
Python; ESLint, dependency-cruiser, knip, and ts-prune for TypeScript or
JavaScript.
Do not punish simple code for being simple. A finding needs evidence that the implementation creates false confidence or violates a stated contract.
Treat adapters, optional dependencies, defensive error handling, and narrow scope as legitimate when they preserve semantics, report failures clearly, and are covered by tests.
Lead with findings ordered by severity. In file-oriented workspaces or eval
scenarios, write the review to IMPLEMENTATION_INTEGRITY_REVIEW.md; otherwise
answer directly in the conversation. Every finding must include:
If there are no integrity findings, say that directly and list any residual review limits such as unavailable tools or uninspected runtime behavior.