Evidence-first pull request review with independent critique, selective challenger review, and human handoff.
An additional independent review pass that strengthens or weakens candidate findings.
enable_challenger_on setting
This skill must be usable in two configurations, one intra-model and one cross-model. This allows an apples-to-apples comparison between intra-model and cross-model review.
pr-evidence-builder
Do NOT read primary reviewer conclusions. The challenger always reviews independently.
Verify independence before starting. Confirm you have not read any files from the primary review output directory. Do not access findings from fresh-eyes-review. If any primary review content is in context, stop and restart with a clean context.
Review the evidence pack and raw diff independently. Approach the code as a separate critic with no knowledge of prior findings.
Produce candidate findings. Use the same output format as fresh-eyes-review. Each finding must meet the evidence threshold. If a finding does not meet the threshold, discard it rather than downgrading its severity.
For each finding, indicate whether it confirms an issue, refutes a suspected concern, or is novel.
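The independence check and the discard rule in the steps above can be sketched roughly as follows. This is only an illustration: the `Finding` fields mirror the example schema, but `touches_primary_output`, `meets_threshold`, `keep_supported`, and the `out/primary` directory name are hypothetical, and the threshold shown (a concrete location plus non-empty evidence) is an assumed stand-in for whatever threshold the skill actually defines.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class Finding:
    file: str
    line: int
    severity: str
    title: str
    evidence: str


def touches_primary_output(accessed_paths, primary_dir):
    """Return the accessed paths that sit inside the primary review output directory."""
    root = PurePosixPath(primary_dir)
    hits = []
    for raw in accessed_paths:
        path = PurePosixPath(raw)
        if path == root or root in path.parents:
            hits.append(raw)
    return hits


def meets_threshold(finding):
    # Hypothetical threshold: a concrete location plus non-empty evidence text.
    return bool(finding.file) and finding.line > 0 and bool(finding.evidence.strip())


def keep_supported(findings):
    """Drop findings below the threshold; severity is never rewritten."""
    return [f for f in findings if meets_threshold(f)]
```

If `touches_primary_output` returns anything, the run would stop and restart with a clean context. Note that `keep_supported` only filters: a weak finding is removed outright instead of being kept at a lower severity.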
Same schema as fresh-eyes-review, with source set to "challenger". Example:
{
  "source": "challenger",
  "file": "src/auth/token.ts",
  "line": 42,
  "severity": "high",
  "classification": "confirms",
  "title": "JWT secret falls back to hardcoded value",
  "evidence": "process.env.JWT_SECRET ?? 'dev-secret' means any deployment missing the env var silently uses a known secret.",
  "recommendation": "Remove the fallback entirely and throw on missing secret at startup."
}
Classification values:
"confirms": independent evidence supporting an issue that may also appear in the primary review
"refutes": evidence that a suspected concern is actually safe
"novel": finding not likely surfaced by the primary reviewer
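A finding in this shape can be checked mechanically before it is handed on. The sketch below is a minimal validator, not part of the skill itself: `validate_challenger_finding` is a hypothetical helper, and treating every field in the example as required is an assumption, since the schema is defined by fresh-eyes-review and not reproduced here.

```python
# Assumption: every field shown in the example finding is required.
REQUIRED_FIELDS = (
    "source", "file", "line", "severity",
    "classification", "title", "evidence", "recommendation",
)
# The three classification values listed above.
CLASSIFICATIONS = {"confirms", "refutes", "novel"}


def validate_challenger_finding(finding):
    """Return a list of problems; an empty list means the finding looks well formed."""
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if name not in finding]
    if finding.get("source") != "challenger":
        problems.append('source must be "challenger"')
    if finding.get("classification") not in CLASSIFICATIONS:
        problems.append("classification must be confirms, refutes, or novel")
    return problems


example = {
    "source": "challenger",
    "file": "src/auth/token.ts",
    "line": 42,
    "severity": "high",
    "classification": "confirms",
    "title": "JWT secret falls back to hardcoded value",
    "evidence": "process.env.JWT_SECRET ?? 'dev-secret' means any deployment "
                "missing the env var silently uses a known secret.",
    "recommendation": "Remove the fallback entirely and throw on missing secret at startup.",
}
```

Calling `validate_challenger_finding(example)` returns an empty list, while a finding with a different `source` or an unknown `classification` yields one message per problem.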