Content
92%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-crafted skill with a clear, actionable workflow for integrity review. It is concise, avoids explaining concepts Claude already knows, and provides specific tools, scripts, and output requirements. The main weakness is that the skill heavily depends on bundled reference files (5 docs, 3 scripts, evals) that are not provided, which limits its standalone utility and makes progressive disclosure harder to fully evaluate.
Suggestions
Provide the bundled reference files (docs/integrity-signals.md, docs/reward-hacking-patterns.md, etc.) so the skill can function as designed and progressive disclosure can be properly evaluated.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is lean and efficient. It avoids explaining what code review is or how tools work. Every section serves a distinct purpose—contract establishment, scan guidance, false positive control, output format—without redundancy or padding. | 3 / 3 |
Actionability | The skill provides a concrete 7-step review workflow, specific script paths to run, named tools for each language ecosystem, and a precise output format with required fields (category, severity, confidence, evidence, rationale, remediation). The guidance is specific and directly executable. | 3 / 3 |
Workflow Clarity | The 7-step review workflow is clearly sequenced from establishing the contract through profiling, scanning, inspecting tests, comparing against contract, correlating signals, and producing findings. It includes validation checkpoints (step 6 correlates signals before reporting, the integrity-first rule acts as a final filter, and false positive control provides a feedback loop to drop illegitimate findings). | 3 / 3 |
Progressive Disclosure | The skill references 5 bundled docs, 3 scripts, and an evals README with clear one-level-deep links, which is good structure. However, no bundle files were provided, so we cannot verify these references resolve correctly. The main content is well-organized but the referenced materials are entirely absent, making the skill incomplete in isolation. | 2 / 3 |
Total | 11 / 12 Passed |