Content
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill attempts to be comprehensive but suffers from significant verbosity — it packs what should be 3-4 separate reference documents into a single body. The harnessability scoring system and automation recommendation engine are well-conceived features but their full specification belongs in referenced files, not inline. The skill lacks a clear step-by-step execution workflow and executable detection logic, relying instead on descriptive tables and prose.
Suggestions
Extract the Harnessability Score table, Recommend-Automation section, and Output Split catalog into separate referenced files (e.g., `docs/skill-guides/harnessability.md`, `docs/skill-guides/recommend-automation.md`) and keep only a 2-3 line summary with pointer in the main body.
Add a clear numbered workflow sequence (e.g., '1. Check for existing audit → 2. Scan repo structure → 3. Detect stack signals → 4. Calculate harnessability → 5. Generate splits → 6. Validate output completeness → 7. Persist') with explicit validation checkpoints.
Replace detection prose in tables with concrete tool invocations (e.g., `find . -name 'tsconfig.json' -exec grep -l 'strict.*true' {} +` or `Bash: ls .eslintrc* 2>/dev/null`) to make the skill more actionable.
Remove explanatory text that Claude already knows (e.g., what MCP servers do, what subagents are for, what hooks accomplish) and keep only the mapping logic from detected signals to recommendations.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~250+ lines. It includes extensive tables for harnessability scoring, automation recommendations, MCP servers, slash commands, and subagents that could easily be split into referenced files. Much of this content (like explaining what hooks do, what MCP servers are for) assumes Claude doesn't know things it already does. The 'Recommend-Automation' section alone is massive and should be a separate reference file. | 1 / 3 |
Actionability | The skill provides concrete detection patterns (e.g., check for `tsconfig.json` with `strict:true`, look for `.eslintrc*`) and clear output formats with markdown examples. However, there are no executable code snippets or commands — detection logic is described in prose/tables rather than as runnable scripts or specific tool invocations. The guidance is specific but not copy-paste executable. | 2 / 3 |
Workflow Clarity | The skill describes when to audit, what to output, and persistence rules, but lacks a clear numbered step-by-step workflow for actually performing the audit. There's no explicit sequence like 'Step 1: Read existing audit, Step 2: Scan file tree, Step 3: Detect stack...' with validation checkpoints. The 'When to Reaudit' section and token economy rules hint at validation but don't form a coherent feedback loop. | 2 / 3 |
Progressive Disclosure | The skill references several external files (`policies/harness-categories.md`, `docs/skill-guides/repo-auditor.md`, `templates/audit.md`, `policies/handoffs.md`, etc.) which is good progressive disclosure design. However, the body itself is monolithic — the harnessability scoring table, recommend-automation section, output split catalog, and detection rules are all inline when they should be in referenced files. The references exist but the content that should be delegated to them remains in the main body. | 2 / 3 |
Total | 7 / 12 Passed |