Content
85%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, well-structured skill that provides clear executable guidance for running bounded experiment loops. Its main strengths are the explicit workflow with validation checkpoints, concrete ledger and iteration examples, and well-organized progressive disclosure. The primary weakness is moderate verbosity with some redundancy across sections (Avoid/Anti-Patterns, Constraints/Decision Language/Gotchas) that could be consolidated to save tokens.
Suggestions
Consolidate overlapping sections: merge 'Avoid' into 'Anti-Patterns', and deduplicate guard/regression rules that appear in both 'Decision Language', 'Constraints', and 'Gotchas'.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is fairly dense and avoids explaining basic concepts, but some sections are verbose or redundant (e.g., 'Decision Language' overlaps with 'Constraints' and 'Gotchas'; the 'Avoid' and 'Anti-Patterns' sections partially overlap). Several bullet points could be tightened without losing meaning. | 2 / 3 |
Actionability | The skill provides concrete executable examples (shell commands, YAML ledger entries, specific verify/guard commands), a clear iteration example with real command output, and specific decision criteria (baseline - candidate >= min_delta). The ledger entry template and iteration example are copy-paste ready. | 3 / 3 |
Workflow Clarity | The workflow is clearly sequenced (9 numbered steps) with explicit validation checkpoints (baseline first, verify then guard, keep/discard/crash/block with evidence). Feedback loops are well-defined: fail fast at first failed gate, repair smallest failing unit, rerun before proceeding. The decision criteria include explicit guard regression checks. | 3 / 3 |
Progressive Disclosure | The skill ends with a clear 'Progressive Disclosure' section pointing to one-level-deep references (references/autoresearch-project.md, references/contract.yaml, references/evals.yaml, references/task-profile.json). The main body serves as an effective overview, and the discovery interview section also references references/discovery-interview.md. Navigation is well-signaled. | 3 / 3 |
Total | 11 / 12 Passed |