Content
70%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured instruction-only skill that clearly defines the experiment lifecycle workflow, transitions to other skills, and boundaries of responsibility. Its main weakness is moderate verbosity in framing sections and reliance on external references for concrete execution details, making the skill itself more of a philosophical guide than a hands-on playbook. The progressive disclosure and workflow clarity are strong points.
Suggestions
Trim the opening paragraph and the 'Your Stance' section: Claude doesn't need motivational framing such as 'You are the empirical engine' or 'This is where you earn your keep.'
Add a concrete example of a complete experiment record (even abbreviated) inline, showing the flow from assumption → experiment design → result → assumption update, rather than deferring all concrete details to referenced files.
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The content is mostly efficient but includes some unnecessary philosophical framing ('You are the empirical engine of the discovery system') and explanatory prose that Claude doesn't need. The 'Your Stance' section, while useful, could be tighter. The outcome propagation and transitions sections are well-structured but slightly verbose. | 2 / 3 |
| Actionability | The skill provides concrete guidance on success criteria and pre-commitment with good examples ('7 of 10 users complete the task without asking for help'), and clear instructions for outcome propagation. However, it lacks executable code/commands and relies heavily on references for the actual how-to (design, lifecycle tracking). The skill itself is more directional than copy-paste actionable. | 2 / 3 |
| Workflow Clarity | The experiment lifecycle is clearly sequenced: design with success criteria → pre-commit actions → run → record results → update assumption → review parent ideas → check shared assumptions → suggest next actions. The outcome propagation section provides an explicit multi-step process with validation-like checkpoints (review impact, check shared assumptions). Transitions to other skills are clearly defined with trigger conditions. | 3 / 3 |
| Progressive Disclosure | The skill provides a clear overview with well-signaled one-level-deep references: design principles in references/design-experiment.md, lifecycle tracking in references/experiment-lifecycle.md, schemas in experiment-record.md and assumption.md, and the artifacts skill for writing guidance. Content is appropriately split between the overview and referenced files. | 3 / 3 |
| Total | | 10 / 12 Passed |
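
To make the second suggestion concrete, the flow from assumption → experiment design → result → assumption update might be sketched as below. This is only an illustration: the skill's real schemas live in experiment-record.md and assumption.md, which are not reproduced in this review, so every class, field, and status name here (`Assumption`, `ExperimentRecord`, `success_threshold`, `action_if_pass`, and so on) is a hypothetical stand-in. The threshold reuses the skill's own example of 7 of 10 users completing the task without asking for help.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Status(Enum):
    # Hypothetical status values; the skill's assumption.md may use others.
    UNTESTED = "untested"
    VALIDATED = "validated"
    INVALIDATED = "invalidated"


@dataclass
class Assumption:
    statement: str
    status: Status = Status.UNTESTED


@dataclass
class ExperimentRecord:
    assumption: Assumption
    success_threshold: int   # successes required, fixed before the run
    sample_size: int         # total participants
    action_if_pass: str      # pre-committed next step if the criterion is met
    action_if_fail: str      # pre-committed next step if it is not
    successes: Optional[int] = None  # filled in once the experiment has run

    def record_result(self, successes: int) -> str:
        """Store the result, update the assumption, return the pre-committed action."""
        if not 0 <= successes <= self.sample_size:
            raise ValueError("successes must be between 0 and sample_size")
        self.successes = successes
        passed = successes >= self.success_threshold
        self.assumption.status = Status.VALIDATED if passed else Status.INVALIDATED
        return self.action_if_pass if passed else self.action_if_fail


# Example run: criterion and actions are written down before any data exists.
assumption = Assumption("Users can complete the task without asking for help")
experiment = ExperimentRecord(
    assumption=assumption,
    success_threshold=7,
    sample_size=10,
    action_if_pass="Proceed with the current design",
    action_if_fail="Revisit the parent idea",
)
next_action = experiment.record_result(successes=8)
```

Because the threshold and both follow-up actions are fields set at construction time, the record itself documents the pre-commitment; `record_result` only compares the outcome against what was already agreed. Reviewing parent ideas and checking shared assumptions, which the skill also calls for, are left out of this sketch.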