Analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context. Use when lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh.
50
55%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./Plugins/harness-engineering/fixtures/budget-archive/2026-04-21/deferred-store/skills/team_automation/he-compound-refresh/SKILL.mdQuality
Discovery
75%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has a clear structure with both 'what' and 'when' clauses, and targets a very specific domain (Harness Engineering runs with Linear integration), making it distinctive. However, the specific actions could be more concrete (listing discrete operations rather than general verbs like 'analyze' and 'validate'), and the trigger terms lean heavily on internal jargon rather than natural user language.
Suggestions
Replace generic verbs like 'analyze and validate' with more concrete actions such as 'check gate pass/fail status, identify and resolve blockers, sync run state with Linear tickets'.
Add more natural trigger terms users might actually say, such as 'run stuck', 'pipeline blocked', 'gate check', 'run status', or 'refresh run' alongside the existing jargon.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Names the domain (Harness Engineering runs) and lists some actions like 'analyze', 'validate', but the specific capabilities are somewhat vague — 'run state', 'blockers', 'validation status', and 'Linear context' hint at concrete concepts but don't describe specific actions like 'check gate status', 'resolve blockers', or 'sync with Linear tickets'. | 2 / 3 |
Completeness | Clearly answers both 'what' (analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context) and 'when' (lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh) with an explicit 'Use when' clause containing specific trigger conditions. | 3 / 3 |
Trigger Term Quality | Includes some relevant terms like 'blockers', 'gates fail', 'lifecycle runs', 'Linear context', and 'compound work', but these are fairly domain-specific jargon. A user might say 'my run is stuck' or 'check blockers' but terms like 'compound Harness Engineering run state' and 'drift' are not natural user language. Missing common variations of how users would phrase requests. | 2 / 3 |
Distinctiveness Conflict Risk | Highly specific to 'Harness Engineering' compound runs with Linear integration — this is a very narrow niche that is unlikely to conflict with other skills. The combination of Harness Engineering, Linear context, and lifecycle run concepts creates a distinct identity. | 3 / 3 |
Total | 10 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides a structured framework for analyzing compound Harness Engineering run state but suffers significantly from lack of actionability — there are no concrete commands, tool invocations, schema definitions, or executable examples. The workflow is sequenced but abstract, with classification outcomes listed without decision criteria. The skill reads more like a process philosophy document than an operational guide Claude can execute.
Suggestions
Add concrete, executable examples: provide at least one complete input/output pair showing the markdown report format, the schema_version:1 structured output, and specific commands or tool calls used to inspect live state.
Define the classification criteria for the 13 maintenance outcomes (Continue, Review, Fix, etc.) with concrete decision rules or a decision table, rather than just listing them.
Integrate validation checkpoints directly into the procedure steps (e.g., after step 4, add an explicit gate: 'If check X fails, stop and report Y') rather than listing them in a separate section.
Provide the actual output schema referenced by 'schema_version: 1' so Claude knows the exact structure to produce.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is moderately efficient but includes some unnecessary philosophical preamble and verbose explanations. Sections like 'Philosophy' and 'Anti-patterns' add useful but somewhat redundant guidance that could be tightened. The 'Full Context' section pointing to icon assets adds no operational value. | 2 / 3 |
Actionability | The skill provides no concrete code, commands, or executable examples. The procedure is entirely abstract ('Re-run or inspect the smallest safe live-state checks') with no specific commands, tool invocations, or structured output templates. The 'schema_version: 1' output mention lacks any actual schema definition. The examples section contains only natural language prompts, not input/output pairs. | 1 / 3 |
Workflow Clarity | The 8-step procedure provides a clear sequence and the validation section includes a 'fail fast' directive, but validation checkpoints are described abstractly rather than as concrete gates within the workflow. The classification step (step 5) lists 13 possible outcomes without guidance on how to distinguish between them. The feedback loop ('stop at first failed gate') is mentioned but not integrated into the procedure steps themselves. | 2 / 3 |
Progressive Disclosure | The skill references an external file (session-evidence-contract.md) and mentions assets, showing some progressive disclosure intent. However, no bundle files were provided to verify these references exist, the 'Full Context' section only links to icons which aren't operationally relevant, and the skill contains substantial inline content (anti-patterns, constraints, failure modes) that could potentially be split out for a cleaner overview. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
metadata_version | 'metadata.version' is missing | Warning |
Total | 10 / 11 Passed | |
4c78f98
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.