he-compound-refresh

Analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context. Use when lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh.

Quality

62%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Security

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./Plugins/harness-engineering/fixtures/budget-archive/2026-04-21/deferred-store/skills/team_automation/he-compound-refresh/SKILL.md

Quality

Content

50%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill provides a structured framework for analyzing compound Harness Engineering run states, with clear failure modes and anti-patterns. However, its guidance is abstract and process-level: it offers no concrete executable examples or commands, and no specific criteria for its classification outcomes. The workflow is sequenced but lacks the explicit validation checkpoints and feedback loops that operations of this complexity need.

Suggestions

Add concrete examples showing what each classification outcome (Continue, Review, Fix, etc.) looks like with specific evidence patterns, rather than just listing the labels.

Include at least one executable command or tool invocation example (e.g., the session-collector command, a validation command) to make the procedure actionable rather than purely conceptual.

Add explicit validation checkpoints between procedure steps (e.g., 'After step 2, verify all targets are inventoried before proceeding to step 3') to strengthen the feedback loop for error recovery.

Consolidate overlapping sections—Philosophy, Failure Modes, Constraints, and Anti-patterns all contain related guidance about not guessing or proceeding without evidence—into fewer, more focused sections.
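The suggestion about explicit validation checkpoints could be realized roughly as follows. This is a minimal sketch, not the skill's actual procedure: the function names, the target identifiers, and the gate names are all hypothetical, and only the three outcome labels quoted in this review (Continue, Review, Fix) are modeled, although the skill lists more.

```python
from enum import Enum


class Outcome(Enum):
    # Only the three labels named in the review; the skill's own list is longer.
    CONTINUE = "Continue"
    REVIEW = "Review"
    FIX = "Fix"


def checkpoint_inventory(expected_targets, inventoried):
    """Gate after the inventory step: return the targets that were missed."""
    return sorted(set(expected_targets) - set(inventoried))


def checkpoint_classification(inventoried, classifications):
    """Gate after classification: every target needs exactly one Outcome."""
    problems = []
    for target in inventoried:
        outcome = classifications.get(target)
        if not isinstance(outcome, Outcome):
            problems.append(f"{target}: no single outcome recorded")
    return problems


def run_gates(expected_targets, inventoried, classifications):
    """Run the gates in order and stop at the first one that fails."""
    missing = checkpoint_inventory(expected_targets, inventoried)
    if missing:
        return ("inventory", missing)
    problems = checkpoint_classification(inventoried, classifications)
    if problems:
        return ("classification", problems)
    return ("ok", [])
```

Returning the name of the failed gate together with its evidence mirrors the "stop at first failed gate" behavior that the review attributes to the skill's Validation section, and gives an agent something concrete to report instead of guessing.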

Dimension	Reasoning	Score
Conciseness	The skill is moderately efficient but includes some unnecessary philosophical framing and redundant phrasing. Sections like 'Philosophy' and 'Anti-patterns' overlap with 'Failure Modes' and 'Constraints'. Some bullet points could be tightened, but it avoids explaining basic concepts Claude already knows.	2 / 3
Actionability	The procedure provides a numbered sequence of steps, but they are abstract and process-oriented rather than concrete. There are no executable commands, code snippets, or specific tool invocations—just conceptual guidance like 'classify each target into exactly one maintenance outcome.' The classification labels are listed but not defined with concrete criteria.	2 / 3
Workflow Clarity	The 8-step procedure provides a clear sequence, and the Validation section includes gate-checking ('stop at first failed gate'). However, the steps lack explicit validation checkpoints between them, and the feedback loop for error recovery is only implicitly described ('stale-mark ambiguous cases'). For a workflow involving potentially destructive document operations, the validation integration could be more explicit.	2 / 3
Progressive Disclosure	The skill references an external file (session-evidence-contract.md) and mentions assets, showing some progressive disclosure structure. However, no bundle files were provided to verify these references exist, and the skill itself is somewhat monolithic with many sections that could be split. The 'Full Context' section pointing to icon assets is not meaningful for operational guidance.	2 / 3
	Total	8 / 12 Passed

Description

75%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has a well-structured format with explicit 'what' and 'when' clauses, and occupies a clearly distinct niche around Harness Engineering run management. Its main weakness is that the actions described are somewhat abstract ('analyze', 'validate') rather than listing concrete operations, and the trigger terms are heavily jargon-laden, which may not match how users naturally phrase their requests.

Suggestions

Replace abstract verbs like 'analyze' and 'validate' with more concrete actions, e.g., 'Check run gate status, identify blocking dependencies, surface Linear ticket context, diagnose lifecycle drift'.

Add plain-language trigger term variations that users might naturally say, e.g., 'stuck run', 'pipeline blocked', 'run failing', 'check run status'.

Dimension	Reasoning	Score
Specificity	Names the domain (Harness Engineering runs) and lists several actions (analyze, validate state, blockers, validation status, Linear context), but these are somewhat abstract rather than concrete operations. Terms like 'analyze' and 'validate' are broad.	2 / 3
Completeness	Clearly answers both 'what' (analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context) and 'when' (lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh) with explicit trigger conditions.	3 / 3
Trigger Term Quality	Includes some relevant domain-specific terms like 'Harness Engineering', 'run state', 'blockers', 'validation status', 'Linear context', 'gates fail', and 'compound work'. However, these are highly specialized jargon that may not match natural user language, and common variations or plain-language equivalents are missing.	2 / 3
Distinctiveness / Conflict Risk	The description is highly specific to 'Harness Engineering' compound runs with Linear context, creating a clear niche that is unlikely to conflict with other skills. The combination of domain-specific terms makes it very distinguishable.	3 / 3
	Total	10 / 12 Passed
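The percentages on this page do not equal the raw totals (8 / 12 is not 50%), so it may help to see one reading that reproduces them. The sketch below is an inference from the numbers shown here, not a documented Tessl formula: it assumes each 1 to 3 rubric score is rescaled so that a 1 earns nothing, that the headline Quality figure averages the Content and Description percentages, and that displayed values are truncated.

```python
def dimension_percent(scores, low=1, high=3):
    """Rescale rubric scores so that all-low is 0% and all-high is 100%."""
    earned = sum(s - low for s in scores)
    possible = (high - low) * len(scores)
    return 100 * earned / possible


# Scores copied from the two dimension tables in this review.
content = dimension_percent([2, 2, 2, 2])
description = dimension_percent([2, 3, 2, 3])

# Assumed aggregation: a plain average of the two percentages.
quality = (content + description) / 2

# Validation appears to be a simple pass ratio (10 of 11 checks).
validation = 100 * 10 / 11

print(int(content), int(description), int(quality), int(validation))
```

Under these assumptions the script prints `50 75 62 90`, which matches the Content, Description, Quality, and Validation figures reported on this page.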

Validation

90%


Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
metadata_version	'metadata.version' is missing	Warning

	Total	10 / 11 Passed
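The single warning concerns a missing `metadata.version` key. As a rough illustration of what such a check looks for, the sketch below assumes the SKILL.md frontmatter has already been parsed into a dictionary; the function name is hypothetical, and the version string in the usage lines is a placeholder rather than a value taken from the skill.

```python
def missing_metadata_version(frontmatter):
    """Return True when the frontmatter has no non-empty metadata.version."""
    metadata = frontmatter.get("metadata")
    return not (isinstance(metadata, dict) and metadata.get("version"))


# Hypothetical frontmatter before and after adding the key.
before = {"name": "he-compound-refresh"}
after = {"name": "he-compound-refresh", "metadata": {"version": "1.0.0"}}

print(missing_metadata_version(before), missing_metadata_version(after))
```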

Repository: jscraik/Agent-Skills
Commit: 8e7e19d

Reviewed: 5 days ago
