
he-compound-refresh

Analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context. Use when lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh.

55

Quality

62%

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./Plugins/harness-engineering/fixtures/budget-archive/2026-04-21/deferred-store/skills/team_automation/he-compound-refresh/SKILL.md

Quality

Content

50%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill provides a structured framework for analyzing compound Harness Engineering run states, with clear failure modes and anti-patterns. However, it suffers from abstract, process-level guidance without concrete executable examples, commands, or specific criteria for its classification outcomes. The workflow is sequenced but lacks the explicit validation checkpoints and feedback loops needed for the complexity of the operations described.

Suggestions

Add concrete examples showing what each classification outcome (Continue, Review, Fix, etc.) looks like with specific evidence patterns, rather than just listing the labels.

Include at least one executable command or tool invocation example (e.g., the session-collector command, a validation command) to make the procedure actionable rather than purely conceptual.

Add explicit validation checkpoints between procedure steps (e.g., 'After step 2, verify all targets are inventoried before proceeding to step 3') to strengthen the feedback loop for error recovery.

Consolidate the overlapping sections (Philosophy, Failure Modes, Constraints, and Anti-patterns all contain related guidance about not guessing or proceeding without evidence) into fewer, more focused sections.
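The checkpoint suggestion above can be illustrated with a minimal sketch. It is not taken from the skill itself: `Step`, `run_with_checkpoints`, and the step names in the usage note are hypothetical, and the only behavior borrowed from the review is "verify before proceeding" and "stop at first failed gate".

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Step:
    """One procedure step plus the checkpoint that must hold after it (hypothetical type)."""
    name: str
    run: Callable[[dict], None]    # performs the step by updating shared state
    check: Callable[[dict], bool]  # explicit checkpoint evaluated before the next step


def run_with_checkpoints(steps: List[Step], state: dict) -> Tuple[List[str], Optional[str]]:
    """Run steps in order and stop at the first failed checkpoint.

    Returns the names of the steps whose checkpoints passed, and the name of
    the step whose checkpoint failed (None if every checkpoint passed).
    """
    completed: List[str] = []
    for step in steps:
        step.run(state)
        if not step.check(state):
            # Stop here so later steps never act on unverified state.
            return completed, step.name
        completed.append(step.name)
    return completed, None
```

As a usage illustration, an "inventory" step could populate `state["targets"]` with a checkpoint that the list is non-empty, followed by a "classify" step whose checkpoint requires every target to have exactly one outcome such as Continue, Review, or Fix. If classification leaves a target unlabeled, the function returns that step's name and later steps never run.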

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The skill is moderately efficient but includes some unnecessary philosophical framing and redundant phrasing. Sections like 'Philosophy' and 'Anti-patterns' overlap with 'Failure Modes' and 'Constraints'. Some bullet points could be tightened, but it avoids explaining basic concepts Claude already knows. | 2 / 3 |
| Actionability | The procedure provides a numbered sequence of steps, but they are abstract and process-oriented rather than concrete. There are no executable commands, code snippets, or specific tool invocations, only conceptual guidance like 'classify each target into exactly one maintenance outcome.' The classification labels are listed but not defined with concrete criteria. | 2 / 3 |
| Workflow Clarity | The 8-step procedure provides a clear sequence, and the Validation section includes gate-checking ('stop at first failed gate'). However, the steps lack explicit validation checkpoints between them, and the feedback loop for error recovery is only implicitly described ('stale-mark ambiguous cases'). For a workflow involving potentially destructive document operations, the validation integration could be more explicit. | 2 / 3 |
| Progressive Disclosure | The skill references an external file (session-evidence-contract.md) and mentions assets, showing some progressive disclosure structure. However, no bundle files were provided to verify these references exist, and the skill itself is somewhat monolithic with many sections that could be split. The 'Full Context' section pointing to icon assets is not meaningful for operational guidance. | 2 / 3 |
| Total | Passed | 8 / 12 |

Description

75%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has a well-structured format with explicit 'what' and 'when' clauses, and occupies a clearly distinct niche around Harness Engineering run management. Its main weakness is that the actions described are somewhat abstract ('analyze', 'validate') rather than listing concrete operations, and the trigger terms are heavily jargon-laden, which may not match how users naturally phrase their requests.

Suggestions

Replace abstract verbs like 'analyze' and 'validate' with more concrete actions, e.g., 'Check run gate status, identify blocking dependencies, surface Linear ticket context, diagnose lifecycle drift'.

Add plain-language trigger term variations that users might naturally say, e.g., 'stuck run', 'pipeline blocked', 'run failing', 'check run status'.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Names the domain (Harness Engineering runs) and lists several actions (analyze, validate state, blockers, validation status, Linear context), but these are somewhat abstract rather than concrete operations. Terms like 'analyze' and 'validate' are broad. | 2 / 3 |
| Completeness | Clearly answers both 'what' (analyze and validate compound Harness Engineering run state, blockers, validation status, and Linear context) and 'when' (lifecycle runs drift, gates fail, blockers appear, or compound work needs refresh) with explicit trigger conditions. | 3 / 3 |
| Trigger Term Quality | Includes some relevant domain-specific terms like 'Harness Engineering', 'run state', 'blockers', 'validation status', 'Linear context', 'gates fail', and 'compound work'. However, these are highly specialized jargon that may not match natural user language, and common variations or plain-language equivalents are missing. | 2 / 3 |
| Distinctiveness / Conflict Risk | The description is highly specific to 'Harness Engineering' compound runs with Linear context, creating a clear niche that is unlikely to conflict with other skills. The combination of domain-specific terms makes it very distinguishable. | 3 / 3 |
| Total | Passed | 10 / 12 |
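The section percentages do not equal the raw totals: 8 / 12 would be about 67% and 10 / 12 about 83%, while the page shows 50% for Content and 75% for Description. One reading that reproduces both figures is to rescale each 1-to-3 dimension score to the range 0 to 1 before averaging. This is an inference from the numbers on this page, not a documented Tessl formula, and `dimension_percent` is a hypothetical helper.

```python
import math
from typing import List


def dimension_percent(scores: List[int], lo: int = 1, hi: int = 3) -> int:
    """Rescale each score from [lo, hi] to [0, 1], average, and floor to a whole percent.

    Assumption: the 1-to-3 scale and the flooring are guesses that happen to
    match the percentages shown on the page.
    """
    rescaled = [(s - lo) / (hi - lo) for s in scores]
    return math.floor(100 * sum(rescaled) / len(rescaled))


# Dimension scores as listed in the two tables on this page.
content_scores = [2, 2, 2, 2]      # Conciseness, Actionability, Workflow Clarity, Progressive Disclosure
description_scores = [2, 3, 2, 3]  # Specificity, Completeness, Trigger Term Quality, Distinctiveness

print(dimension_percent(content_scores))
print(dimension_percent(description_scores))
```

Under this assumption a dimension scored 2 / 3 contributes half credit rather than two thirds, which is why four scores of 2 / 3 come out at 50%.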

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 10 / 11 passed

Validation for skill structure

| Criteria | Description | Result |
| --- | --- | --- |
| metadata_version | 'metadata.version' is missing | Warning |
| Total | Passed | 10 / 11 |

Repository
jscraik/Agent-Skills
Reviewed
