
he-reconcile

Analyze repo, tracker, PR, validation, session, and .harness evidence. Use when multi-stage Harness Engineering work needs safe resume routing.

35

Quality

30%

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./Plugins/harness-engineering/skills/he-reconcile/SKILL.md

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill reads as an internal coordination protocol with heavy domain-specific jargon, and it lacks the concrete, executable guidance that effective skill content needs. The procedure is abstract rather than actionable: it defines a detailed output schema but provides no code examples, command templates, or structured output samples. The progressive disclosure structure references many files, but the organization is cluttered, and the core content defers too much critical detail to references that are not available.

Suggestions

Add a concrete, complete example of the structured output format (JSON/YAML) showing all fields like schema_version, mode, stage_map, etc. with realistic values so Claude knows exactly what to produce.

Replace the abstract procedure steps with specific, executable instructions — e.g., 'Read .harness/tracker/*.yaml to extract stage status' or 'Run `grep -r stage_status .harness/` to find incomplete stages' rather than 'Reconstruct lifecycle state from live repo'.

Consolidate the repeated 'See references/hot-path-folded-context.md' references into a single note at the top or bottom, and organize the References section into a clean list with one-line descriptions of when each reference applies.

Expand the Examples section with at least one full input→output example showing the complete structured output for a realistic reconciliation scenario.
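
As a rough illustration of the first two suggestions, the sketch below scans a `.harness` directory for `stage_status` entries, much as the suggested `grep` would, and assembles them into a structured report. Only the field names `schema_version`, `mode`, and `stage_map` come from the review above; their values, the `incomplete` field, the `complete` status value, and both function names are assumptions made for illustration, not the skill's actual schema.

```python
import json
import re
from pathlib import Path

# Matches "stage_status: <value>" lines, similar to `grep -r stage_status`.
STATUS_RE = re.compile(r"stage_status\s*:\s*(\w+)")


def scan_stage_status(root):
    """Map each readable file under root to the stage_status values it contains."""
    found = {}
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        statuses = STATUS_RE.findall(text)
        if statuses:
            found[path.relative_to(root).as_posix()] = statuses
    return found


def build_report(stage_map):
    """Wrap the scan result in a hypothetical structured output."""
    incomplete = sorted(
        name
        for name, statuses in stage_map.items()
        if any(status != "complete" for status in statuses)  # "complete" is assumed
    )
    return {
        "schema_version": 1,       # hypothetical value
        "mode": "reconcile",       # hypothetical value
        "stage_map": stage_map,
        "incomplete": incomplete,  # hypothetical field, not named in the review
    }
```

Calling `json.dumps(build_report(scan_stage_status(".harness")), indent=2)` would then print a report of the kind the first suggestion asks the skill to document, so that the expected shape is shown rather than described.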

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The skill is moderately efficient but contains significant jargon-heavy prose that could be tightened. Terms like 'context-disposition policy', 'blackboard_delta', and 'coding_lens' are used without definition, and several sections repeat the pattern of listing concerns without adding clarity. The repeated 'See references/hot-path-folded-context.md' lines add noise. | 2 / 3 |
| Actionability | The skill provides no concrete code, commands, or executable examples. The procedure is described abstractly ('Reconstruct lifecycle state from live repo...') without specific tool invocations, file paths to inspect, or command-line operations. The examples section describes scenarios in prose rather than showing actual inputs/outputs or structured output examples. | 1 / 3 |
| Workflow Clarity | There is a numbered procedure with a logical sequence, and validation/failure handling sections exist with explicit gate statuses (pass/fail/blocked). However, the procedure steps are abstract and lack concrete validation checkpoints between steps. Critical detail is deferred to 'references/hot-path-folded-context.md' which is not provided, making the actual workflow incomplete as written. | 2 / 3 |
| Progressive Disclosure | The skill references multiple external files (contract.yaml, evals.yaml, hot-path-folded-context.md, stage-arc-boundary-contract.md, etc.) which shows intent for progressive disclosure. However, none of the bundle files are provided, references are numerous and somewhat disorganized (some inline, some in References section, some in other sections), and the repeated 'See references/hot-path-folded-context.md' pattern across multiple sections suggests content organization could be cleaner. | 2 / 3 |
| Total | | 7 / 12 |

Passed

Description

25%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This description is heavily laden with internal jargon ('Harness Engineering', 'safe resume routing') that would be opaque to most users and to Claude when selecting among skills. It fails to describe concrete actions and relies on abstract terms. While it attempts a 'Use when' clause, both the capability description and trigger guidance are too vague to be effective.

Suggestions

Replace abstract language like 'analyze...evidence' with specific concrete actions (e.g., 'Validates session state from .harness files, checks PR completion status, determines safe resume points').

Rewrite the 'Use when...' clause with natural trigger terms a user might actually say (e.g., 'Use when resuming an interrupted harness workflow, checking harness session state, or validating multi-stage pipeline progress').

Clarify what 'Harness Engineering' means in context and what 'safe resume routing' actually produces as an output, so Claude can distinguish this skill from general CI/CD or repo analysis skills.
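
The checks these suggestions imply can be approximated with a small heuristic. The sketch below is not Tessl's scoring rubric; it is an illustrative lint, with a hypothetical function name, that looks for a 'Use when' clause, for concrete action verbs (the verb list is an assumption drawn from the examples in this review), and for the jargon terms the review flags.

```python
import re

# Verbs taken from the examples in this review; the list itself is an assumption.
VERB_RE = re.compile(r"\b(extract|validate|merge|generate|check|determine)s?\b")

# Terms this review calls out as jargon.
JARGON = ("harness engineering", "safe resume routing", "multi-stage")


def review_description(text):
    """Return simple signals about a skill description (heuristic only)."""
    lowered = text.lower()
    verbs = sorted({match.group(1) for match in VERB_RE.finditer(lowered)})
    return {
        "has_use_when": "use when" in lowered,
        "action_verbs": verbs,
        "jargon": [term for term in JARGON if term in lowered],
    }


current = (
    "Analyze repo, tracker, PR, validation, session, and .harness evidence. "
    "Use when multi-stage Harness Engineering work needs safe resume routing."
)
suggested = (
    "Validates session state from .harness files, checks PR completion status, "
    "determines safe resume points"
)

print(review_description(current))
print(review_description(suggested))
```

On the current description the lint finds the 'Use when' clause, no listed action verbs, and all three jargon terms; on the suggested wording it finds three action verbs and no jargon, but no 'Use when' clause, which is why the suggestions above treat the two halves of the description separately.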

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | The description uses vague terms like 'analyze...evidence' and 'safe resume routing' without listing concrete actions. It's unclear what specific operations are performed: no verbs like 'extract', 'validate', 'merge', or 'generate' are present. | 1 / 3 |
| Completeness | It has a 'Use when...' clause addressing the 'when' question, but the 'what' is extremely vague ('analyze...evidence'). The 'when' clause itself ('multi-stage Harness Engineering work needs safe resume routing') is also unclear and jargon-heavy, making both halves weak though technically present. | 2 / 3 |
| Trigger Term Quality | The terms 'Harness Engineering', 'safe resume routing', and 'multi-stage' are highly specialized jargon that users would not naturally say. Terms like 'repo', 'PR', 'tracker' are generic and would overlap with many other skills without providing clear trigger value. | 1 / 3 |
| Distinctiveness / Conflict Risk | The mention of 'Harness Engineering' and '.harness evidence' provides some niche specificity, but terms like 'repo', 'PR', 'tracker', 'session', and 'validation' are extremely generic and could easily conflict with version control, CI/CD, project management, or testing skills. | 2 / 3 |
| Total | | 6 / 12 |

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: jscraik/Agent-Skills
Reviewed

