he-compound

Run a bounded Harness Engineering lifecycle across multiple stages. Use when the user wants coordinated brainstorm, spec, plan, work, review, and fix flow rather than one isolated stage.

Quality

51%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./Plugins/harness-engineering/fixtures/budget-archive/2026-04-21/deferred-store/skills/team_automation/he-compound/SKILL.md

Quality

Content

27%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill reads as an abstract policy document rather than actionable guidance. It defines rules and constraints for a 'Harness Engineering Compound' workflow but never provides concrete commands, file templates, example outputs, or executable steps. The heavy use of internal jargon (lifecycle mode, artifact-first evidence, context-disposition policy) without definitions or examples makes it difficult to follow, and the lack of any supporting bundle files or linked references leaves critical gaps in understanding.

Suggestions

Add concrete, executable examples showing what 'Select lifecycle mode using artifact-first evidence' looks like in practice—include sample inputs, decision criteria, and expected outputs with the schema_version:1 format.

Provide links or bundle files for referenced concepts like 'skill-refactor', 'skillify', 'he-compound-refresh', and the Harness Engineering stages so the skill can serve as a true progressive-disclosure entrypoint.

Replace abstract procedure steps with specific commands or decision trees, e.g., 'Check for docs/solutions/*.md matching the topic → if found and overlap > 80%, refresh existing file → else create new file at docs/solutions/<topic>.md'.

Add at least one complete worked example showing a request flowing through mode selection, stage routing, and output generation with actual file contents.

Dimension	Reasoning	Score
Conciseness	The content is moderately efficient but includes philosophical framing and abstract descriptions that don't add actionable value. Phrases like 'Progressive-disclosure entrypoint for stage orchestration and durable learning capture' are jargon-heavy without being instructive. Some sections (e.g., 'Philosophy') could be cut entirely.	2 / 3
Actionability	The skill provides no concrete code, commands, file paths, or executable steps. The procedure is entirely abstract ('Select lifecycle mode using artifact-first evidence') with no specifics on how to actually perform any step. There are no concrete examples of inputs/outputs, no command syntax, and no executable guidance.	1 / 3
Workflow Clarity	There is a numbered procedure with a logical sequence and a validation section with explicit fail-fast behavior. However, the steps are abstract and lack concrete validation checkpoints—'Confirm mode selection matches available evidence' doesn't specify how to confirm or what evidence looks like. The workflow reads more like policy than an executable sequence.	2 / 3
Progressive Disclosure	The skill references several external concepts (skill-refactor, skillify, he-compound-refresh, docs/solutions/, Harness Engineering stages) but provides no links to documentation for any of them. No bundle files are provided. The content is a monolithic block of abstract rules with no clear navigation to supporting materials that would make it actionable.	1 / 3
	Total	6 / 12 Passed

Description

75%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has strong completeness and distinctiveness, clearly stating both what it does and when to use it, and explicitly differentiating from single-stage skills. However, the specificity of capabilities could be improved by describing what each stage actually does, and the trigger terms lean toward internal jargon ('Harness Engineering lifecycle') rather than natural user language.

Suggestions

Add brief concrete descriptions of what each stage produces (e.g., 'brainstorm ideas, write a specification document, create an implementation plan, execute code changes, review output, and fix issues').

Include more natural trigger terms users might say, such as 'full workflow', 'end-to-end development', 'multi-step project', or 'complete development cycle'.

Dimension	Reasoning	Score
Specificity	It names the domain (Harness Engineering lifecycle) and lists the stages (brainstorm, spec, plan, work, review, fix), but these are stage names rather than concrete actions describing what each stage does. It's more of a list of phases than specific capabilities.	2 / 3
Completeness	It clearly answers both 'what' (run a bounded Harness Engineering lifecycle across multiple stages) and 'when' (when the user wants coordinated multi-stage flow rather than one isolated stage), with an explicit 'Use when' clause and a distinguishing contrast against single-stage usage.	3 / 3
Trigger Term Quality	Terms like 'brainstorm', 'spec', 'plan', 'review', and 'fix' are somewhat natural, but 'Harness Engineering lifecycle' and 'bounded' are jargon-heavy. Users are more likely to say things like 'run the full workflow' or 'end-to-end development' rather than 'coordinated brainstorm, spec, plan, work, review, and fix flow'.	2 / 3
Distinctiveness Conflict Risk	The description explicitly distinguishes itself from individual stage skills by specifying 'coordinated...flow rather than one isolated stage,' which creates a clear niche and reduces conflict with skills that handle individual stages like brainstorm or review alone.	3 / 3
	Total	10 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
metadata_version	'metadata.version' is missing	Warning

	Total	10 / 11 Passed

Repository: jscraik/Agent-Skills
Commit: 8e7e19d

Reviewed: 5 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.