he-tdd

Build behavior-safe code changes with TDD and RED/GREEN evidence. Use when he-plan or he-work requires TDD for a concrete behavior target.

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

57%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is well-organized with strong progressive-disclosure signaling and a clear procedure, but it leans on external preserved files for real executable detail and has some restatement across overlapping sections. Verification of the actual RED/GREEN behavior is described rather than checkpointed with a test command.

Suggestions

Add an explicit validate-test -> fix -> re-run feedback loop with the repository test command so RED/GREEN verification is a checkpoint, not just described.

Merge or cut the overlapping "Use" / "When to use" / "Progressive Disclosure Entry" framing to remove restatement.

Inline at least one minimal RED/GREEN test example or the test command so the skill is actionable without loading external repo files.

Dimension	Reasoning	Score
Conciseness	Sections are mostly lean lists, but "When to use" restates the description and overlaps with "Use", and the "Progressive Disclosure Entry"/"Full Context" framing adds restatement that could be trimmed.	2 / 3
Actionability	A concrete audit command and concrete RED/GREEN procedural steps exist, but core instruction (test commands, interface selection) is abstract and externalized to preserved repo files that are not loadable here.	2 / 3
Workflow Clarity	The five-step Procedure is clearly sequenced with a fail-fast gate, but the in-file validation command audits the skill rather than the RED/GREEN behavior, leaving the validate-test->fix->retry feedback loop implicit and without a test command.	2 / 3
Progressive Disclosure	The entry is a concise overview with a well-signaled, one-level-deep "Full Context" section where each reference carries a "Read when:" trigger, matching the clear-overview-with-navigation anchor.	3 / 3
	Total	9 / 12 Passed

Description

85%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, complete, and distinct, with an explicit Use-when trigger and third-person voice. Its main weakness is trigger-term quality, which depends on internal stage names and omits common natural phrasings.

Suggestions

Broaden the trigger terms beyond "he-plan or he-work" to include natural user phrasings like "test-first", "write tests first", or "failing test before implementation".

Keep the existing explicit "Use when" clause but pair it with at least one user-facing scenario description so the trigger reads naturally.

Dimension	Reasoning	Score
Specificity	"Build behavior-safe code changes with TDD and RED/GREEN evidence" lists multiple concrete actions (build changes, TDD, RED/GREEN evidence), matching the multi-action anchor rather than the domain-plus-some-actions anchor.	3 / 3
Completeness	It states what ("Build behavior-safe code changes with TDD and RED/GREEN evidence") and an explicit when ("Use when he-plan or he-work requires TDD for a concrete behavior target"), satisfying both halves.	3 / 3
Trigger Term Quality	Natural terms like "TDD" and "RED/GREEN" appear, but the trigger leans on internal stage names "he-plan or he-work" and omits common variations such as "test-first" or "write tests first".	2 / 3
Distinctiveness Conflict Risk	The TDD-with-RED/GREEN-evidence niche scoped to harness-engineering stages is a clear, distinct trigger unlikely to fire for unrelated skills.	3 / 3
	Total	11 / 12 Passed

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 13 / 16 Passed

Validation for skill structure

Criteria	Description	Result
metadata_version	'metadata.version' is missing	Warning
metadata_field	'metadata' should map string keys to string values	Warning
relative_links	Relative link issues: 12 suspicious	Warning

	Total	13 / 16 Passed

Repository: jscraik/Agent-Skills
Commit: 5a6027f

Reviewed: 5 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.