he-technical-review

Review diffs, PRs, specs, plans, or review-feedback items and return severity-ranked engineering findings with exact locations. Use when technical risks or feedback correctness must be verified before implementation.

Quality

62%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./Plugins/harness-engineering/fixtures/budget-archive/2026-04-21/deferred-store/skills/code_quality_review/he-technical-review/SKILL.md

Quality

Discovery

67%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has good structural completeness with both 'what' and 'when' clauses clearly stated, and the output format (severity-ranked findings with exact locations) adds useful specificity. However, the breadth of inputs (diffs, PRs, specs, plans, review-feedback) makes it somewhat generic and potentially overlapping with other review-oriented skills, and the trigger terms could better match natural user language.

Suggestions

Add more natural trigger terms users would actually say, such as 'code review', 'pull request review', 'design doc review', 'audit code', or 'check for issues'.

Narrow or better differentiate the scope—clarify what distinguishes this from a general code review or PR summary skill, e.g., emphasize the severity-ranking and risk-verification angle more distinctly.

Dimension	Reasoning	Score
Specificity	Names several input types (diffs, PRs, specs, plans, review-feedback items) and describes the output (severity-ranked engineering findings with exact locations), but the actions themselves are somewhat vague—'review' is broad and the specific concrete operations beyond ranking are not enumerated.	2 / 3
Completeness	Clearly answers both 'what' (review diffs/PRs/specs/plans and return severity-ranked findings with exact locations) and 'when' (use when technical risks or feedback correctness must be verified before implementation), with an explicit 'Use when' clause.	3 / 3
Trigger Term Quality	Includes some relevant terms like 'diffs', 'PRs', 'specs', 'review-feedback', and 'technical risks', but misses common natural variations users might say such as 'code review', 'pull request', 'design review', 'audit', or 'check for bugs'. The phrase 'feedback correctness must be verified' is somewhat unnatural.	2 / 3
Distinctiveness Conflict Risk	The scope is fairly broad—reviewing diffs, PRs, specs, and plans could overlap with general code review skills, PR summary skills, or architecture review skills. The 'severity-ranked findings with exact locations' output format adds some distinctiveness, but the wide input range increases conflict risk.	2 / 3
	Total	9 / 12 Passed

Implementation

57%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured process-oriented skill with good progressive disclosure and clear routing guidance. Its main weaknesses are the lack of concrete, executable examples (no actual output schema shown, no finding template inline) and somewhat abstract procedural steps that describe intent rather than specific actions. The skill would benefit significantly from a concrete output example and more specific validation checkpoints.

Suggestions

Add a concrete example of a findings output (even abbreviated) showing the actual schema with severity, location, impact, minimal fix, and confidence fields so Claude knows exactly what to produce.

Define P0-P3 severity levels explicitly or reference a specific file that contains the definitions.

Integrate validation checkpoints directly into the procedure steps (e.g., after step 2: 'CHECKPOINT: verify each finding has all required fields before proceeding') rather than listing them in a separate section.

Dimension	Reasoning	Score
Conciseness	The skill is reasonably structured but includes some unnecessary verbosity—e.g., the 'Philosophy' section is two lines that restate what the procedure already implies, the 'Examples' section lists natural-language prompts rather than adding technical value, and some sections like 'Anti-Patterns' could be tighter. However, it doesn't over-explain concepts Claude already knows.	2 / 3
Actionability	The procedure provides a numbered sequence of steps but they are abstract and descriptive rather than concrete—no executable commands, no code snippets, no specific tool invocations, no structured output template. The outputs section mentions a schema but doesn't show it. Findings severity levels (P0-P3) are mentioned but not defined. The guidance tells Claude what to do conceptually but not how to do it concretely.	2 / 3
Workflow Clarity	The procedure has a clear 6-step sequence and the validation section adds checkpoint-like criteria. However, the validation steps are described as assertions rather than actionable checkpoints integrated into the workflow. The 'fail fast' instruction is mentioned but not tied to specific steps. For a review skill involving potentially destructive feedback implementation, the feedback loops are implicit rather than explicit (e.g., 'clarify unclear items, verify, then respond' lacks concrete gates).	2 / 3
Progressive Disclosure	The skill is well-organized as an overview with clear sections and well-signaled one-level-deep references to contract.yaml, evals.yaml, task-profile.json, finding.md.tmpl, and routing documents. The 'Read when' annotations at the bottom provide contextual guidance for when to consult references. Route-elsewhere guidance is clear and specific.	3 / 3
	Total	9 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
metadata_version	'metadata.version' is missing	Warning

	Total	10 / 11 Passed

Repository: jscraik/Agent-Skills
Commit: d00c351

Reviewed: about 24 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.