
arn-code-review-implementation

This skill should be used when the user says "review implementation", "review the project", "check implementation", "quality review", "validate implementation", "implementation review", or wants a post-execution quality gate to verify that the implementation follows the project's stored code and testing patterns and matches the plan. Reports issues as ERRORS, WARNINGS, INFO with a verdict. Do NOT use this for reviewing PRs (use arn-code-review-pr) or validating plans (use arn-code-review-plan).

68

Quality

83%

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Risky

Do not use without reviewing


Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured, highly actionable implementation review skill with clear workflows, explicit check IDs, severity classifications, and a precise output format. Its main weakness is length: the document packs many conditional sub-workflows (visual regression layers, cross-layer comparison, sketch promotion) inline, and these would be better split into reference files. The content is substantive rather than padded, but could be tightened in places.

Suggestions

Extract Steps 3b (Visual Regression Review), 3c (Cross-Layer Comparison), and 3d (Sketch Promotion Verification) into separate reference files to reduce the main SKILL.md length and improve progressive disclosure.

Remove framing sentences like 'This is a self-contained review -- it does not invoke sub-agents' and 'This is NOT a hardcoded checklist' that explain meta-concepts Claude doesn't need.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The skill is thorough but verbose in places. The pipeline position diagram and some explanatory text (e.g., 'This is a self-contained review -- it does not invoke sub-agents') add tokens without much value for Claude. The check tables are efficient, but the overall document is quite long with some redundant framing. However, most content is genuinely instructive rather than explaining things Claude already knows. | 2 / 3 |
| Actionability | The skill provides highly concrete, structured checklists with specific check IDs, severity levels, exact file paths to read, exact conditions to evaluate, and a precise output format. The workflow is fully specified with clear decision logic (verdict rules, conditional steps, error handling). Claude knows exactly what to do at each step. | 3 / 3 |
| Workflow Clarity | The multi-step workflow is clearly sequenced (Steps 1-5) with explicit conditional gates (e.g., 'Only execute this step if 2 or more active layers produced captures'), validation checkpoints throughout (PC04 file existence checks, visual regression thresholds), error recovery paths in the Error Handling section, and a structured verdict system that drives next actions. Feedback loops are present (NEEDS FIXES -> ask user to fix). | 3 / 3 |
| Progressive Disclosure | The skill references a bundle file (`cross-layer-comparison-guide.md`) which is good progressive disclosure, but the main SKILL.md itself is very long (~250+ lines of detailed content) with Steps 3, 3b, 3c, and 3d all inline. Some of these conditional sub-steps (visual regression, sketch promotion, cross-layer comparison) could be split into separate reference files to keep the main skill leaner. No bundle files were provided to verify the reference path. | 2 / 3 |
| Total | | 10 / 12 (Passed) |
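The severity classification, verdict system, and conditional gate credited under Workflow Clarity can be pictured with a minimal sketch. Only the severity names, the `NEEDS FIXES` verdict, the `PC04` check ID, and the "2 or more active layers" gate come from the review; the other verdict names, the verdict rule itself, and the layer names in the usage note are assumptions made for illustration, not the skill's actual logic.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List


class Severity(Enum):
    ERROR = "ERROR"
    WARNING = "WARNING"
    INFO = "INFO"


@dataclass(frozen=True)
class Finding:
    check_id: str        # e.g. "PC04", the file existence check named in the review
    severity: Severity
    message: str


def verdict(findings: List[Finding]) -> str:
    """Assumed rule: any ERROR blocks, warnings pass with a caveat."""
    severities = {f.severity for f in findings}
    if Severity.ERROR in severities:
        return "NEEDS FIXES"          # verdict name quoted in the review
    if Severity.WARNING in severities:
        return "PASS WITH WARNINGS"   # hypothetical verdict name
    return "PASS"                     # hypothetical verdict name


def should_run_cross_layer(captures_per_layer: Dict[str, int]) -> bool:
    """Gate quoted in the review: run only if 2 or more active layers produced captures."""
    return sum(1 for count in captures_per_layer.values() if count > 0) >= 2
```

For example, a single `PC04` finding at `ERROR` severity would yield `NEEDS FIXES` under this rule, and `should_run_cross_layer({"web": 3, "mobile": 0})` would skip the cross-layer step because only one layer produced captures.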

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description with excellent trigger term coverage and explicit 'when to use' guidance including negative boundaries that distinguish it from related skills. The main weakness is that the 'what it does' portion could be more specific about the concrete actions performed during the review. Overall, it would perform well in a multi-skill selection scenario.

Suggestions

Add more specific concrete actions the skill performs, e.g., 'checks code style adherence, validates test coverage, verifies architectural patterns, confirms plan alignment' to improve specificity.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | The description mentions some actions like 'verify that the implementation follows the project's stored code and testing patterns and matches the plan' and 'Reports issues as ERRORS, WARNINGS, INFO with a verdict,' but it doesn't list multiple concrete actions comprehensively; it's more about the overall purpose than specific capabilities. | 2 / 3 |
| Completeness | Clearly answers both 'what' (verifies implementation follows stored code/testing patterns and matches the plan, reports issues as ERRORS/WARNINGS/INFO) and 'when' (explicit trigger phrases listed, plus negative boundaries distinguishing it from PR review and plan validation skills). | 3 / 3 |
| Trigger Term Quality | Excellent coverage of natural trigger terms: 'review implementation', 'review the project', 'check implementation', 'quality review', 'validate implementation', 'implementation review', and 'post-execution quality gate'. These are phrases users would naturally say. | 3 / 3 |
| Distinctiveness Conflict Risk | Explicitly distinguishes itself from related skills (arn-code-review-pr for PRs, arn-code-review-plan for plans) and carves out a clear niche as a post-execution implementation quality gate. The negative triggers ('Do NOT use this for...') significantly reduce conflict risk. | 3 / 3 |
| Total | | 11 / 12 (Passed) |
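To make the idea of trigger terms and negative boundaries concrete, here is a deliberately naive sketch. In practice an agent selects a skill by reading its description, not by string matching, so the `select_skill` function and its regular expressions are purely hypothetical; only the trigger phrases and the three skill names are taken from the description.

```python
import re
from typing import Optional

# Trigger phrases listed in the skill description.
TRIGGERS = (
    "review implementation",
    "review the project",
    "check implementation",
    "quality review",
    "validate implementation",
    "implementation review",
)


def select_skill(request: str) -> Optional[str]:
    """Naive router: negative boundaries are checked before the triggers."""
    text = request.lower()
    # Negative boundary 1: PR reviews belong to the sibling PR skill.
    if re.search(r"\b(pr|pull request)\b", text):
        return "arn-code-review-pr"
    # Negative boundary 2: plan validation belongs to the sibling plan skill.
    if re.search(r"\b(review|validate|check) (the |this |my )?plan\b", text):
        return "arn-code-review-plan"
    if any(trigger in text for trigger in TRIGGERS):
        return "arn-code-review-implementation"
    return None
```

Checking the negative boundaries first is the design point: a request such as "quality review of this PR" contains a trigger phrase but is still routed to `arn-code-review-pr`.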

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 10 / 11 Passed

Validation for skill structure

| Criteria | Description | Result |
| --- | --- | --- |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 10 / 11 (Passed) |
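A check like `frontmatter_unknown_keys` can be approximated with a short sketch. The report does not say which key triggered the warning or which keys the spec allows, so the `ALLOWED_KEYS` set and the `version` key in the usage note are assumptions, and the parser handles only simple top-level `key: value` lines rather than full YAML.

```python
from typing import List, Set

# Assumed allow-list; the real spec may permit a different set of keys.
ALLOWED_KEYS: Set[str] = {"name", "description", "license", "metadata"}


def frontmatter_keys(skill_md: str) -> List[str]:
    """Return top-level keys from a leading '---' delimited frontmatter block."""
    lines = skill_md.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no frontmatter at all
    keys: List[str] = []
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter reached
        # Top-level keys start in column 0; indented lines are nested values.
        if line and not line[0].isspace() and not line.startswith("#") and ":" in line:
            keys.append(line.split(":", 1)[0].strip())
    return keys


def unknown_keys(skill_md: str, allowed: Set[str] = ALLOWED_KEYS) -> List[str]:
    """List frontmatter keys that are not in the allow-list."""
    return [key for key in frontmatter_keys(skill_md) if key not in allowed]
```

Given a SKILL.md whose frontmatter declares `name`, `description`, `version`, and a nested `metadata` block, `unknown_keys` would report only `version` under this allow-list, which matches the report's advice to remove such keys or move them under `metadata`.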

Repository
AppsVortex/arness
Reviewed
