Reviews optimization patches using a 5-persona peer review system. Requires unanimous approval backed by benchmarks.
50
55%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./.claude/skills/lading-optimize-review/SKILL.mdQuality
Discovery
32%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description conveys a unique review mechanism (5-persona peer review with unanimous approval) but is too terse and lacks explicit trigger guidance. It relies on internal jargon ('5-persona') that users wouldn't naturally use, and omits a 'Use when...' clause, making it difficult for Claude to reliably select this skill from a large pool.
Suggestions
Add an explicit 'Use when...' clause, e.g., 'Use when the user submits an optimization patch for review, asks for performance review of code changes, or requests approval of optimization PRs.'
Include natural trigger terms users would say, such as 'code review', 'performance patch', 'optimize', 'PR review', 'benchmark results'.
Expand the 'what' with more concrete actions, e.g., 'Evaluates correctness, performance impact, and safety of optimization patches using a 5-persona peer review system.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | It names a specific domain (optimization patches) and describes the mechanism (5-persona peer review, unanimous approval, benchmarks), but doesn't list the concrete actions performed during the review (e.g., checking correctness, measuring performance, validating safety). | 2 / 3 |
Completeness | It describes what the skill does (reviews optimization patches via peer review) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and the 'what' is also only partially described, warranting a 1. | 1 / 3 |
Trigger Term Quality | Includes some relevant terms like 'optimization patches', 'peer review', and 'benchmarks', but misses natural user phrases like 'review my patch', 'code review', 'performance optimization', or 'approve changes'. The term '5-persona' is internal jargon unlikely to be used by a user. | 2 / 3 |
Distinctiveness Conflict Risk | The '5-persona peer review system' and 'optimization patches' are somewhat distinctive, but 'reviews patches' could overlap with general code review skills. The description doesn't clearly delineate its niche from broader code review or optimization skills. | 2 / 3 |
Total | 7 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured, highly actionable skill that defines a rigorous multi-persona review process with clear phases, concrete commands, and explicit decision criteria. Its main strengths are the executable benchmark commands, specific numerical thresholds, and comprehensive checklists. Minor weaknesses include the duplicated outcomes table and the inability to verify referenced template/asset files since no bundle was provided.
Suggestions
Remove the duplicated outcomes table (appears in both the top section and Phase 4) to improve conciseness.
Consider condensing the 'NO EXCEPTIONS' section into a single line like 'Any excuse to skip benchmarks → REJECT' since the individual cases all convey the same rule.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is mostly efficient and avoids explaining concepts Claude already knows, but there's some redundancy — the outcomes table appears twice identically, and some sections like 'NO EXCEPTIONS' are somewhat verbose for what they convey. The arguments table and report ID generation are appropriately detailed since they're project-specific. | 2 / 3 |
Actionability | The skill provides fully executable bash commands with concrete arguments, specific threshold values (>=5% time, >=10% mem, >=20% allocs), detailed checklists for each persona, and clear references to template files and paths. The benchmark commands are copy-paste ready with proper argument substitution. | 3 / 3 |
Workflow Clarity | The 5-phase workflow is clearly sequenced with explicit validation checkpoints throughout — baseline verification before proceeding, statistical requirements as gates, ci/validate as a correctness check, Kani/property tests, and a clear decision matrix. Feedback loops are present (e.g., 'If Kani fails to run' fallback, duplicate found -> REJECT with reference). Missing arguments and missing baselines are caught early with explicit REJECT outcomes. | 3 / 3 |
Progressive Disclosure | The skill references external template files (approved.template.yaml, rejected.template.yaml) and a database file (db.yaml) appropriately, but no bundle files were provided to verify these exist. The skill itself is fairly long (~170 lines of substantive content) and some sections like the persona checklists could potentially be split into a referenced file, though the inline approach is defensible for a review checklist that must be consulted every time. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
e98e305
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.