lading-optimize-review

Reviews optimization patches using a 5-persona peer review system. Requires unanimous approval backed by benchmarks.

Quality

55%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/lading-optimize-review/SKILL.md

Quality

Discovery

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description conveys a unique review mechanism (5-persona peer review with unanimous approval) but is too terse and lacks explicit trigger guidance. It relies on internal jargon ('5-persona') that users wouldn't naturally use, and omits a 'Use when...' clause, making it difficult for Claude to reliably select this skill from a large pool.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when the user submits an optimization patch for review, asks for performance review of code changes, or requests approval of optimization PRs.'

Include natural trigger terms users would say, such as 'code review', 'performance patch', 'optimize', 'PR review', 'benchmark results'.

Expand the 'what' with more concrete actions, e.g., 'Evaluates correctness, performance impact, and safety of optimization patches using a 5-persona peer review system.'

Dimension	Reasoning	Score
Specificity	It names a specific domain (optimization patches) and describes the mechanism (5-persona peer review, unanimous approval, benchmarks), but doesn't list the concrete actions performed during the review (e.g., checking correctness, measuring performance, validating safety).	2 / 3
Completeness	It describes what the skill does (reviews optimization patches via peer review) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and the 'what' is also only partially described, warranting a 1.	1 / 3
Trigger Term Quality	Includes some relevant terms like 'optimization patches', 'peer review', and 'benchmarks', but misses natural user phrases like 'review my patch', 'code review', 'performance optimization', or 'approve changes'. The term '5-persona' is internal jargon unlikely to be used by a user.	2 / 3
Distinctiveness Conflict Risk	The '5-persona peer review system' and 'optimization patches' are somewhat distinctive, but 'reviews patches' could overlap with general code review skills. The description doesn't clearly delineate its niche from broader code review or optimization skills.	2 / 3
	Total	7 / 12 Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured, highly actionable skill that defines a rigorous multi-persona review process with clear phases, concrete commands, and explicit decision criteria. Its main strengths are the executable benchmark commands, specific numerical thresholds, and comprehensive checklists. Minor weaknesses include the duplicated outcomes table and the inability to verify referenced template/asset files since no bundle was provided.

Suggestions

Remove the duplicated outcomes table (appears in both the top section and Phase 4) to improve conciseness.

Consider condensing the 'NO EXCEPTIONS' section into a single line like 'Any excuse to skip benchmarks → REJECT' since the individual cases all convey the same rule.

Dimension	Reasoning	Score
Conciseness	The skill is mostly efficient and avoids explaining concepts Claude already knows, but there's some redundancy — the outcomes table appears twice identically, and some sections like 'NO EXCEPTIONS' are somewhat verbose for what they convey. The arguments table and report ID generation are appropriately detailed since they're project-specific.	2 / 3
Actionability	The skill provides fully executable bash commands with concrete arguments, specific threshold values (>=5% time, >=10% mem, >=20% allocs), detailed checklists for each persona, and clear references to template files and paths. The benchmark commands are copy-paste ready with proper argument substitution.	3 / 3
Workflow Clarity	The 5-phase workflow is clearly sequenced with explicit validation checkpoints throughout — baseline verification before proceeding, statistical requirements as gates, ci/validate as a correctness check, Kani/property tests, and a clear decision matrix. Feedback loops are present (e.g., 'If Kani fails to run' fallback, duplicate found -> REJECT with reference). Missing arguments and missing baselines are caught early with explicit REJECT outcomes.	3 / 3
Progressive Disclosure	The skill references external template files (approved.template.yaml, rejected.template.yaml) and a database file (db.yaml) appropriately, but no bundle files were provided to verify these exist. The skill itself is fairly long (~170 lines of substantive content) and some sections like the persona checklists could potentially be split into a referenced file, though the inline approach is defensible for a review checklist that must be consulted every time.	2 / 3
	Total	10 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	10 / 11 Passed

Repository: DataDog/lading
Commit: e98e305

Reviewed: about 1 hour ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.