CtrlK
BlogDocsLog inGet started
Tessl Logo

code-review-and-quality

Conducts multi-axis code review. Use before merging any change. Use when reviewing code written by yourself, another agent, or a human. Use when you need to assess code quality across multiple dimensions before it enters the main branch.

57

Quality

64%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/code-review-and-quality/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

62%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured code review skill with a clear five-step process, good severity labeling system, and comprehensive checklists. Its main weaknesses are verbosity (explaining concepts Claude already knows, like what edge cases or N+1 queries are) and lack of concrete worked examples showing an actual review being performed on real code. The content would benefit from trimming explanatory material and adding one or two concrete review examples.

Suggestions

Add a concrete worked example showing a real code snippet being reviewed with actual review comments using the severity labels (Critical, Nit, Optional, FYI) — this would significantly boost actionability.

Trim sections that explain concepts Claude already knows (e.g., what edge cases are, what N+1 queries are, what dead code is) to just the actionable checklist items, reducing token cost by ~30-40%.

Move the detailed five-axis criteria and the full review checklist into separate referenced files (e.g., `references/review-axes.md`, `references/review-checklist.md`) to improve progressive disclosure and keep the main skill focused on the workflow.

Either provide the referenced bundle files (security-checklist.md, performance-checklist.md) or remove the references to avoid broken navigation.

DimensionReasoningScore

Conciseness

The skill is comprehensive but verbose for an audience of Claude. Many sections explain concepts Claude already knows well (what edge cases are, what N+1 queries are, what dead code is). The 'Common Rationalizations' and 'Red Flags' tables, while useful, add significant token cost for information that's largely common sense for an LLM. The content could be tightened by ~40% without losing actionable value.

2 / 3

Actionability

The skill provides structured checklists, severity labels, and a clear review process, which is good. However, it lacks concrete executable examples — there are no actual code snippets showing a review being performed, no example of a review comment with proper labeling applied to real code, and the guidance remains largely descriptive rather than demonstrative. The dead code example is a good exception but is one of few concrete illustrations.

2 / 3

Workflow Clarity

The five-step review process is clearly sequenced (understand context → review tests → review implementation → categorize findings → verify verification). Severity labeling provides a clear decision framework. The final verification checklist serves as an explicit quality gate before merge. The multi-model review pattern adds a useful feedback loop.

3 / 3

Progressive Disclosure

The skill references external files ('security-and-hardening', 'performance-optimization', 'references/security-checklist.md', 'references/performance-checklist.md') but no bundle files are provided, so these references are unverifiable. The content itself is monolithic — at ~300 lines, sections like the detailed five-axis criteria, change sizing strategies, and the full review checklist could be split into referenced files to keep the main skill leaner.

2 / 3

Total

9

/

12

Passed

Description

67%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has strong completeness with explicit 'Use when' clauses covering multiple scenarios, which is its greatest strength. However, it lacks specificity about what the 'multiple axes' or 'dimensions' of review actually are, and it misses common trigger terms like 'pull request' or 'PR.' Adding concrete actions and broader trigger vocabulary would significantly improve skill selection accuracy.

Suggestions

List the specific review axes/dimensions (e.g., 'Evaluates security vulnerabilities, performance bottlenecks, style consistency, error handling, and test coverage').

Add common trigger term variations such as 'pull request,' 'PR review,' 'diff,' 'code feedback,' or 'MR' to improve matching against natural user language.

DimensionReasoningScore

Specificity

It names the domain ('code review') and mentions 'multi-axis' and 'multiple dimensions,' but does not list the specific actions or axes involved (e.g., security checks, performance analysis, style linting). The concrete actions remain vague beyond 'conducts code review' and 'assess code quality.'

2 / 3

Completeness

Clearly answers both 'what' (conducts multi-axis code review, assesses code quality across multiple dimensions) and 'when' (before merging any change, when reviewing code written by yourself/another agent/a human, when assessing code quality before it enters the main branch). Explicit 'Use when' clauses are present.

3 / 3

Trigger Term Quality

Includes relevant terms like 'code review,' 'merging,' 'code quality,' and 'main branch,' which users might naturally say. However, it misses common variations such as 'pull request,' 'PR review,' 'diff review,' 'code feedback,' or 'lint.'

2 / 3

Distinctiveness Conflict Risk

The focus on 'multi-axis code review' before merging provides some distinctiveness, but 'code review' is a broad category that could overlap with linting skills, security audit skills, or general code feedback skills. The lack of specific axes or unique methodology limits its distinctiveness.

2 / 3

Total

9

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
addyosmani/agent-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.