agent-reviewer

Agent skill for reviewer - invoke with $agent-reviewer

41

1.15x

Quality: 13% (Does it follow best practices?)

Impact: 81% (1.15x), average score across 3 eval scenarios

Security by Snyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./.agents/skills/agent-reviewer/SKILL.md

Evaluation results

90%

10%

Security Audit: Payment Processing Module

Security review format and categorization

| Criteria | Without context | With context |
| --- | --- | --- |
| Strengths section | 0% | 100% |
| Critical Issues section | 100% | 100% |
| Suggestions section | 100% | 100% |
| Metrics section | 0% | 0% |
| Action Items checklist | 37% | 75% |
| Critical severity for security | 100% | 100% |
| SQL injection identified | 100% | 100% |
| Exposed credentials identified | 100% | 100% |
| Missing input validation identified | 100% | 100% |
| Concrete fix suggested | 100% | 100% |
| Minor/style issues present | 100% | 100% |
| Constructive framing | 100% | 100% |

Without context: $0.1838 · 1m 18s · 8 turns · 13 in / 4,107 out tokens

With context: $0.3132 · 1m 41s · 12 turns · 335 in / 5,395 out tokens

72%

9%

Performance Review: E-Commerce Order Service

Performance and code quality review

| Criteria | Without context | With context |
| --- | --- | --- |
| Review format sections | 25% | 62% |
| Action Items checklist | 57% | 85% |
| N+1 query identified | 100% | 100% |
| N+1 fix suggested | 100% | 100% |
| Redundant computation identified | 100% | 100% |
| Performance severity classification | 0% | 0% |
| SRP violation identified | 0% | 20% |
| DRY violation identified | 100% | 100% |
| Quality metrics included | 0% | 0% |
| Constructive suggestions | 100% | 100% |
| Minor issues present | 71% | 100% |
| No personal blame language | 100% | 100% |

Without context: $0.3217 · 1m 46s · 12 turns · 18 in / 5,834 out tokens

With context: $0.3416 · 1m 49s · 12 turns · 172 in / 6,023 out tokens

83%

14%

Code Review: Subscription Billing Module

Functionality and maintainability review

| Criteria | Without context | With context |
| --- | --- | --- |
| Review format structure | 12% | 75% |
| Action Items checklist | 0% | 100% |
| Edge case gap identified | 100% | 100% |
| Error handling gap identified | 100% | 100% |
| Functionality severity classification | 87% | 100% |
| Unclear naming identified | 100% | 100% |
| Testability issue identified | 100% | 30% |
| Rename suggestion provided | 100% | 100% |
| Metrics section content | 0% | 0% |
| Minor severity for style issues | 14% | 100% |
| Constructive explanation | 100% | 100% |
| Positive feedback present | 75% | 100% |

Without context: $0.2905 · 1m 54s · 8 turns · 13 in / 6,015 out tokens

With context: $0.5454 · 2m 31s · 21 turns · 429 in / 7,706 out tokens

Repository: ruvnet/claude-flow
Evaluated
Agent: Claude Code
Model: Claude Sonnet 4.6
