CtrlK
BlogDocsLog inGet started
Tessl Logo

uinaf/autoreview

Run structured Codex/Claude autoreview closeout: choose the target, collect schema-validated findings, and rerun tests plus review until clean.

84

1.08x
Quality

89%

Does it follow best practices?

Impact

74%

1.08x

Average score across 4 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is highly actionable with a clear validated workflow, but it repeats its stop/dirty-mode rules and points to reference files that do not exist in the bundle. Tightening the repetition and shipping the referenced files would raise it to top marks.

Suggestions

Add the missing references/ files (scope.md, troubleshooting.md, engine-details.md, upstream.md) so the inline links resolve, or remove the dangling references.

Consolidate the repeated "exit 0 with no accepted/actionable findings → stop" rule and dirty-mode caveat into one authoritative section instead of restating them in Core Workflow, Context Efficiency, Helper, and Final Report.

Trim the engine Defaults prose (e.g., "usually delivers the best review results") that states preference Claude can infer from the contract.

DimensionReasoningScore

Conciseness

The body is command-dense and assumes competence, but the clean-exit/stop condition and dirty-mode caveats recur across Core Workflow, Context Efficiency, Helper, and Final Report, so it could be tightened.

2 / 3

Actionability

It provides copy-paste-ready executable bash for every target mode, panels, and the harness, with real flags and model/thinking syntax rather than pseudocode.

3 / 3

Workflow Clarity

The numbered 7-step Core Workflow sequences pick-target → verify → fix → rerun-tests-plus-autoreview → stop-on-clean with an explicit validation feedback loop.

3 / 3

Progressive Disclosure

References are well-signaled and one level deep in the text, but the references/ directory and all four referenced .md files are absent on disk, so the navigation is broken rather than functional.

2 / 3

Total

10

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, trigger-rich, and complete with an explicit Use-when clause in third person. It cleanly satisfies all four dimensions with no vague fluff.

DimensionReasoningScore

Specificity

"choose the target, validate findings, rerun focused tests, and repeat review until clean" lists multiple concrete actions across local changes, PRs, branch diffs, and commits, matching the score-3 anchor.

3 / 3

Completeness

It states both what the skill does (run structured autoreview closeout) and when to use it via an explicit "Use when asked for..." clause, satisfying the what+when requirement.

3 / 3

Trigger Term Quality

"autoreview, second-model review, pre-merge review, or readiness-to-ship review" alongside "Codex review / Claude review" gives good coverage of natural phrasings a user would actually say.

3 / 3

Distinctiveness Conflict Risk

It carves a clear closeout-review niche with distinct triggers and uses third-person voice, making conflict with unrelated skills unlikely.

3 / 3

Total

12

/

12

Passed

Validation

87%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation14 / 16 Passed

Validation for skill structure

CriteriaDescriptionResult

relative_links

Relative link issues: 9 missing

Warning

referenced_paths_exist

Referenced path issues: 18 missing

Warning

Total

14

/

16

Passed

Reviewed

Table of Contents