CtrlK
BlogDocsLog inGet started
Tessl Logo

uinaf/cross-harness-review

Run a read-only code review through the opposite AI coding harness and return uinaf review-style findings, evidence, unverified gaps, and a ship-it / needs-review / blocked verdict. Use for /review-with-claudex, cross-harness review, opposite-model review, Claude reviewing Codex work, or Codex reviewing Claude work.

92

Quality

92%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description that clearly defines a specific niche (cross-AI-harness code review), lists concrete outputs (findings, evidence, gaps, verdict), and provides explicit trigger terms covering multiple natural phrasings. The negative boundary statement adds further clarity. The only notable flaw is what appears to be a typo ('uinaf' instead of likely 'unified'), which slightly reduces readability.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: 'run a read-only code review', 'return findings, evidence, unverified gaps', and 'a ship-it / needs-review / blocked verdict'. Also explicitly states what it is NOT ('Not an implementation or fix loop').

3 / 3

Completeness

Clearly answers both 'what' (run read-only code review, return review-style findings/evidence/gaps/verdict) and 'when' (explicit 'Use for' clause with multiple trigger scenarios). Also includes a negative boundary ('Not an implementation or fix loop').

3 / 3

Trigger Term Quality

Includes strong natural trigger terms: '/review-with-claudex', 'cross-harness review', 'opposite-model review', 'Claude reviewing Codex work', 'Codex reviewing Claude work'. These cover multiple natural phrasings a user might use. The term 'uinaf' appears to be a typo (likely 'unified') which slightly reduces clarity but doesn't undermine the trigger coverage.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche: cross-harness AI-to-AI code review with specific triggers like '/review-with-claudex' and 'opposite-model review'. The explicit exclusion of implementation/fix loops further distinguishes it from general code review or coding skills.

3 / 3

Total

12

/

12

Passed

Implementation

79%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted, actionable skill that efficiently teaches cross-harness code review with concrete CLI commands and a clear output contract. Its main weaknesses are the lack of explicit validation/error-recovery steps after delegation (e.g., what if the opposite harness returns malformed output?) and the inability to verify the referenced bundle file. The shared prompt template and dual output examples (success and blocked) are strong design choices.

Suggestions

Add a validation step after receiving the opposite harness's output (e.g., check that the response contains the required verdict field before reporting), which would improve workflow clarity for this multi-step delegation process.

Ensure the referenced file `references/headless-modes.md` exists in the bundle, or note its expected contents inline as a fallback.

DimensionReasoningScore

Conciseness

The skill is lean and efficient. It assumes Claude understands what a harness is, what diffs/PRs are, and how CLI tools work. No unnecessary explanations of basic concepts. Every section serves a clear purpose.

3 / 3

Actionability

Provides fully executable CLI commands for both directions (Codex→Claude, Claude→Codex), a concrete shared prompt template, specific tool allowlists, and exact output format examples including the failure case. Copy-paste ready.

3 / 3

Workflow Clarity

The workflow steps are clearly sequenced (define target → detect harness → preflight → delegate → report), but there's no explicit validation checkpoint after the review delegation step. The 'blocked' output example partially addresses failure, but there's no feedback loop for partial failures or malformed output from the opposite harness.

2 / 3

Progressive Disclosure

References headless-modes.md for CLI flag details which is good progressive disclosure, but no bundle files were provided so we can't verify the reference exists. The skill itself is well-structured with clear sections, but the inline content for both harness directions could potentially be split into separate reference files for cleaner navigation.

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents