CtrlK
BlogDocsLog inGet started
Tessl Logo

optimize-skill

Use when iteratively optimizing an existing SKILL.md (or a skill folder with bundle files) — runs a Tessl-gated Ralph loop with snapshot-and-revert protection, never accepts a worse `tessl skill review` score, and stops when no candidate change improves both the score and the structural quality. Triggers for `/optimize-skill PATH`, "make this skill better", "iterate on this SKILL.md", "improve this skill's tessl score".

72

Quality

88%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description that clearly communicates a specific, well-defined capability with explicit trigger terms and conditions. It effectively answers both what the skill does and when to use it, with domain-specific terminology that makes it highly distinguishable. The only minor weakness is the use of jargon ('Ralph loop', 'Tessl-gated') which may not be universally understood, but these terms serve as precise identifiers for the skill's niche.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: 'runs a Tessl-gated Ralph loop with snapshot-and-revert protection', 'never accepts a worse tessl skill review score', 'stops when no candidate change improves both the score and the structural quality'. These are detailed, concrete behaviors.

3 / 3

Completeness

Clearly answers both 'what' (iteratively optimizes SKILL.md via Tessl-gated Ralph loop with snapshot-and-revert, rejecting worse scores) and 'when' (explicit 'Triggers for' clause with specific commands and natural language phrases). The 'Use when' equivalent is present at the start and the 'Triggers for' clause provides explicit trigger guidance.

3 / 3

Trigger Term Quality

Includes natural trigger terms users would say: '/optimize-skill PATH', 'make this skill better', 'iterate on this SKILL.md', 'improve this skill's tessl score', plus 'SKILL.md' and 'skill folder'. Good coverage of both command-style and natural language triggers.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive — targets a very specific niche (iterative optimization of SKILL.md files using Tessl scoring with a Ralph loop). The combination of 'tessl skill review', 'SKILL.md', and the optimization loop concept makes it very unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted, highly actionable skill with excellent workflow clarity — the iterative optimize-score-revert loop is precisely defined with executable code, hard gates, and explicit error recovery. Its main weakness is moderate verbosity: the process is described three times (mermaid flowchart, step details, data flow section), and the anti-patterns section partially overlaps with the acceptance criteria. The reference to REFERENCE.md is appropriate progressive disclosure but cannot be verified without the bundle.

Suggestions

Remove or significantly condense the 'Data flow' section — it restates the mermaid flowchart and step details without adding new information.

Consolidate the 'Acceptance Criteria' table and 'What we do NOT want' section to eliminate overlap (e.g., G1/G2 are restated in the anti-patterns list).

DimensionReasoningScore

Conciseness

The skill is reasonably efficient for its complexity but includes some verbose sections — the 'What we do NOT want' section repeats concepts already covered in the acceptance criteria table, and the 'Data flow' section largely restates the process flowchart and step details. The mermaid diagram + step details + data flow is triple-coverage of the same workflow.

2 / 3

Actionability

Provides fully executable bash snippets for baseline scoring, snapshot/revert with rsync, numeric comparison with awk, and jq extraction. The invocation syntax is concrete with defaults specified, and the ROI-ranked candidate table gives specific, actionable optimization strategies with effort/risk ratings.

3 / 3

Workflow Clarity

The multi-step process is clearly sequenced with explicit validation at every iteration (G1 gate), a revert mechanism on failure, a final subagent verification pass (G6), and a whole-run revert if G2 fails. The feedback loop (apply → score → compare → keep/revert) is explicit with code. Acceptance criteria are enumerated as hard gates.

3 / 3

Progressive Disclosure

The skill references a sibling REFERENCE.md for empirical evidence and common mistakes, which is good progressive disclosure. However, no bundle files were provided, so we can't verify REFERENCE.md exists. The main SKILL.md itself is quite long (~200+ lines) and the Data flow section could arguably be extracted to a reference file rather than duplicating the flowchart content inline.

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
AndreJorgeLopes/proof-of-skill
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.