
tessl-labs/skill-optimizer

Optimize your skills and tiles: review SKILL.md quality, generate eval scenarios, run evals, compare across models, diagnose gaps, and re-run until scores improve.

Quality: 94% (Does it follow best practices?)

Impact: 88%, 1.07x (average score across 24 eval scenarios)

Security by Snyk: Passed, no known issues


evals/scenario-12/criteria.json

{
  "context": "Tests whether the agent only modifies SKILL.md and not REFERENCE.md when improving a skill bundle, and correctly applies the improvements flagged in the review. The core constraint being tested is that only SKILL.md gets changed — other bundle files are read for context but not modified.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "REFERENCE.md not recreated",
      "description": "The output does NOT include a modified or recreated REFERENCE.md file — REFERENCE.md content is not rewritten as output",
      "max_score": 15
    },
    {
      "name": "No REFERENCE.md changes proposed",
      "description": "change_log.md does NOT propose or describe any changes to REFERENCE.md — it is acknowledged as read-only",
      "max_score": 10
    },
    {
      "name": "SKILL.md produced",
      "description": "An improved SKILL.md file is produced as output",
      "max_score": 8
    },
    {
      "name": "Use when clause added",
      "description": "The output SKILL.md description block includes a 'Use when...' trigger clause",
      "max_score": 12
    },
    {
      "name": "Inline duplication removed",
      "description": "Output SKILL.md removes content that duplicated REFERENCE.md (e.g. resize mode lists, format lists, quality notes that are already in REFERENCE.md)",
      "max_score": 12
    },
    {
      "name": "REFERENCE.md linked",
      "description": "Output SKILL.md links to REFERENCE.md for the detailed content that was removed (rather than just deleting it)",
      "max_score": 10
    },
    {
      "name": "Core examples retained",
      "description": "Output SKILL.md retains the main executable bash command examples (resize, convert, batch examples)",
      "max_score": 8
    },
    {
      "name": "SKILL.md shorter",
      "description": "Output SKILL.md is shorter than the input SKILL.md (removes duplication)",
      "max_score": 8
    },
    {
      "name": "Change log documents SKILL.md changes",
      "description": "change_log.md describes the changes made to SKILL.md with specific references to what was added/removed",
      "max_score": 8
    },
    {
      "name": "Change log explains why",
      "description": "change_log.md explains why each change improves the skill quality, rather than just listing what changed",
      "max_score": 9
    }
  ]
}
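
The page does not say how a `weighted_checklist` is aggregated into a scenario score. As a rough sketch, assuming each item is awarded between 0 and its `max_score` and the scenario score is the plain sum, the checklist above could be totalled as follows. The function names `max_total` and `total_score` and the `awarded` mapping are hypothetical and used only for illustration; they are not part of any Tessl API.

```python
import json

# Names and max_score values copied from the checklist above;
# descriptions are omitted to keep the sketch short.
CRITERIA = json.loads("""
{
  "type": "weighted_checklist",
  "checklist": [
    {"name": "REFERENCE.md not recreated", "max_score": 15},
    {"name": "No REFERENCE.md changes proposed", "max_score": 10},
    {"name": "SKILL.md produced", "max_score": 8},
    {"name": "Use when clause added", "max_score": 12},
    {"name": "Inline duplication removed", "max_score": 12},
    {"name": "REFERENCE.md linked", "max_score": 10},
    {"name": "Core examples retained", "max_score": 8},
    {"name": "SKILL.md shorter", "max_score": 8},
    {"name": "Change log documents SKILL.md changes", "max_score": 8},
    {"name": "Change log explains why", "max_score": 9}
  ]
}
""")


def max_total(criteria):
    """Return the highest achievable score for the checklist."""
    return sum(item["max_score"] for item in criteria["checklist"])


def total_score(criteria, awarded):
    """Sum awarded points per item (assumed aggregation, not confirmed).

    Each value is clamped to [0, max_score]; items missing from
    `awarded` count as 0.
    """
    total = 0
    for item in criteria["checklist"]:
        points = awarded.get(item["name"], 0)
        total += max(0, min(points, item["max_score"]))
    return total


# Hypothetical grading result: one item at full marks, one over-awarded.
example_awarded = {
    "REFERENCE.md not recreated": 15,
    "SKILL.md produced": 20,  # clamped down to its max_score of 8
}
print(f"{total_score(CRITERIA, example_awarded)}/{max_total(CRITERIA)}")  # prints 23/100
```

The ten `max_score` values sum to 100, so under this assumption a scenario total reads directly as a percentage. The two REFERENCE.md items together carry 25 of those points, which matches the context field's statement that leaving REFERENCE.md untouched is the core constraint.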
