tessl/skill-optimizer

Optimize your skills and tiles: review SKILL.md quality, generate eval scenarios, run evals, compare across models, diagnose gaps, and re-run until scores improve.

1.22x

Quality

91%

Does it follow best practices?

Impact

86%

1.22x

Average score across 29 eval scenarios

Securityby

Advisory

Suggest reviewing before use

Activation Zero-Firing Diagnosis

Name: tessl/skill-optimizer
Rating: 86.50999999999999 (1 reviews)
Author: tessl

Problem Description

I just ran an activation eval on my tile and most of the scenarios show no skill fired at all (— across the routing table). Some of my skills never appear as activated in any scenario.

I'm not sure how to read this:

Are these real routing gaps where my skill descriptions are too narrow?
Or are some scenarios just out of scope for what my tile does?
And for the skills that never fired, is that a description problem or a coverage problem?

Help me work through this so I can decide what to do next — rewrite descriptions, add scenarios, or accept the result.

Output Specification

Walk me through how to interpret the zero-activation result. For each — row, help me decide whether it's a routing gap or out of scope. For any skill that never fired, propose what to do about it.

evals

scenario-1

scenario-2

scenario-3

scenario-4

scenario-5

scenario-6

scenario-7

scenario-8

scenario-9

scenario-10

scenario-11

scenario-12

scenario-13

scenario-14

scenario-15

scenario-16

scenario-17

scenario-18

scenario-19

scenario-20

scenario-21

scenario-22

scenario-23

scenario-24

scenario-25

scenario-26

criteria.json

task.md

scenario-27

scenario-28

scenario-29

skills

README.md

tile.json

tessl/skill-optimizer

task.md.css-3qkkll{font-size:var(--chakra-font-sizes-sm);font-weight:var(--chakra-font-weights-normal);color:var(--chakra-colors-gray-300);}evals/scenario-26/

Activation Zero-Firing Diagnosis

Problem Description

Output Specification

task.mdevals/scenario-26/