tessl/skill-optimizer

Optimize your skills and tiles: review SKILL.md quality, generate eval scenarios, run evals, compare across models, diagnose gaps, and re-run until scores improve.

1.10x

Quality

91%

Does it follow best practices?

Impact

92%

1.10x

Average score across 25 eval scenarios

Securityby

Passed

No known issues

Expand Eval Coverage for shopify-connector Tile

Name: tessl/skill-optimizer
Rating: 91.95 (1 reviews)
Author: tessl

Problem Description

The shopify-connector tile already has three evaluation scenarios that the team has been using for several months. These scenarios cover order processing, webhook handling, and authentication flows. The team recently ran a new scenario generation job (run ID: scen-gen-7742) that produced two additional scenarios covering edge cases around rate limiting and bulk import operations. They want to add these new scenarios alongside the existing ones.

A previous intern accidentally ran the download command without specifying any strategy flag and overwrote all the existing scenarios. The team wants to make sure this doesn't happen again — they need a download command that explicitly adds new scenarios without touching the existing three.

After downloading, they want to see a verified list of what's in the evals folder to confirm both old and new scenarios are present.

The tile is located at ./shopify-connector/. The existing scenarios are:

evals/order-processing/
evals/webhook-handling/
evals/auth-flow/

Output Specification

Produce a shell script called download-scenarios.sh that:

Downloads the newly generated scenarios from run ID scen-gen-7742 into the tile's evals directory without removing existing scenarios
Verifies the download by listing the scenario files in the evals directory

The script should be safe to re-run if something goes wrong mid-download.

skills

README.md

tile.json

tessl/skill-optimizer

task.md.css-3qkkll{font-size:var(--chakra-font-sizes-sm);font-weight:var(--chakra-font-weights-normal);color:var(--chakra-colors-gray-300);}evals/scenario-19/

Expand Eval Coverage for shopify-connector Tile

Problem Description

Output Specification

task.mdevals/scenario-19/