Optimize your skills and tiles: review SKILL.md quality, generate eval scenarios, run evals, compare across models, diagnose gaps, and re-run until scores improve.
88
94%
Does it follow best practices?
Impact
88%
1.07xAverage score across 24 eval scenarios
Passed
No known issues
An internal tooling team has just reviewed their deploy-helper skill and received raw output from their skill evaluation tool. The feedback contains a mix of errors, missing required elements, and style issues — but the raw output is hard to act on. The team wants a structured, prioritized recommendation document they can work through systematically.
Your task is to process the skill review output below and produce a clear, actionable recommendations document. The document should tell the team exactly what to fix, in what order, how to make each change, and why each change matters for their skill's quality.
Produce a file recommendations.md containing actionable, structured recommendations the team can work through to improve the skill. Organize them so the most important fixes are tackled first. For each recommendation include enough context that a developer can understand the issue and implement the change without needing additional research.
The following files are provided as inputs. Extract them before beginning.
=============== FILE: review_output.txt =============== === Skill Review: deploy-helper === Overall Score: 51%
ERRORS (must fix): [ERROR] Description field is missing the "Use when..." trigger clause [ERROR] Broken file reference: deployment checklist - file not found
Dimension Scores: Completeness: 1/3 (33%) - Missing "Use when" clause reduces routing accuracy Actionability: 1/3 (33%) - Instructions are abstract; no executable examples provided Conciseness: 2/3 (66%) - Some repeated explanations; several paragraphs explain concepts Claude already knows (e.g., what JSON is, how HTTP works) Robustness: 2/3 (66%) - Good error handling patterns, could add retry logic examples
Judge Suggestions:
This skill helps you deploy services to the internal platform.
REST APIs use HTTP methods like GET, POST, PUT, and DELETE to communicate. JSON is a text format for data exchange. Our deploy system uses these standard patterns.
When you want to deploy a service, you should:
For more information see the deployment checklist.
The deployment configuration should include your service name, image reference, replica count, and resource limits. Make sure these are set correctly before deploying. =============== END FILE ===============
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
scenario-16
scenario-17
scenario-18
scenario-19
scenario-20
scenario-21
scenario-22
scenario-23
scenario-24
skills
compare-skill-model-performance
optimize-skill-instructions
references
optimize-skill-performance
optimize-skill-performance-and-instructions