Defines service level objectives, creates error budget policies, designs incident response procedures, develops capacity models, and produces monitoring configurations and automation scripts for production systems. Use when defining SLIs/SLOs, managing error budgets, building reliable systems at scale, incident management, chaos engineering, toil reduction, or capacity planning.
98
100%
Does it follow best practices?
Impact
96%
1.07x
Average score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
100%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that follows best practices. It uses third-person voice, lists specific concrete capabilities, includes a comprehensive 'Use when...' clause with natural trigger terms from the SRE domain, and carves out a clear niche that distinguishes it from general DevOps or monitoring skills.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific concrete actions: 'Defines service level objectives', 'creates error budget policies', 'designs incident response procedures', 'develops capacity models', and 'produces monitoring configurations and automation scripts'. | 3 / 3 |
| Completeness | Clearly answers both what (defines SLOs, creates error budget policies, designs incident response, develops capacity models, produces monitoring configs) AND when, with an explicit 'Use when...' clause listing seven specific trigger scenarios. | 3 / 3 |
| Trigger Term Quality | Excellent coverage of natural SRE terminology users would say: 'SLIs/SLOs', 'error budgets', 'reliable systems at scale', 'incident management', 'chaos engineering', 'toil reduction', 'capacity planning'. These are standard industry terms practitioners would naturally use. | 3 / 3 |
| Distinctiveness / Conflict Risk | Highly distinctive SRE/reliability engineering niche with domain-specific triggers like 'SLIs/SLOs', 'error budgets', 'chaos engineering', and 'toil reduction' that are unlikely to conflict with general DevOps or monitoring skills. | 3 / 3 |
| Total | | 12 / 12 Passed |
Implementation
100%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is an exemplary skill file that demonstrates best practices across all dimensions. It's concise yet comprehensive, provides executable code examples for all major SRE tasks, has clear workflow steps with validation checkpoints, and uses progressive disclosure effectively through a well-organized reference table. The MUST DO/MUST NOT DO constraints provide clear boundaries without being verbose.
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is lean and efficient, assuming Claude's competence with SRE concepts. No unnecessary explanations of what SLOs, SLIs, or error budgets are; it jumps straight to actionable guidance and concrete examples. | 3 / 3 |
| Actionability | Provides fully executable code examples including Prometheus alerting rules, PromQL queries, and a complete Python automation script. The SLO calculation example shows exact math, and all code is copy-paste ready. | 3 / 3 |
| Workflow Clarity | The core workflow has clear numbered steps with explicit validation checkpoints ('Verify alignment', 'verify recovery meets RTO/RPO targets before marking complete'). The MUST DO/MUST NOT DO constraints provide clear guardrails for complex operations. | 3 / 3 |
| Progressive Disclosure | Excellent structure with a reference table pointing to one-level-deep topic files with clear 'Load When' guidance. The main skill provides a concise overview while detailed guidance is appropriately delegated to reference files. | 3 / 3 |
| Total | | 12 / 12 Passed |
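
The skill's own SLO calculation example is not reproduced on this page. As a rough illustration of the kind of arithmetic the Actionability row refers to, the following is a minimal sketch of error budget math. The function names, the 99.9% target, and the 30-day window are all assumptions chosen for illustration; they are not taken from the skill itself.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction (e.g. 0.999 for "three nines"); the 30-day
    default window is an illustrative assumption.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction strictly between 0 and 1")
    window_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * window_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent (negative means overspent)."""
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction strictly between 0 and 1")
    allowed_bad = (1.0 - slo_target) * total_events  # failures the SLO tolerates
    actual_bad = total_events - good_events          # failures observed so far
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    # Hypothetical inputs: 99.9% target, 30-day window, 1,000,000 requests.
    print(f"budget: {error_budget_minutes(0.999):.1f} min")
    print(f"remaining: {budget_remaining(0.999, 999_500, 1_000_000):.0%}")
```

Under these assumed inputs, a 99.9% target over 30 days allows roughly 43.2 minutes of downtime, and 500 failed requests out of 1,000,000 consume about half of the request-based budget.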
Validation
100%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
5b76101