Defines service level objectives, creates error budget policies, designs incident response procedures, develops capacity models, and produces monitoring configurations and automation scripts for production systems. Use when defining SLIs/SLOs, managing error budgets, building reliable systems at scale, incident management, chaos engineering, toil reduction, or capacity planning.
95
92%
Does it follow best practices?
Impact
96%
1.07x
Average score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
100%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly defines the SRE domain with specific capabilities and comprehensive trigger terms. It follows the recommended pattern with concrete actions followed by an explicit 'Use when...' clause. The domain-specific terminology (SLIs/SLOs, error budgets, chaos engineering, toil reduction) makes it highly distinctive and easy for Claude to match against relevant user requests.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific concrete actions: 'defines service level objectives', 'creates error budget policies', 'designs incident response procedures', 'develops capacity models', 'produces monitoring configurations and automation scripts'. These are all concrete, actionable capabilities. | 3 / 3 |
| Completeness | Clearly answers both 'what' (defines SLOs, creates error budget policies, designs incident response procedures, develops capacity models, produces monitoring configs) and 'when' with an explicit 'Use when...' clause listing specific trigger scenarios. | 3 / 3 |
| Trigger Term Quality | Includes strong natural keywords users would say: 'SLIs/SLOs', 'error budgets', 'incident management', 'chaos engineering', 'toil reduction', 'capacity planning', 'reliable systems at scale', 'monitoring configurations'. These cover both acronyms and full terms that SRE practitioners naturally use. | 3 / 3 |
| Distinctiveness Conflict Risk | Occupies a clear SRE/reliability engineering niche with highly specific domain terms like 'SLIs/SLOs', 'error budgets', 'chaos engineering', and 'toil reduction' that are unlikely to conflict with general DevOps, monitoring, or coding skills. | 3 / 3 |
| Total | | 12 / 12 Passed |
Implementation
85%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong SRE skill with excellent actionability through production-ready code examples and good progressive disclosure via a well-structured reference table. The workflow is clear with appropriate validation checkpoints. Minor improvements could be made in conciseness by trimming some of the constraints that represent standard SRE knowledge Claude already possesses.
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is reasonably efficient but includes some content that could be tightened. The constraints section has items that are standard SRE knowledge Claude would already know (e.g., 'write blameless postmortems', 'balance reliability with feature velocity'). The concrete examples are valuable but the Python script is quite long for a skill file that also references external files. | 2 / 3 |
| Actionability | Excellent actionability with fully executable Prometheus alerting rules, PromQL queries, and a complete Python auto-remediation script. The SLO definition includes concrete calculations with specific numbers. All code examples are copy-paste ready and production-relevant. | 3 / 3 |
| Workflow Clarity | The core workflow is clearly sequenced with six numbered steps. It includes explicit validation checkpoints ('Verify alignment — Confirm SLO targets reflect user expectations before proceeding') and feedback loops ('verify recovery meets RTO/RPO targets before marking the experiment complete'). The error budget example demonstrates a clear decision trigger for policy action. | 3 / 3 |
| Progressive Disclosure | Excellent progressive disclosure with a clear reference table that specifies exactly when to load each detailed reference file. The main skill provides an overview with concrete examples while deferring detailed guidance to five well-organized reference files, all one level deep with clear navigation signals. | 3 / 3 |
| Total | | 11 / 12 Passed |
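
The review credits the skill with concrete SLO calculations and an error budget example that triggers a policy action, but the skill's own example is not reproduced on this page. The following is only a minimal sketch of what such a calculation might look like. The function names, the 30-day window, and the "freeze when the budget is exhausted" rule are assumptions for illustration, not the skill's actual code or policy.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime in minutes for an availability SLO over the window.

    Assumption: a rolling window of `window_days` days (30 by default).
    """
    return (1.0 - slo_target) * window_days * 24 * 60


def budget_remaining(slo_target: float, good: int, total: int) -> float:
    """Fraction of a request-based error budget still unspent.

    A negative result means the budget is overspent.
    """
    allowed_failures = (1.0 - slo_target) * total
    actual_failures = total - good
    if allowed_failures == 0:
        # Either a 100% target or no traffic: any failure exhausts the budget.
        return 0.0 if actual_failures else 1.0
    return 1.0 - actual_failures / allowed_failures


def policy_action(remaining: float, freeze_threshold: float = 0.0) -> str:
    """Hypothetical policy: freeze releases once the budget is used up."""
    if remaining <= freeze_threshold:
        return "freeze feature releases"
    return "continue releases"


if __name__ == "__main__":
    remaining = budget_remaining(0.999, good=999_500, total=1_000_000)
    print(f"Budget: {error_budget_minutes(0.999):.1f} min per 30 days")
    print(f"Remaining: {remaining:.0%} -> {policy_action(remaining)}")
```

A real policy would likely use several thresholds (for example, alerting before the budget is fully spent) and read the good and total counts from a monitoring system instead of hard-coded values.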
Validation
100%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
3d95bb1