sre-engineer

Defines service level objectives, creates error budget policies, designs incident response procedures, develops capacity models, and produces monitoring configurations and automation scripts for production systems. Use when defining SLIs/SLOs, managing error budgets, building reliable systems at scale, incident management, chaos engineering, toil reduction, or capacity planning.

Quality

82%

Does it follow best practices?

Run evals on this skill

Adds up to 20 points to the overall score

View guide

Securityby

Passed

No findings from the security scan

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid SRE skill with excellent actionability — the concrete examples (Prometheus alerting, PromQL queries, Python automation) are production-ready and immediately useful. The main weaknesses are that too much detailed content is inline rather than in the referenced files, and the constraints section restates SRE fundamentals Claude already knows. The workflow is clear but could benefit from more explicit feedback loops for error recovery in chaos engineering and incident response scenarios.

Suggestions

Move the lengthy code examples (Prometheus rules, Python script, PromQL queries) into the referenced files (e.g., references/monitoring-alerting.md, references/automation-toil.md) and keep only a brief example in SKILL.md to improve conciseness and progressive disclosure.

Trim the MUST DO / MUST NOT DO constraints to only non-obvious, project-specific rules — remove items like 'Write blameless postmortems' and 'Monitor golden signals' that are standard SRE knowledge Claude already has.

Add explicit feedback loops to the core workflow, e.g., 'If SLO targets don't align with user expectations → adjust targets and re-validate' and 'If chaos experiment reveals unmet RTO/RPO → fix recovery mechanism → re-run experiment'.

Dimension	Reasoning	Score
Conciseness	The skill is reasonably efficient but includes some redundancy. The constraints section (MUST DO / MUST NOT DO) largely restates SRE principles Claude already knows. The concrete examples are valuable but lengthy — the Python script and Prometheus rules could be trimmed or moved to reference files. Some items like 'Write blameless postmortems for all incidents' are standard SRE knowledge that don't need explicit instruction.	2 / 3
Actionability	The skill provides fully executable, copy-paste ready examples: a complete Prometheus alerting rule with multiwindow burn rate, PromQL golden signal queries, a working Python auto-remediation script, and concrete error budget calculations. These are specific, real-world configurations rather than pseudocode or abstract descriptions.	3 / 3
Workflow Clarity	The core workflow has a clear 6-step sequence and includes a validation checkpoint in step 3 ('Verify alignment') and step 6 ('verify recovery meets RTO/RPO targets'). However, the workflow lacks explicit feedback loops for error recovery — e.g., what happens if SLO targets don't align with user expectations in step 3, or if chaos experiments fail. For a skill involving destructive operations (chaos engineering) and production systems, the validation steps are present but could be more explicit with retry/fix loops.	2 / 3
Progressive Disclosure	The reference table is well-structured with clear 'Load When' guidance, pointing to five separate reference files. However, no bundle files were provided, so we can't verify these references exist. More importantly, the SKILL.md itself contains substantial inline content (lengthy Prometheus rules, Python scripts, PromQL queries) that would be better placed in the referenced files, making the main file a true overview rather than a hybrid overview+detailed-examples document.	2 / 3
	Total	9 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly defines the SRE domain with specific concrete actions, comprehensive trigger terms covering both acronyms and natural language, and an explicit 'Use when' clause. It is well-structured, uses third person voice throughout, and occupies a distinct niche that minimizes conflict risk with other skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'defines service level objectives', 'creates error budget policies', 'designs incident response procedures', 'develops capacity models', 'produces monitoring configurations and automation scripts'. These are all concrete, well-defined deliverables.	3 / 3
Completeness	Clearly answers both 'what' (defines SLOs, creates error budget policies, designs incident response procedures, develops capacity models, produces monitoring configs) and 'when' with an explicit 'Use when...' clause listing specific trigger scenarios.	3 / 3
Trigger Term Quality	Includes strong natural keywords users would say: 'SLIs/SLOs', 'error budgets', 'incident management', 'chaos engineering', 'toil reduction', 'capacity planning', 'reliable systems at scale', 'monitoring configurations'. These cover the SRE domain comprehensively with both acronyms and full terms.	3 / 3
Distinctiveness Conflict Risk	Occupies a clear SRE/reliability engineering niche with highly specific trigger terms like 'SLIs/SLOs', 'error budgets', 'chaos engineering', and 'toil reduction' that are unlikely to conflict with general DevOps, monitoring, or incident management skills.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: Jeffallan/claude-skills
Path: skills/sre-engineer/SKILL.md
Commit: e8be415

Reviewed: about 4 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.