Skill de observabilidade e confiabilidade operacional. Use quando precisar definir logs, metricas, tracing, alertas, health checks, readiness, error budgets, rollback e operacao segura de servicos. Trigger em: "observabilidade", "observability", "SRE", "logs estruturados", "metricas", "tracing distribuido", "health check", "readiness probe", "error budget", "SLO", "alertas", "rollback seguro", "runbook operacional".
56
62%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/20-observability-sre/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a well-structured skill description with strong trigger term coverage in both Portuguese and English, clear 'when to use' guidance, and a distinct niche in observability/SRE. Its main weakness is that the capability descriptions lean toward listing topics rather than specifying concrete actions (e.g., 'define logs' rather than 'configure structured logging pipelines, set up distributed tracing, create alerting rules').
Suggestions
Replace the generic verb 'definir' with more specific action verbs for each capability area, e.g., 'configure structured logging, instrument distributed tracing, define SLOs and error budgets, create alerting rules and runbooks'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (observability and operational reliability) and lists several relevant concepts (logs, metrics, tracing, alerts, health checks, readiness, error budgets, rollback), but these read more as a list of topics than concrete actions. The verb 'definir' (define) is used but is somewhat vague about what specific actions are performed with each concept. | 2 / 3 |
Completeness | The description clearly answers both 'what' (observability and operational reliability covering logs, metrics, tracing, alerts, health checks, etc.) and 'when' (explicit 'Use quando precisar...' clause plus a dedicated 'Trigger em:' section listing specific trigger terms). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms in both Portuguese and English, including 'observabilidade', 'observability', 'SRE', 'logs estruturados', 'metricas', 'tracing distribuido', 'health check', 'readiness probe', 'error budget', 'SLO', 'alertas', 'rollback seguro', 'runbook operacional'. These are terms users would naturally use when seeking this type of skill. | 3 / 3 |
Distinctiveness Conflict Risk | The skill occupies a clear niche in observability and SRE practices with highly specific trigger terms like 'error budget', 'SLO', 'readiness probe', 'tracing distribuido', and 'runbook operacional' that are unlikely to conflict with other skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
The skill covers a broad scope of observability and SRE concerns but suffers from verbosity and mixed content types — combining versioning history, inspiration quotes, roadmap items, and cost estimates alongside operational guidance. The workflows provide reasonable structure but lack fully executable examples and explicit rollback/recovery procedures. The content would benefit significantly from trimming non-actionable sections and externalizing detailed reference material.
Suggestions
Remove the version history narrative ('Before v2.7.0...'), inspiration quotes, roadmap section, and cost estimates — these don't help Claude execute tasks and waste tokens.
Add executable code examples for at least one observability tool (e.g., a complete structured logging setup, a health check endpoint implementation, or a concrete Datadog API call with full syntax).
Move the Runtime Feedback Sensors section (SLO-driven workflows, log anomaly detection, response quality sampling) to a separate referenced file to keep SKILL.md as a concise overview.
Add explicit rollback criteria and procedure in the SLO-driven workflow — specify the exact threshold that triggers rollback and the commands to execute it, with a validate-fix-retry loop.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is very verbose with significant padding. It explains concepts Claude already knows (what SRE is, what logs/metrics/tracing are), includes a lengthy 'Runtime Feedback Sensors' section with inspiration quotes, version history, roadmap items, and cost estimates that don't add actionable value. The 'gap closed' section, roadmap, and references to internal versioning (v2.7.0+) are noise for an instruction skill. | 1 / 3 |
Actionability | The workflows provide step-by-step sequences with some concrete guidance (e.g., specific CLI commands like 'aws cloudwatch get-metric-statistics', sampling percentages), but lack executable code examples. Commands are listed as names without full syntax, and the workflows are more pseudocode/process descriptions than copy-paste ready instructions. | 2 / 3 |
Workflow Clarity | Multiple workflows are presented with numbered steps and some validation checkpoints (e.g., re-pull SLO after deploy, check against budget). However, the rollback decision is mentioned but not formalized with explicit criteria or feedback loops. The 'SLO-driven feature work' workflow has a validation gap — step 4 mentions rollback 'considered' but doesn't specify the actual rollback procedure or threshold enforcement. | 2 / 3 |
Progressive Disclosure | References to external files exist (policies/handoffs.md, templates/observability-check.md, docs/skill-guides/observability-sre.md) and are one-level deep, which is good. However, the main file is monolithic with the massive Runtime Feedback Sensors section inlined when it could be a separate reference. The integration table and roadmap add bulk that could be externalized. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
7577622
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.