Observability and operational reliability skill. Use when you need to define logs, metrics, tracing, alerts, health checks, readiness, error budgets, rollback, and safe operation of services.
61
52%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/20-observability-sre/SKILL.md
Quality
Discovery
82%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a reasonably well-constructed skill description that covers the observability and reliability domain with good trigger terms and an explicit 'Use when' clause. Its main weaknesses are that the capabilities are listed more as topics than concrete actions, and some terms could overlap with adjacent DevOps/infrastructure skills. The description would benefit from more specific action verbs and clearer differentiation from related operational skills.
Suggestions
Replace the generic 'definir' (define) with multiple specific action verbs, e.g., 'Configures logging pipelines, implements distributed tracing, defines SLO-based alerts, configures health checks and readiness probes'.
Add distinguishing context to reduce overlap with DevOps/deployment skills, e.g., 'Focuses on production observability patterns and SRE practices, not CI/CD pipelines or infrastructure provisioning'.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | The description names the domain (observability and operational reliability) and lists several relevant concepts (logs, metrics, tracing, alerts, health checks, readiness, error budgets, rollback), but these are more like topic keywords than concrete actions. It says 'definir' (define) but doesn't describe multiple distinct actions like 'configure alerts, set up dashboards, implement tracing'. | 2 / 3 |
| Completeness | The description clearly answers both 'what' (observability and operational reliability skill covering logs, metrics, tracing, alerts, health checks, etc.) and 'when' with an explicit 'Use quando precisar...' clause listing specific trigger scenarios. | 3 / 3 |
| Trigger Term Quality | Good coverage of natural terms a user would mention: logs, metricas, tracing, alertas, health checks, readiness, error budgets, rollback, operacao segura. These are terms practitioners naturally use when discussing observability and reliability topics. | 3 / 3 |
| Distinctiveness / Conflict Risk | While the observability/reliability niche is reasonably specific, terms like 'logs', 'alertas', and 'rollback' could overlap with DevOps, deployment, or monitoring skills. The combination of all these terms together helps, but individually several could trigger conflicts with related infrastructure skills. | 2 / 3 |
| Total | | 10 / 12 Passed |
Implementation
22%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill reads as a high-level role description rather than actionable operational guidance. It lacks any concrete examples—no code snippets for structured logging, metric definitions, health check implementations, alerting rules, or tracing configuration. The checklist and responsibilities are abstract and largely redundant with each other, and there is no sequenced workflow with validation steps for what should be a multi-step operational process.
Suggestions
Add concrete, executable examples for key tasks: a structured log format snippet, a health check endpoint implementation, a sample alerting rule, and a basic tracing setup in the project's actual stack.
Define a clear sequenced workflow for implementing observability (e.g., 1. Audit existing signals → 2. Define structured log schema → 3. Add health/readiness endpoints → 4. Validate with test requests → 5. Configure alerts → 6. Verify alert fires correctly), with explicit validation checkpoints.
Remove or consolidate redundant sections—'Responsabilidades' and 'Checklist Base' overlap significantly. Merge into a single actionable checklist with concrete acceptance criteria.
For each referenced file (policies, templates, skill-guides), add a one-line description of what it contains so Claude can decide whether to consult it without needing to open it.
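The first suggestion asks for a structured log format and a health check endpoint. The review does not name the project's stack, so the following is only a minimal sketch using the Python standard library. The field names, the `/healthz` and `/readyz` paths, the `fields` convention on log records, and the `checks` mapping are all assumptions made for illustration, not part of the reviewed skill.

```python
import json
import logging
from datetime import datetime, timezone
from http.server import BaseHTTPRequestHandler


class JsonFormatter(logging.Formatter):
    """Render each log record as one JSON object per line."""

    def format(self, record: logging.LogRecord) -> str:
        event = {
            "ts": datetime.fromtimestamp(record.created, tz=timezone.utc).isoformat(),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Hypothetical convention: callers pass extra={"fields": {...}}.
        event.update(getattr(record, "fields", {}))
        return json.dumps(event, sort_keys=True)


def readiness(checks: dict) -> tuple:
    """Run each dependency check; ready only if every check returns True."""
    results = {}
    for name, check in checks.items():
        try:
            results[name] = bool(check())
        except Exception:
            # A check that raises counts as a failed dependency.
            results[name] = False
    status = 200 if all(results.values()) else 503
    return status, {"ready": status == 200, "checks": results}


class HealthHandler(BaseHTTPRequestHandler):
    """Liveness at /healthz, readiness at /readyz (assumed paths)."""

    checks = {}  # hypothetical: name -> zero-argument callable

    def do_GET(self):
        if self.path == "/healthz":
            status, body = 200, {"alive": True}
        elif self.path == "/readyz":
            status, body = readiness(self.checks)
        else:
            status, body = 404, {"error": "not found"}
        payload = json.dumps(body).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)
```

To try it, attach `JsonFormatter()` to a `logging.StreamHandler` and serve `HealthHandler` with `http.server.HTTPServer`. Keeping `readiness` as a plain function separates the dependency checks from the HTTP layer, so the validation step in the suggested workflow can exercise it without starting a server.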
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is reasonably structured and not excessively verbose, but it includes several sections that are somewhat generic and don't add much value (e.g., 'Quando Usar' and 'Quando Nao Usar' are fairly obvious). The 'Responsabilidades' section largely restates the checklist. Some tightening would help. | 2 / 3 |
| Actionability | The content is entirely abstract and descriptive, with no concrete code examples, commands, configuration snippets, or specific tool usage. Statements like 'definir sinais essenciais: logs, metricas, traces e alertas' (define essential signals: logs, metrics, traces, and alerts) describe what to do but never show how. There are no executable examples for structured logging, metric definitions, health check endpoints, alerting rules, or tracing setup. | 1 / 3 |
| Workflow Clarity | There is no clear sequenced workflow for implementing observability. The checklist is a flat list of items without ordering, dependencies, or validation checkpoints. For a skill involving operational safety (rollback, health checks, alerting), the absence of any step-by-step process with verification points is a significant gap. | 1 / 3 |
| Progressive Disclosure | The skill references external files (GLOBAL.md, various policies, docs/skill-guides/observability-sre.md, templates), which shows some progressive disclosure structure. However, the references are numerous and not clearly signaled with descriptions of what each contains, making navigation harder. The main content itself is thin, suggesting the real value may be buried in referenced files. | 2 / 3 |
| Total | | 6 / 12 Passed |
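The Workflow Clarity row asks for verification points around alerting, and the skill description mentions error budgets. One way to make "configure alerts, then verify the alert fires" concrete is an error-budget burn-rate check that can be exercised with synthetic counts. This is a sketch under stated assumptions: the function names and the two-window rule are illustrative, and the 14.4 threshold is an assumed default (it corresponds to spending 2% of a 30-day budget in one hour), not a value taken from the reviewed skill.

```python
def burn_rate(errors: int, total: int, slo_target: float) -> float:
    """Observed error rate divided by the error rate the SLO allows.

    A burn rate of 1.0 spends the error budget exactly over the SLO period.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total == 0:
        return 0.0  # no traffic, so no budget is being spent
    budget = 1.0 - slo_target
    return (errors / total) / budget


def should_page(long_window, short_window, slo_target, threshold=14.4):
    """Page only if both windows burn budget faster than the threshold.

    Each window is an (errors, total) pair. Requiring the short window as
    well keeps the alert from firing long after the errors have stopped.
    The threshold default is an assumption for illustration.
    """
    return (
        burn_rate(*long_window, slo_target) >= threshold
        and burn_rate(*short_window, slo_target) >= threshold
    )


if __name__ == "__main__":
    # Synthetic check that the rule fires: 2% errors against a 99.9% SLO.
    print(should_page((200, 10_000), (30, 1_000), slo_target=0.999))
```

Feeding the rule known-bad and known-good counts, as in the final lines, is the kind of validation checkpoint the review finds missing: it confirms the alert condition triggers before it is relied on in production.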
Validation
100%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
d87ad31