observability-sre

Observability and operational reliability skill. Use when you need to define logs, metrics, tracing, alerts, health checks, readiness, error budgets, rollback, and safe operation of services. Triggers on: "observabilidade", "observability", "SRE", "logs estruturados", "metricas", "tracing distribuido", "health check", "readiness probe", "error budget", "SLO", "alertas", "rollback seguro", "runbook operacional".

Quality

62%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Security

Passed

No known issues

Fix and improve this skill with Tessl

tessl review fix ./skills/20-observability-sre/SKILL.md

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill covers observability and SRE comprehensively but suffers from verbosity and conceptual padding that dilutes its actionability. The Runtime Feedback Sensors section, while interesting, dominates the file with meta-commentary, inspiration quotes, and roadmap items that don't help Claude execute tasks. The core operational content (checklists, responsibilities, workflows) is solid but would benefit from executable code examples and tighter organization.

Suggestions

Remove or drastically shorten the 'Runtime Feedback Sensors' inspiration/gap/roadmap sections — move roadmap and inspiration to a separate reference file and keep only the actionable workflows.

Add executable code examples for key tasks: a structured logging setup snippet, a health check endpoint implementation, a metrics instrumentation example, and an actual rollback command sequence.
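As an illustration of the first item in that suggestion, a structured logging setup could look roughly like the sketch below. It uses only Python's standard `logging` and `json` modules. The `JsonFormatter` class, the `build_logger` helper, and the `correlation_id` field are hypothetical names chosen for this example and do not come from the reviewed skill.

```python
import io
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as one JSON object per line."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
            # Hypothetical field: callers attach it with `extra={"correlation_id": ...}`.
            "correlation_id": getattr(record, "correlation_id", None),
        }
        return json.dumps(payload)


def build_logger(name: str, stream) -> logging.Logger:
    """Return a logger that writes JSON lines to `stream`."""
    logger = logging.getLogger(name)
    logger.handlers.clear()  # avoid duplicate handlers on repeated calls
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep output out of the root logger
    return logger


if __name__ == "__main__":
    buffer = io.StringIO()
    log = build_logger("checkout", buffer)
    log.info("order %s accepted", "A-17", extra={"correlation_id": "req-123"})
    print(buffer.getvalue(), end="")
```

Emitting one JSON object per line keeps the output easy to parse downstream, and passing the correlation ID through `extra=` avoids changing every call site's message format.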

Make rollback criteria explicit in the SLO-driven workflow — specify exact thresholds and commands rather than 'rollback considerado'.
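One way to make such a criterion explicit is to express it as an error budget burn rate, as in the minimal sketch below. The threshold of 14.4 is an assumption for illustration (a commonly cited fast-burn value for a one-hour window against a 30-day budget), not a number taken from the skill, and the function names are hypothetical.

```python
# Assumed fast-burn threshold; the reviewed skill does not specify one.
FAST_BURN_THRESHOLD = 14.4


def burn_rate(errors: int, total: int, slo_target: float) -> float:
    """Observed error rate divided by the error budget (1 - SLO target)."""
    budget = 1.0 - slo_target
    if budget <= 0:
        raise ValueError("slo_target must be below 1.0")
    if total == 0:
        return 0.0  # no traffic means no budget was consumed
    return (errors / total) / budget


def should_rollback(
    errors: int,
    total: int,
    slo_target: float,
    threshold: float = FAST_BURN_THRESHOLD,
) -> bool:
    """True when the budget is burning at or above the chosen threshold."""
    return burn_rate(errors, total, slo_target) >= threshold
```

With a 99.9% SLO, 20 failures in 1,000 requests burn the budget at roughly 20 times the sustainable rate, so `should_rollback(20, 1000, 0.999)` returns `True`. The rollback command itself depends on the deployment platform, which the review does not name. On Kubernetes, for example, it might be `kubectl rollout undo deployment/<name>`.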

Move the integration table, roadmap, and references sections to a linked file (e.g., docs/skill-guides/observability-sre.md) to reduce the main skill's token footprint.

Dimension	Reasoning	Score
Conciseness	The skill is very verbose with significant padding. It explains concepts Claude already knows (what SLOs are, what observability is), includes a roadmap section with version numbers that is irrelevant to actionable guidance, quotes an inspiration source at length, and has extensive meta-commentary ('O gap fechado', 'Antes do v2.7.0...') that doesn't help Claude execute tasks. The 'Runtime Feedback Sensors' section alone is longer than the entire core operational content and much of it is conceptual rather than instructional.	1 / 3
Actionability	The workflows (SLO-driven feature work, log anomaly detection, response quality sampling) provide step-by-step sequences but lack executable code — they use pseudocode-style numbered lists rather than copy-paste ready commands or code snippets. The checklist and anti-patterns are useful but remain at the level of guidelines rather than concrete implementations. No actual code examples for structured logging, metrics instrumentation, or health check endpoints are provided.	2 / 3
Workflow Clarity	The three workflows (SLO-driven, log anomaly, response quality) have clear sequences and the SLO workflow includes a validation checkpoint (step 4 with budget comparison and rollback consideration). However, the rollback decision is vague ('rollback considerado' rather than explicit criteria and commands), and the log anomaly and response quality workflows lack explicit validation/error recovery steps. The checklist section is useful but disconnected from the workflows.	2 / 3
Progressive Disclosure	The skill references several external files (policies, templates, docs/skill-guides/observability-sre.md) and has a clear 'when to use' / 'when not to use' structure. However, the main body is monolithic — the Runtime Feedback Sensors section is extremely long and could be split into a separate reference file. The integration table and roadmap sections add bulk that would be better as linked references. No bundle files are provided to verify referenced paths exist.	2 / 3
	Total	7 / 12 Passed
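The Actionability row notes that no health check endpoint example is provided. A minimal sketch of one, using only Python's standard library, might look like the following. The `/healthz` and `/readyz` paths, the `ready` flag, and the `start_server` helper are assumptions made for this example and are not taken from the skill.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical readiness flag: a real service would set it once its
# dependencies (database, cache, and so on) are reachable.
ready = threading.Event()


class HealthHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path == "/healthz":
            # Liveness: the process is up and able to answer requests.
            self._reply(200, {"status": "ok"})
        elif self.path == "/readyz":
            # Readiness: only report 200 once the service can take traffic.
            if ready.is_set():
                self._reply(200, {"status": "ready"})
            else:
                self._reply(503, {"status": "not ready"})
        else:
            self._reply(404, {"error": "not found"})

    def _reply(self, code, body):
        data = json.dumps(body).encode("utf-8")
        self.send_response(code)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)

    def log_message(self, format, *args):
        pass  # silence the default per-request access log


def start_server(port=0):
    """Serve health endpoints on localhost in a background thread."""
    server = HTTPServer(("127.0.0.1", port), HealthHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Keeping liveness and readiness on separate paths lets an orchestrator restart a hung process without also routing traffic to one that is still warming up.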

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a well-structured skill description with strong trigger term coverage in both Portuguese and English, clear 'when to use' guidance, and a distinct operational niche. Its main weakness is that the capability descriptions lean toward listing topics rather than specifying concrete actions (e.g., 'define logs' vs 'configure structured logging with correlation IDs, set up distributed tracing spans').

Suggestions

Replace the generic verb 'definir' with more specific action verbs describing what the skill actually does, e.g., 'configura logs estruturados, implementa tracing distribuído, define SLOs e error budgets, cria runbooks operacionais'.

Dimension	Reasoning	Score
Specificity	The description names the domain (observability and operational reliability) and lists several relevant concepts (logs, metrics, tracing, alerts, health checks, readiness, error budgets, rollback), but these read more as a list of topics than concrete actions. The verb 'definir' (define) is used but is somewhat vague about what specific actions are performed.	2 / 3
Completeness	The description clearly answers both 'what' (observability and operational reliability covering logs, metrics, tracing, alerts, health checks, etc.) and 'when' (explicit 'Use quando precisar...' clause plus a dedicated 'Trigger em:' section with specific keywords).	3 / 3
Trigger Term Quality	Excellent coverage of natural trigger terms in both Portuguese and English, including 'observabilidade', 'observability', 'SRE', 'logs estruturados', 'metricas', 'tracing distribuido', 'health check', 'readiness probe', 'error budget', 'SLO', 'alertas', 'rollback seguro', 'runbook operacional'. These are terms users would naturally use when needing this skill.	3 / 3
Distinctiveness Conflict Risk	The skill occupies a clear niche in observability and SRE practices with highly specific trigger terms like 'SRE', 'error budget', 'SLO', 'readiness probe', and 'tracing distribuido' that are unlikely to conflict with other skills.	3 / 3
	Total	11 / 12 Passed

Validation

100%


Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: felvieira/claude-skills-fv
Commit: 9e5d744

Reviewed: 12 days ago
