service-mesh-observability

Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.

1.39x

Quality

61%

Does it follow best practices?

Impact

95%

1.39x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./tests/ext_conformance/artifacts/agents-wshobson/cloud-infrastructure/skills/service-mesh-observability/SKILL.md

Quality

Discovery

92%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that clearly articulates specific capabilities and includes an explicit 'Use when' clause with relevant trigger scenarios. The domain-specific terminology is well-chosen and natural. The only minor weakness is potential overlap with broader observability or monitoring skills, though the service mesh focus provides reasonable distinctiveness.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'distributed tracing', 'metrics', 'visualization', 'mesh monitoring', 'debugging latency issues', and 'implementing SLOs for service communication'. These are concrete, actionable capabilities.	3 / 3
Completeness	Clearly answers both 'what' (implement observability including distributed tracing, metrics, visualization) and 'when' (explicit 'Use when' clause covering mesh monitoring setup, debugging latency, implementing SLOs).	3 / 3
Trigger Term Quality	Includes strong natural keywords users would say: 'service mesh', 'distributed tracing', 'metrics', 'monitoring', 'latency issues', 'SLOs', 'observability', 'visualization'. These cover the domain well and match how practitioners naturally describe these tasks.	3 / 3
Distinctiveness Conflict Risk	While 'service mesh' narrows the scope, terms like 'observability', 'metrics', 'monitoring', and 'distributed tracing' could overlap with general monitoring/observability skills or APM-focused skills. The service mesh focus helps but doesn't fully eliminate conflict risk.	2 / 3
	Total	11 / 12 Passed

Implementation

29%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill provides rich, executable configuration templates for service mesh observability but suffers from being a monolithic dump of YAML/JSON without workflow sequencing or validation steps. It explains concepts Claude already knows, includes redundant queries across sections, and lacks any progressive disclosure structure. The actionable templates are its main strength, but the absence of a clear setup workflow and verification steps significantly limits its practical utility.

Suggestions

Add a clear sequential workflow (e.g., 1. Install Prometheus → verify scraping, 2. Deploy Jaeger → verify traces flowing, 3. Configure dashboards → validate panels) with explicit validation checkpoints at each step.

Move large YAML/JSON blocks (Grafana dashboard, alerting rules, OTel config) into separate referenced files and keep SKILL.md as a concise overview with pointers.

Remove the 'Core Concepts' section entirely (three pillars, golden signals table) — Claude already knows these concepts. Replace with a brief note on which golden signals the templates implement.

Eliminate redundant PromQL queries that appear in both the 'Key Metrics Queries' section and the Grafana dashboard JSON.

Dimension	Reasoning	Score
Conciseness	The skill is extremely verbose at ~300+ lines, with significant redundancy (same PromQL queries appear in both the metrics section and the Grafana dashboard JSON). It includes conceptual explanations Claude already knows (three pillars of observability, golden signals table), ASCII art diagrams, and external resource links that add little value. Much of this could be cut by 60%+ without losing actionable content.	1 / 3
Actionability	The skill provides fully executable YAML manifests, PromQL queries, bash commands, and JSON dashboard configurations that are copy-paste ready. Each template is concrete and specific with real configuration values, ports, and endpoints.	3 / 3
Workflow Clarity	There is no clear sequenced workflow for setting up observability. Templates are presented as isolated blocks without ordering, dependencies between them, or validation checkpoints. There's no guidance on verifying that Prometheus is scraping correctly, that traces are flowing, or how to diagnose failures in the observability stack itself.	1 / 3
Progressive Disclosure	All content is in a single monolithic file with no references to supporting files. The Grafana dashboard JSON, alerting rules, and OTel configuration could easily be split into separate referenced files. The skill is a wall of YAML/JSON that would benefit greatly from a concise overview pointing to detailed templates.	1 / 3
	Total	6 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: Dicklesworthstone/pi_agent_rust
Commit: 99da384

Reviewed: about 8 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.