Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization. Use when setting up mesh monitoring, debugging latency issues, or implementing SLOs for service communication.
Install with Tessl CLI
npx tessl i github:wshobson/agents --skill service-mesh-observability
Does it follow best practices? Score: 77
If you maintain this skill, you can automatically optimize it using the tessl CLI to improve its score:
npx tessl skill review --optimize ./path/to/skill
Agent success when using this skill
Validation for skill structure
Prometheus alerting rules for Istio mesh
Criterion                         Without context   With context
PrometheusRule CRD                100%              100%
istio-system namespace            100%              100%
reporter=destination filter       100%              100%
istio_requests_total metric       100%              100%
Error rate threshold              100%              100%
Error rate severity critical      0%                100%
Latency metric name               100%              100%
Latency threshold 1000ms          100%              100%
Latency severity warning          100%              0%
Alert for duration                100%              100%
Cert expiry alert                 100%              100%
P99 quantile function             100%              100%
Without context: $0.4006 · 2m 4s · 12 turns · 61 in / 7,547 out tokens
With context: $0.5859 · 2m 7s · 28 turns · 339 in / 6,163 out tokens
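As a rough illustration of what the criteria in this scenario check, the sketch below assembles two PromQL alert expressions as Python strings. It is not the skill's actual rule set. The metric `istio_requests_total`, the `reporter="destination"` filter, the P99 quantile and the 1000 ms latency threshold come from the criteria above. The helper names, the 5-minute rate window and the latency histogram `istio_request_duration_milliseconds_bucket` (the standard Istio name, which this page does not spell out) are assumptions. The error-rate threshold is left as a required argument because the page does not state its value.

```python
# Hypothetical helpers that build PromQL strings for Istio alert rules.
# The function names and the default 5m window are assumptions for illustration.

DESTINATION = 'reporter="destination"'  # filter named in the criteria above


def error_rate_expr(threshold: float, window: str = "5m") -> str:
    """Share of 5xx responses among all requests, compared to a threshold."""
    errors = (
        f'sum(rate(istio_requests_total{{{DESTINATION},'
        f'response_code=~"5.."}}[{window}]))'
    )
    total = f'sum(rate(istio_requests_total{{{DESTINATION}}}[{window}]))'
    return f"{errors} / {total} > {threshold}"


def p99_latency_expr(threshold_ms: int = 1000, window: str = "5m") -> str:
    """P99 request latency in milliseconds, compared to a threshold."""
    buckets = (
        f'sum(rate(istio_request_duration_milliseconds_bucket'
        f'{{{DESTINATION}}}[{window}])) by (le)'
    )
    return f"histogram_quantile(0.99, {buckets}) > {threshold_ms}"


if __name__ == "__main__":
    # 0.05 is an arbitrary example threshold, not a value from this page.
    print(error_rate_expr(0.05))
    print(p99_latency_expr())
```

The resulting strings could be pasted into the `expr` field of an alerting rule. The severity labels and the `for` duration named in the criteria would be set alongside them in the rule definition.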
Distributed tracing configuration and sampling
Criterion                         Without context   With context
Jaeger image version              0%                100%
Zipkin port 9411                  30%               100%
Dev sampling 100%                 100%              100%
Prod sampling 1-10%               100%              100%
OTel batch timeout                0%                50%
OTel OTLP gRPC endpoint           100%              100%
OTel Zipkin receiver              100%              100%
Telemetry v2 randomSampling       100%              100%
Jaeger Zipkin env var             0%                100%
istio-system namespace            0%                100%
No 100% prod sampling             100%              100%
Without context: $0.6944 · 3m 19s · 21 turns · 22 in / 12,361 out tokens
With context: $1.0046 · 4m 21s · 31 turns · 29 in / 15,685 out tokens
Mesh visualization, dashboards, and metrics queries
Criterion                         Without context   With context
Kiali prometheus URL              100%              100%
Kiali tracing URL                 100%              100%
Kiali grafana URL                 100%              100%
nodeGraph topology panel          0%                100%
reporter=destination in queries   100%              100%
istio_requests_total metric       100%              100%
Latency P99 PromQL                100%              100%
Scrape interval 15s               100%              100%
ServiceMonitor CRD                100%              100%
Kiali CRD apiVersion              0%                100%
istio-system namespace placement  100%              100%
Error rate panel uses 5xx filter  100%              100%
Without context: $1.0326 · 4m 40s · 22 turns · 22 in / 21,974 out tokens
With context: $0.5618 · 2m 6s · 21 turns · 19 in / 7,401 out tokens