Investigate failing ThousandEyes synthetic tests with MCP tools. Use when a user wants ThousandEyes test triage, service-map or trace-ID correlation, distributed-tracing checks, correlation across Observability Platforms, or evidence-backed root-cause analysis with optional code fixes.
Overall Score: 81%
Does it follow best practices?

Impact: No eval scenarios have been run.
Status: Passed, no known issues.

Quality
Discovery: 100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly identifies its domain (ThousandEyes synthetic test investigation), lists specific capabilities, and includes an explicit 'Use when' clause with multiple concrete trigger scenarios. The description is concise, uses third-person voice, and provides enough specificity to distinguish it from general observability or monitoring skills.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific, concrete actions: investigating failing synthetic tests, service-map correlation, trace-ID correlation, distributed-tracing checks, cross-platform correlation, root-cause analysis, and optional code fixes. | 3 / 3 |
| Completeness | Clearly answers both 'what' (investigate failing ThousandEyes synthetic tests with MCP tools) and 'when' (an explicit 'Use when' clause listing five specific trigger scenarios, including triage, correlation, tracing, and root-cause analysis). | 3 / 3 |
| Trigger Term Quality | Includes strong natural trigger terms users would say: 'ThousandEyes', 'synthetic tests', 'failing', 'triage', 'service-map', 'trace-ID', 'distributed-tracing', 'root-cause analysis', 'code fixes', and 'Observability Platforms'. These cover the domain well with terms practitioners naturally use. | 3 / 3 |
| Distinctiveness / Conflict Risk | Highly distinctive, with a clear niche: ThousandEyes-specific synthetic test investigation using MCP tools. The combination of ThousandEyes, synthetic tests, and observability-platform correlation makes it very unlikely to conflict with other skills. | 3 / 3 |
| Total | | 12 / 12 (Passed) |
Implementation: 62%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured, domain-specific skill with excellent workflow clarity and logical sequencing for a complex multi-step diagnostic process. Its main weaknesses are moderate verbosity (some redundancy between Required Behavior and Workflow sections) and a lack of concrete executable examples such as sample MCP tool call payloads or expected response structures. The progressive disclosure design is sound but unverifiable without bundle files.
Suggestions
Add at least one concrete example of an MCP tool call with sample input parameters and a snippet of expected output to improve actionability.
Deduplicate content between the 'Required Behavior' section and the 'Workflow' section—several rules (e.g., enumerate all Observability Platforms, check every platform) are stated in both places.
Consider moving the detailed Observability Platform correlation sub-steps (Step 5) into reference.md to reduce the main skill's token footprint while keeping the high-level flow inline.
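To make the first suggestion concrete, here is a minimal sketch of what an inline MCP tool-call example could look like. The payload shape and helper function are assumptions for illustration; only the tool name (`list_network_app_synthetics_tests`) and the filter parameters (`filter_dimension=TEST`, `filter_values=[testId]`) come from the skill under review.

```python
# Hypothetical sketch of an MCP tool call the skill could document inline.
# The payload follows the generic MCP tools/call convention; the helper
# function and test ID are illustrative, not taken from the skill itself.

def build_test_metrics_request(test_id: str) -> dict:
    """Build a tools/call payload scoped to one failing synthetic test."""
    return {
        "method": "tools/call",
        "params": {
            "name": "list_network_app_synthetics_tests",
            "arguments": {
                "filter_dimension": "TEST",
                "filter_values": [test_id],
            },
        },
    }

request = build_test_metrics_request("12345")
print(request["params"]["arguments"]["filter_values"])  # ['12345']
```

An agent would send `request` through its MCP client and then inspect the returned test status, error type, and agent locations before moving to correlation.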
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is generally well-structured and avoids explaining basic concepts, but it is somewhat verbose in places, particularly the Observability Platform correlation section (Step 5), which repeats similar instructions across multiple sub-points. Some rules in 'Required Behavior' duplicate what is already stated in the workflow steps. Could be tightened by roughly 20-30%. | 2 / 3 |
| Actionability | The skill provides specific MCP tool names (e.g., `list_network_app_synthetics_tests`, `get_service_map`) and clear parameter guidance (`filter_dimension=TEST`, `filter_values=[testId]`), which is good. However, there are no concrete executable code examples, no example MCP call payloads, and no sample output structures inline. The guidance is specific but not copy-paste ready. | 2 / 3 |
| Workflow Clarity | The workflow is clearly sequenced across 6 numbered phases with logical ordering, explicit conditional branches (e.g., distributed tracing enabled/disabled, service map available/unavailable), fallback paths, and validation checkpoints (e.g., 'validate trace ID format', 'ask user confirmation before edits', 'do not conclude until every platform checked'). The guardrails section adds safety constraints for destructive operations. | 3 / 3 |
| Progressive Disclosure | The skill references `reference.md` and `examples.md` with clear signals for when to load each, which is good progressive disclosure design. However, no bundle files were provided, so we cannot verify that these references exist or contain appropriate content. The main SKILL.md itself is fairly long, and some content (like the detailed Observability Platform correlation sub-steps) could potentially be offloaded to a reference file. | 2 / 3 |
| Total | | 9 / 12 (Passed) |
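The 'do not conclude until every platform checked' checkpoint credited under Workflow Clarity amounts to an enumerate-then-verify guard. A minimal sketch, assuming hypothetical platform names and a stand-in query callable (neither is taken from the skill):

```python
# Illustrative sketch of the "check every platform before concluding" rule.
# The platform list and the query callable are hypothetical stand-ins.

PLATFORMS = ["platform_a", "platform_b", "platform_c"]

def correlate_across_platforms(trace_id, query):
    """Query every observability platform for a trace; never stop early."""
    evidence = {}
    for platform in PLATFORMS:
        evidence[platform] = query(platform, trace_id)  # may be None (no hit)
    missing = [p for p in PLATFORMS if p not in evidence]
    if missing:
        raise RuntimeError(f"cannot conclude: {missing} not yet checked")
    return evidence

findings = correlate_across_platforms("abc-123", lambda p, t: {"trace_id": t})
```

The point of the guard is that a negative result ("no hit on platform X") is still recorded as evidence, so the final root-cause conclusion can cite every platform checked.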
Validation: 100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Result: 11 / 11 checks passed. Skill-structure validation reported no warnings or errors.