Real User Monitoring (RUM), Web Vitals, user sessions, mobile crashes, page performance, user interactions, and frontend errors. Query web and mobile frontend telemetry.
59
68%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/dt-obs-frontends/SKILL.mdQuality
Discovery
72%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has strong trigger term coverage and a distinctive niche in frontend observability/RUM, making it unlikely to conflict with other skills. However, it lacks an explicit 'Use when...' clause and could be more specific about the concrete actions it performs beyond just 'Query'. Adding explicit trigger guidance and more action-oriented language would elevate this description.
Suggestions
Add an explicit 'Use when...' clause, e.g., 'Use when the user asks about Real User Monitoring, Web Vitals scores, frontend performance issues, mobile app crashes, or user session analysis.'
Replace the generic 'Query web and mobile frontend telemetry' with more specific actions, e.g., 'Analyze page load times, diagnose Core Web Vitals regressions, investigate mobile crash reports, and trace user session journeys.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (RUM, Web Vitals, frontend telemetry) and lists several relevant concepts (user sessions, mobile crashes, page performance, user interactions, frontend errors), but doesn't describe concrete actions beyond 'Query'. It lists topics more than specific capabilities like 'analyze session replays' or 'diagnose slow page loads'. | 2 / 3 |
Completeness | The 'what' is partially addressed (query web and mobile frontend telemetry), but there is no explicit 'Use when...' clause or equivalent trigger guidance. The description only implies when it should be used through the listed topics, which caps this at 2 per the rubric guidelines. | 2 / 3 |
Trigger Term Quality | Includes strong natural keywords users would actually say: 'Real User Monitoring', 'RUM', 'Web Vitals', 'mobile crashes', 'page performance', 'frontend errors', 'user sessions', 'user interactions'. These cover a good range of terms a user would naturally use when asking about frontend observability. | 3 / 3 |
Distinctiveness Conflict Risk | The description carves out a clear niche around frontend/RUM telemetry, distinguishing it from backend monitoring, APM, or general analytics skills. Terms like 'RUM', 'Web Vitals', 'mobile crashes', and 'frontend errors' are specific enough to avoid conflicts with other observability or data querying skills. | 3 / 3 |
Total | 10 / 12 Passed |
Implementation
64%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a comprehensive and highly actionable frontend observability skill with excellent concrete DQL examples and valuable domain-specific knowledge (session naming conventions, zombie sessions, time window gotchas). Its main weaknesses are excessive length — the troubleshooting section contains generic debugging advice Claude doesn't need, and substantial content that could be offloaded to reference files is kept inline. The core workflows section is underdeveloped compared to the detailed playbook, serving mainly as a pointer to reference files without substantive step-by-step guidance.
Suggestions
Move the Slow Page Load Playbook and its ~15 DQL queries into a dedicated reference file (e.g., references/slow-page-playbook.md) and keep only the heuristic summary inline to reduce SKILL.md length significantly.
Trim the Troubleshooting section by removing generic debugging advice (Common Investigation Steps, Decision Tree) that Claude already knows, keeping only the RUM-specific gotchas (zero results due to session delay, zombie sessions, synthetic traffic filtering).
Add explicit validation/verification steps to the Core Workflows — e.g., after diagnosing a performance issue, include a step to verify the fix by re-running the baseline query and comparing results.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is quite long (~400+ lines) with significant verbosity in sections like Troubleshooting (decision trees, red flags, common investigation steps) and the detailed session data documentation. While much content is genuinely useful reference material (metric names, field mappings, DQL queries), the troubleshooting section over-explains diagnostic reasoning that Claude already knows, and the 'When to Use This Skill' section restates what's obvious from context. The zombie sessions and session creation delay explanations are valuable domain knowledge, but the decision tree and 'Common Investigation Steps' sections are generic debugging advice. | 2 / 3 |
Actionability | The skill excels at actionability with numerous executable DQL queries covering the full Slow Page Load Playbook (TTFB, long tasks, bundle size, cache, compression, DNS, third-party analysis). Field names are specific and concrete, the session field naming table with explicit 'NOT this' warnings is highly actionable, and performance thresholds provide clear numeric benchmarks. Queries are copy-paste ready with realistic filter patterns. | 3 / 3 |
Workflow Clarity | The Slow Page Load Playbook provides a clear diagnostic workflow with heuristics mapping symptoms to causes, and the troubleshooting section has a structured decision tree. However, the five 'Core Workflows' in the middle are just pointers to reference files without actual step sequences or validation checkpoints. The playbook lacks explicit validation steps (e.g., 'confirm the fix improved metrics') and there's no feedback loop for iterative diagnosis beyond the initial heuristic matching. | 2 / 3 |
Progressive Disclosure | The skill references multiple external files (WebVitals.md, user-sessions.md, error-tracking.md, mobile-monitoring.md, performance-analysis.md) with a clear 'Progressive Disclosure' section and 'Reference Files' listing. However, no bundle files were provided to verify these exist, and the SKILL.md itself is quite long — the extensive session data documentation, full playbook with ~15 queries, and lengthy troubleshooting section could arguably live in reference files rather than inline, making the main skill a wall of text despite good structural headers. | 2 / 3 |
Total | 9 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (610 lines); consider splitting into references/ and linking | Warning |
Total | 10 / 11 Passed | |
7cbe1ef
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.