How to actually instrument product analytics correctly. Event taxonomy, property design, naming conventions, schema versioning, identity stitching, funnel design, retention cohorts, North Star metric selection, dashboard hygiene, instrumentation debt, and the failure modes that produce data nobody trusts. Triggers on product analytics setup, event taxonomy, tracking plan, instrumentation, schema versioning, North Star metric, retention cohorts, funnel design, naming conventions, instrument new feature, audit existing analytics, dashboard reconciliation, instrumentation debt, Mixpanel setup, Amplitude setup, PostHog setup, warehouse-native analytics. Also triggers when the team has data but cannot trust it, or when designing instrumentation for a new feature, or when auditing an existing setup that has drifted.
53
60%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/product-analytics-setup/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong, comprehensive skill description that clearly defines its domain (product analytics instrumentation), lists numerous specific capabilities, and provides explicit trigger conditions covering both keyword-based and situational triggers. The description is thorough without being vague, and the inclusion of specific tool names and scenario-based triggers ('when the team has data but cannot trust it') makes it highly actionable for skill selection. Minor concern is that it's quite verbose, but the content is substantive rather than padded.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists many specific concrete actions and concepts: event taxonomy, property design, naming conventions, schema versioning, identity stitching, funnel design, retention cohorts, North Star metric selection, dashboard hygiene, instrumentation debt, and failure mode identification. | 3 / 3 |
Completeness | Clearly answers both 'what' (event taxonomy, property design, naming conventions, schema versioning, etc.) and 'when' with explicit trigger guidance ('Triggers on...', 'Also triggers when the team has data but cannot trust it, or when designing instrumentation for a new feature, or when auditing an existing setup'). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'tracking plan', 'Mixpanel setup', 'Amplitude setup', 'PostHog setup', 'instrument new feature', 'audit existing analytics', 'event taxonomy', 'retention cohorts', 'funnel design', plus situational triggers like 'data but cannot trust it'. | 3 / 3 |
Distinctiveness Conflict Risk | Occupies a clear niche around product analytics instrumentation specifically, with distinct triggers like 'tracking plan', 'event taxonomy', 'instrumentation debt', and specific tool names (Mixpanel, Amplitude, PostHog) that are unlikely to conflict with general data analysis or other skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
20%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill reads as a well-written educational article about product analytics rather than an actionable playbook for Claude. Its primary weakness is the complete absence of executable artifacts—no code snippets, no schema templates, no tracking plan JSON, no SDK integration examples—despite being about instrumentation. It is also significantly over-verbose, spending substantial tokens explaining concepts (what retention is, what cohorts are, what funnels measure) that Claude already understands, rather than providing the specific, concrete guidance that would make instrumentation execution reliable.
Suggestions
Add executable code examples: SDK initialization for at least one platform (e.g., PostHog), a concrete tracking plan in JSON/TypeScript, a sample event payload, and a validation script or query to verify instrumentation correctness.
Cut conceptual explanations by 50-60%: Remove definitions of retention, cohorts, funnels, and properties that Claude already knows. Replace with terse rules and concrete examples only.
Move the 12 failure modes, retention curve interpretations, and cohort type explanations into reference files, keeping only 1-2 sentence summaries in the main skill.
Add a concrete workflow with validation checkpoints: e.g., 'Step 1: Generate tracking plan JSON → Step 2: Validate against schema → Step 3: Implement SDK calls → Step 4: Verify events in tool's live debugger → Step 5: Confirm dashboard queries match expected counts.'
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~2500+ words. It extensively explains concepts Claude already knows (what retention is, what cohorts are, what funnels are, what properties are). Entire sections like the instrumentation hierarchy, cohort definitions, and retention measurement are conceptual explanations rather than actionable instructions. The 'common failure modes' section lists 12 items with explanations that are largely intuitive. Much of this could be cut by 60-70% without losing actionable value. | 1 / 3 |
Actionability | Despite its length, the skill contains zero executable code, no concrete commands, no JSON schemas, no SQL queries, no API calls, and no copy-paste-ready artifacts. Everything is descriptive guidance ('pick ONE convention and enforce it', 'document the window explicitly'). The closest to concrete are naming examples like `checkout_completed`, but these are illustrative rather than executable. For a skill about instrumentation, the absence of actual instrumentation code (e.g., Mixpanel/Amplitude/PostHog SDK calls, tracking plan JSON, or validation scripts) is a significant gap. | 1 / 3 |
Workflow Clarity | The 12-consideration framework at the end provides a reasonable checklist sequence, and the schema versioning section has a clear additive-vs-breaking workflow. However, there are no explicit validation checkpoints, no feedback loops for error recovery, and no concrete 'run this then check that' sequences. The instrumentation audit is mentioned but deferred entirely to a reference file. For a skill involving schema changes and data migrations (destructive/batch operations), the lack of validation steps caps this at 2. | 2 / 3 |
Progressive Disclosure | The skill references 9 separate reference files with clear descriptions and relative paths, which is good structure. However, since no bundle files are provided, none of these references actually exist, making the progressive disclosure aspirational rather than functional. Additionally, the main SKILL.md itself is a wall of text that inlines extensive conceptual content (retention flavors, cohort types, funnel rules, failure modes) that should have been pushed to the reference files, leaving only actionable summaries in the main file. | 2 / 3 |
Total | 6 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
8e70d03
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.