Use this skill to validate the "why" before building, run product diagnostics, and pressure-test product direction before the request becomes an implementation contract.
49
49%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Quality
Discovery
25%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description relies heavily on abstract, jargon-laden language that fails to communicate concrete actions or natural trigger terms. While it hints at a product validation/strategy niche, the buzzword-heavy phrasing ('validate the why', 'pressure-test product direction', 'implementation contract') makes it difficult for Claude to reliably select this skill. It also uses second person ('Use this skill') which, while not as penalized as first person, contributes to a less professional tone.
Suggestions
Replace abstract phrases with concrete actions, e.g., 'Evaluates product ideas against user needs, identifies risks in product assumptions, and challenges feature requests with structured frameworks.'
Add an explicit 'Use when...' clause with natural trigger terms like 'should I build this feature', 'product validation', 'feature justification', 'product-market fit', 'prioritize features'.
Rewrite in third person active voice describing what the skill does, e.g., 'Runs structured product validation checks...' instead of 'Use this skill to...'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague, buzzword-heavy language like 'validate the why', 'run product diagnostics', and 'pressure-test product direction'. These are abstract phrases that don't describe concrete, specific actions Claude would perform. | 1 / 3 |
Completeness | It attempts to answer both 'what' (validate the why, run diagnostics, pressure-test direction) and 'when' ('before the request becomes an implementation contract'), but the 'when' clause is vague and metaphorical rather than providing explicit, actionable triggers. The lack of a clear 'Use when...' clause with concrete trigger scenarios caps this at 2. | 2 / 3 |
Trigger Term Quality | The terms used ('product diagnostics', 'pressure-test product direction', 'implementation contract') are not natural keywords a user would type. Users are more likely to say things like 'should I build this', 'product strategy', 'feature prioritization', or 'validate idea'. | 1 / 3 |
Distinctiveness Conflict Risk | The skill occupies a somewhat identifiable niche around product validation/strategy, but the vague language ('product direction', 'diagnostics') could overlap with general product management, strategy, or planning skills. It's not generic enough to conflict with everything but not distinct enough to clearly stand apart. | 2 / 3 |
Total | 6 / 12 Passed |
Implementation
50%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides a useful conceptual framework for product thinking with four distinct modes, each with structured question sets. Its main weaknesses are the lack of concrete output templates (e.g., a PRODUCT-BRIEF.md skeleton), missing validation/feedback loops between steps, and some unnecessary conversational filler. It reads more as a thinking framework than an actionable skill with executable guidance.
Suggestions
Add a concrete PRODUCT-BRIEF.md template or schema showing the expected output structure, so Claude knows exactly what artifact to produce.
Include decision criteria or a flowchart for selecting which mode to use based on user input signals.
Add validation checkpoints within modes — e.g., in Mode 1, after gathering answers to questions 1-3, verify the problem is real before proceeding to solution framing.
Remove conversational filler like 'Like YC office hours but automated' and 'not more founder-theater' to improve token efficiency.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably structured but includes some unnecessary filler phrases ('Like YC office hours but automated', 'not more founder-theater', 'not vibes') and could be tightened. Some explanatory text is redundant given Claude's intelligence, but it's not egregiously verbose. | 2 / 3 |
Actionability | The skill provides structured question frameworks and scoring rubrics (ICE scoring, PMF signals), which give concrete guidance. However, the instructions are more like checklists of what to think about rather than executable steps — there are no concrete commands, file templates, or copy-paste-ready artifacts. The 'output a PRODUCT-BRIEF.md' instruction lacks a template or schema. | 2 / 3 |
Workflow Clarity | Each mode has a numbered sequence of steps which provides decent structure. However, there are no validation checkpoints, no feedback loops for when diagnostics reveal ambiguity, and no clear decision criteria for choosing between modes. The handoff to 'product-capability' is mentioned but the trigger conditions are vague. | 2 / 3 |
Progressive Disclosure | The content is organized into clear modes with an integration section pointing to related skills. However, all four modes are fully inline rather than being split into separate reference files, making this a moderately long single document. The cross-references to other skills/lanes are helpful but the internal structure could benefit from splitting detailed mode instructions into separate files. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
Reviewed
Table of Contents