Grounds every implementation decision in official documentation. Use when you want authoritative, source-cited code free from outdated patterns. Use when building with any framework or library where correctness matters.
45
47%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/source-driven-development/SKILL.mdQuality
Discovery
17%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description suffers from vague, aspirational language and lacks concrete actions or specific trigger terms. The 'Use when' clauses are present but so broad ('any framework or library where correctness matters') that they provide almost no discriminative value for skill selection. The description reads more like marketing copy than a functional skill selector.
Suggestions
Replace vague language with concrete actions, e.g., 'Looks up official documentation, validates API usage against current docs, checks for deprecated patterns, and cites documentation sources in code comments.'
Narrow the 'Use when' clause with specific triggers, e.g., 'Use when the user asks to check documentation, verify API correctness, find the latest syntax for a library, or mentions terms like "docs", "official API", "deprecated", or "latest version".'
Add distinctiveness by specifying what makes this different from general coding skills, e.g., mentioning the documentation lookup mechanism or specific documentation sources it consults.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague, abstract language like 'grounds every implementation decision' and 'authoritative, source-cited code free from outdated patterns.' It does not list any concrete actions (e.g., 'looks up documentation,' 'validates API usage,' 'checks version compatibility'). The capabilities are described in buzzword-heavy, aspirational terms rather than specific operations. | 1 / 3 |
Completeness | It has 'Use when...' clauses, which is positive, but the 'what' is extremely vague ('grounds every implementation decision') and the 'when' is overly broad ('building with any framework or library where correctness matters'). The triggers are so generic they don't provide meaningful guidance for skill selection. | 2 / 3 |
Trigger Term Quality | The description lacks natural keywords a user would actually say. Terms like 'authoritative,' 'source-cited,' and 'outdated patterns' are not phrases users naturally use when requesting help. Missing practical trigger terms like 'documentation,' 'docs,' 'API reference,' 'check the docs,' 'latest version,' or specific framework names. | 1 / 3 |
Distinctiveness Conflict Risk | The description is extremely generic — 'any framework or library where correctness matters' could apply to virtually any coding skill. It would conflict with nearly every development-related skill since most aim to produce correct code. There is no clear niche or distinguishing characteristic. | 1 / 3 |
Total | 5 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-crafted skill with excellent actionability and workflow clarity. The four-step process is concrete, well-sequenced, and includes proper validation and error handling. The main weakness is verbosity — the skill includes motivational/persuasive content (rationalizations table, red flags) that overlaps with the process steps and explains concepts Claude already understands, consuming tokens without adding proportional value.
Suggestions
Remove or significantly condense the 'Common Rationalizations' table — Claude doesn't need to be persuaded why verification matters; it just needs the process.
Merge 'Red Flags' into the verification checklist as negative checks, eliminating redundancy with the process steps.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is well-written but includes some unnecessary verbosity. The 'Common Rationalizations' table, while useful, is somewhat preachy and explains things Claude already understands (e.g., why hallucinating is bad). The 'When to Use' / 'When NOT to use' sections and the 'Red Flags' section overlap significantly with content already covered in the process steps. The ASCII diagram adds little value. Overall, the content could be tightened by ~30% without losing actionable information. | 2 / 3 |
Actionability | The skill provides highly concrete, actionable guidance: specific file names to check for versions, a clear source hierarchy table, exact examples of good vs bad fetch targets, specific code examples with citations, and templates for conflict resolution and unverified patterns. Every step tells Claude exactly what to do with concrete examples. | 3 / 3 |
Workflow Clarity | The four-step process (Detect → Fetch → Implement → Cite) is clearly sequenced with explicit validation checkpoints. The skill includes feedback loops for conflicts (surface to user), unverified patterns (flag explicitly), and a comprehensive verification checklist at the end. The workflow handles edge cases like missing versions, conflicting docs, and unverifiable patterns. | 3 / 3 |
Progressive Disclosure | The content is well-structured with clear headers and sections, but it's a monolithic document with no references to external files. At ~200 lines, some content (like the Common Rationalizations table, the Red Flags list, or detailed examples for specific frameworks) could be split into separate reference files. However, since no bundle files are provided, the skill is evaluated in isolation and the organization within the single file is reasonable. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
f17c6e8
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.