Guide for creating effective skills. This command should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations. Use when creating new skills, editing existing skills, or verifying skills work before deployment - applies TDD to process documentation by testing with subagents before writing, iterating until bulletproof against rationalization
69
62%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugins/customaize-agent/skills/create-skill/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a solid description that clearly identifies its niche (meta-skill for creating skills) with explicit 'Use when' triggers and natural keywords. The main weakness is that the specificity of concrete actions could be improved—the description mixes concrete actions with somewhat abstract language about 'specialized knowledge, workflows, or tool integrations' and the TDD/subagent methodology is mentioned but not fully elaborated. The tail end about 'bulletproof against rationalization' is somewhat jargon-heavy.
Suggestions
Replace abstract phrases like 'extends Claude's capabilities with specialized knowledge, workflows, or tool integrations' with more concrete actions such as 'writes SKILL.md files with proper YAML frontmatter, defines step-by-step workflows, and configures tool usage patterns'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (creating skills) and some actions (creating, editing, verifying skills, TDD with subagents), but the actions are somewhat vague and not fully concrete—'extends Claude's capabilities with specialized knowledge, workflows, or tool integrations' is more abstract than specific. | 2 / 3 |
Completeness | Clearly answers both 'what' (guide for creating effective skills, applies TDD to process documentation by testing with subagents) and 'when' (explicit 'Use when creating new skills, editing existing skills, or verifying skills work before deployment'). | 3 / 3 |
Trigger Term Quality | Good coverage of natural terms users would say: 'create a new skill', 'update an existing skill', 'editing existing skills', 'verifying skills', 'skill' repeated multiple times. Users asking about skill creation would naturally use these terms. | 3 / 3 |
Distinctiveness Conflict Risk | This is a meta-skill about creating skills themselves, which is a very distinct niche. The triggers around 'skill creation', 'skill editing', and 'TDD with subagents' are unlikely to conflict with other skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a comprehensive but excessively verbose meta-skill that suffers from significant redundancy—the core TDD-for-skills concept is restated in at least 5 different ways across the document. While it provides a useful 6-step creation process and thorough checklist, the skill violates its own token efficiency guidelines (it would far exceed the <500 word target it recommends). The actionability is moderate: it gives good structural templates but lacks executable examples of the testing methodology it considers essential.
Suggestions
Reduce content by 60-70%: consolidate the TDD mapping table, RED-GREEN-REFACTOR section, checklist, and Bottom Line into a single authoritative workflow section, eliminating redundant restatements of the same concept.
Move the CSO optimization section, anti-patterns, bulletproofing/rationalization guidance, and flowchart usage rules into separate reference files (e.g., references/cso-guide.md, references/anti-patterns.md) to practice the progressive disclosure the skill itself advocates.
Remove the 'About Skills' and 'What is a Skill?' sections entirely—Claude already understands these concepts, and the skill's own guidelines say to assume Claude's competence.
Add a concrete, end-to-end worked example showing the actual commands and subagent prompts used to create a real skill through the RED-GREEN-REFACTOR cycle, rather than only describing the process abstractly.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | This skill is extremely verbose at ~3000+ words. It explains concepts Claude already knows (what a skill is, what PDFs are, what TDD is), repeats the same ideas multiple times (the TDD mapping appears as a table, then as prose, then as a checklist, then as 'The Bottom Line'), and includes extensive sections that could be dramatically compressed. The 'About Skills' and 'What is a Skill?' sections are largely redundant. Anti-patterns, flowchart usage guidelines, and CSO optimization tips are all overly detailed for what could be conveyed in a fraction of the tokens. | 1 / 3 |
Actionability | The skill provides a concrete step-by-step process (Steps 1-6) with specific directory structures, YAML examples, and a checklist. However, much of the guidance is meta-level advice rather than executable commands. There's no actual executable code—the examples are mostly markdown templates and YAML snippets. The 'pressure scenario' testing approach references a sub-skill (testing-skills-with-subagents) without providing the actual methodology inline, leaving a gap in actionability. | 2 / 3 |
Workflow Clarity | The skill presents a clear 6-step creation process and a TDD-adapted checklist with RED/GREEN/REFACTOR phases. However, validation steps are mostly implicit ('verify agents now comply' without specifying how), and the workflow is spread across multiple sections that overlap (the TDD mapping table, the RED-GREEN-REFACTOR section, the checklist, and the Step-by-step process all describe similar workflows with different levels of detail). The critical testing methodology is deferred to an external skill, creating a gap in the workflow. | 2 / 3 |
Progressive Disclosure | The skill references external resources (testing-skills-with-subagents, graphviz-conventions.dot, persuasion-principles.md, apply-anthropic-skill-best-practices) which is good progressive disclosure. However, the SKILL.md itself is monolithic—it contains extensive inline content that could be split into reference files (e.g., the CSO section, the anti-patterns section, the bulletproofing section). No bundle files are provided despite the content clearly warranting them. The document would benefit from being its own example of good skill structure. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (843 lines); consider splitting into references/ and linking | Warning |
Total | 10 / 11 Passed | |
dedca19
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.