This skill should be used when the user asks to create a new skill, build a skill, make a custom skill, develop a CLI skill, or wants to extend the CLI with new capabilities. Automates the entire skill creation workflow from brainstorming to installation.
68
68%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has excellent trigger term coverage and completeness, clearly stating both when to use it and what it does. Its main weakness is that the 'what' portion is somewhat high-level—'automates the entire skill creation workflow from brainstorming to installation' could benefit from listing specific concrete steps or actions. Overall, it's a solid description that would perform well in skill selection.
Suggestions
Add more specific concrete actions to the 'what' portion, e.g., 'generates skill YAML frontmatter, writes markdown instructions, creates file structure, and installs the skill into the CLI'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | It names the domain (skill creation) and mentions 'automates the entire skill creation workflow from brainstorming to installation,' but doesn't list specific concrete actions beyond that high-level summary. What does 'brainstorming' entail? What steps are involved? | 2 / 3 |
Completeness | Explicitly answers both 'when' ('when the user asks to create a new skill, build a skill, make a custom skill...') and 'what' ('Automates the entire skill creation workflow from brainstorming to installation'). The 'Use when' equivalent is clearly stated at the beginning. | 3 / 3 |
Trigger Term Quality | Includes strong natural trigger terms: 'create a new skill', 'build a skill', 'make a custom skill', 'develop a CLI skill', 'extend the CLI with new capabilities'. These cover multiple natural phrasings a user would actually say. | 3 / 3 |
Distinctiveness Conflict Risk | The concept of 'skill creation' and 'extending the CLI' is a very specific niche. The trigger terms are distinct and unlikely to overlap with other skills unless there were multiple skill-creation skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill has a well-thought-out multi-phase workflow for creating CLI skills, with good error handling coverage and installation options. However, it is severely bloated—the repeated progress bar displays in multiple formats, ASCII art boxes for every phase, and explanations of basic operations inflate the token cost dramatically. The reliance on undefined variables and external templates/scripts that aren't provided limits true actionability.
Suggestions
Reduce token usage by at least 60%: show the progress bar format ONCE with a note to repeat it per phase, eliminate duplicate ASCII art boxes, and remove explanations of basic concepts like symlinks and sed.
Define how user input variables ($USER_INPUT, $PLATFORM, $DESCRIPTION) are captured and stored, or provide a concrete input-gathering script rather than leaving them undefined.
Move the detailed error handling examples, progress bar formatting specs, and template substitution details into reference files, keeping SKILL.md as a lean workflow overview.
Include at least one complete, minimal template inline (or as a bundled file with actual content) so the skill is functional without external dependencies that aren't provided.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Extremely verbose at ~400+ lines. Massive amounts of redundant progress bar displays (shown in multiple formats for every phase), ASCII art boxes repeated for each step, and explanations of basic concepts like symlinks and sed. The progress tracking alone consumes enormous token budget with near-identical repeated blocks. Much of this could be condensed to a fraction of the size. | 1 / 3 |
Actionability | Contains concrete bash commands and file operations, but relies heavily on external templates (skill-template-copilot.md, validate-skill-yaml.sh, etc.) that aren't provided. The bash snippets use undefined variables like $USER_INPUT, $PLATFORM, $DESCRIPTION without showing how they're captured. The code is illustrative rather than truly executable end-to-end. | 2 / 3 |
Workflow Clarity | The 5-phase workflow is clearly sequenced and includes validation in Phase 4 with error recovery options. However, the validation step happens after file generation but relies on external scripts that aren't included. The error handling section covers edge cases well, but the feedback loop for validation failures is somewhat vague ('Offer to fix automatically' without specifying how). The workflow is also muddied by the excessive progress bar formatting that obscures the actual steps. | 2 / 3 |
Progressive Disclosure | References a references/ directory and external files appropriately, and mentions bundled resources at the end. However, the SKILL.md itself is monolithic with enormous inline content that should be split out. The progress bar formatting, detailed error handling examples, and template substitution details could all live in reference files. The references section at the end is well-structured but the main body doesn't practice what it preaches about progressive disclosure. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (594 lines); consider splitting into references/ and linking | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
Reviewed
Table of Contents