Local speech-to-text with the Whisper CLI (no API key).
76
70%
Does it follow best practices?
Impact
91%
0.91x
Average score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./openclaw/skills/openai-whisper/SKILL.md
Quality
Discovery
40%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description identifies a clear and distinct niche (local Whisper CLI speech-to-text without an API key), which makes it highly distinguishable. However, it lacks specific concrete actions (e.g., transcribe audio, generate subtitles) and critically omits a 'Use when...' clause with natural trigger terms users would employ.
Suggestions
Add a 'Use when...' clause such as 'Use when the user wants to transcribe audio files, convert speech to text locally, or generate subtitles without an API key.'
List specific actions like 'Transcribes audio files, generates subtitles/SRT files, supports multiple audio formats (.mp3, .wav, .m4a).'
Include natural trigger terms users would say: 'transcribe', 'audio', 'transcription', 'voice to text', 'subtitle', 'SRT'.
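Taken together, the three suggestions above could yield a description along the following lines. This is only a sketch: the `name` value is assumed from the skill's path, and everything in `description` after the first sentence is illustrative wording assembled from the suggestions, not the maintainer's actual text.

```yaml
# Hypothetical SKILL.md frontmatter fragment (illustrative, not the published file).
name: openai-whisper  # assumed from the path ./openclaw/skills/openai-whisper/
description: >-
  Local speech-to-text with the Whisper CLI (no API key). Transcribes audio
  files (.mp3, .wav, .m4a) and generates subtitle/SRT files. Use when the user
  wants to transcribe audio, convert speech to text locally (voice to text),
  or generate subtitles without an API key.
```

The folded block scalar (`>-`) keeps the description as a single line once parsed, so the added trigger terms do not change how the field is read.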
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Names the domain (speech-to-text) and the tool (Whisper CLI), and clarifies 'no API key' which is a useful detail, but doesn't list specific actions like transcribing audio files, generating subtitles, converting formats, etc. | 2 / 3 |
| Completeness | Describes what it does at a high level (speech-to-text with Whisper CLI) but has no 'Use when...' clause or explicit trigger guidance, which per the rubric should cap completeness at 2, and since the 'what' is also quite thin, this lands at 1. | 1 / 3 |
| Trigger Term Quality | Includes 'speech-to-text', 'Whisper', and 'CLI' which are relevant, but misses common user terms like 'transcribe', 'audio', 'transcription', '.mp3', '.wav', 'subtitle', 'SRT', or 'voice to text'. | 2 / 3 |
| Distinctiveness Conflict Risk | The combination of 'speech-to-text', 'Whisper CLI', and 'no API key' creates a very specific niche that is unlikely to conflict with other skills. It clearly distinguishes itself from API-based transcription or other audio processing skills. | 3 / 3 |
| Total | | 8 / 12 Passed |
Implementation
100%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is an excellent, minimal skill that does exactly what it needs to. It provides concrete, executable commands, avoids explaining concepts Claude already knows, and is well-structured for its scope. The only minor improvement would be adding a heading marker (##) before 'Quick start' and 'Notes' for consistent markdown formatting, but this is cosmetic.
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | Extremely lean: no unnecessary explanations, no description of what Whisper is or how speech-to-text works. Every line adds value. | 3 / 3 |
| Actionability | Provides fully executable, copy-paste-ready CLI commands with realistic flags and file paths. The two examples cover the primary use cases (transcribe and translate) with different output formats. | 3 / 3 |
| Workflow Clarity | This is a simple, single-purpose skill (run a CLI command). The single action is unambiguous, and the notes section covers the key operational details (model caching, defaults). No multi-step or destructive operations require validation checkpoints. | 3 / 3 |
| Progressive Disclosure | For a skill under 50 lines with no need for external references, the content is well-organized into a quick start section and notes. No unnecessary nesting or missing structure. | 3 / 3 |
| Total | | 12 / 12 Passed |
Validation
72%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 8 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| metadata_version | 'metadata.version' is missing | Warning |
| metadata_field | 'metadata' should map string keys to string values | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | 8 / 11 Passed | |
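The three warnings above all concern the frontmatter of SKILL.md. As a rough sketch of a shape that would plausibly address them, assuming the validator only expects a flat `metadata` map with a `version` entry: the skill's actual frontmatter is not shown in this review, so the `name` value, the version number, and the `author` key below are hypothetical placeholders.

```yaml
# Hypothetical SKILL.md frontmatter; keys and values are illustrative only.
name: openai-whisper  # assumed from the skill path
description: Local speech-to-text with the Whisper CLI (no API key).
metadata:
  version: "1.0.0"        # placeholder; quoted so it stays a string (metadata_version)
  author: "example-team"  # placeholder for a formerly top-level key moved here (frontmatter_unknown_keys)
# Every metadata value is a plain string, with no nested maps or lists (metadata_field).
```

Quoting the values matters for a version such as `1.0`, which YAML would otherwise parse as a number and so trip the string-values check.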
0c1ec2b