text-to-speech

tessl i github:martinholovsky/claude-skills-generator --skill text-to-speech

Expert skill for implementing text-to-speech with Kokoro TTS. Covers voice synthesis, audio generation, performance optimization, and secure handling of generated audio for JARVIS voice assistant.

Metric	Score
Review Score	65%
Validation Score	12/16
Implementation Score	77%
Activation Score	33%

Generated: about 4 hours ago

Warnings & errors only

Criteria	Message
description_trigger_hint	Description may be missing an explicit 'when to use' trigger hint (e.g., 'Use when...')
metadata_version	'metadata' field is not a dictionary
license_field	'license' field is missing
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata
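
The four warnings above can be read as simple checks on the skill's frontmatter once it is parsed into a dictionary. The following is a minimal sketch of such checks, not the registry's actual validator: the function name `check_frontmatter`, the `KNOWN_KEYS` allow-list, and the exact conditions are assumptions made for illustration, and only the criterion names are taken from the table.

```python
from typing import Any, Dict

# Assumption: this allow-list of top-level keys is illustrative only;
# the validator's real list is not shown in this review.
KNOWN_KEYS = {"name", "description", "license", "metadata"}


def check_frontmatter(frontmatter: Dict[str, Any]) -> Dict[str, str]:
    """Return {criterion: message} for each of the four checks that fails."""
    findings: Dict[str, str] = {}

    # description_trigger_hint: look for an explicit "Use when..." phrase.
    description = str(frontmatter.get("description", ""))
    if "use when" not in description.lower():
        findings["description_trigger_hint"] = "no explicit 'Use when...' hint"

    # metadata_version: if 'metadata' is present, it should be a mapping.
    if "metadata" in frontmatter and not isinstance(frontmatter["metadata"], dict):
        findings["metadata_version"] = "'metadata' field is not a dictionary"

    # license_field: the key must be present.
    if "license" not in frontmatter:
        findings["license_field"] = "'license' field is missing"

    # frontmatter_unknown_keys: anything outside the (assumed) allow-list.
    unknown = sorted(set(frontmatter) - KNOWN_KEYS)
    if unknown:
        findings["frontmatter_unknown_keys"] = "unknown key(s): " + ", ".join(unknown)

    return findings


# Hypothetical frontmatter values, invented to trigger each warning once.
before = {
    "name": "text-to-speech",
    "description": "Expert skill for implementing text-to-speech with Kokoro TTS.",
    "metadata": "1.0",       # a string rather than a dictionary
    "risk_level": "medium",  # hypothetical key outside the allow-list
}

# The same frontmatter with each warning addressed (placeholder values).
after = {
    "name": "text-to-speech",
    "description": "Generates speech with Kokoro TTS. Use when the user asks for TTS.",
    "metadata": {"version": "1.0"},
    "license": "LICENSE-PLACEHOLDER",
}
```

Here `check_frontmatter(before)` reports all four criteria and `check_frontmatter(after)` reports none; the real validator may apply different rules to the same fields.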

Overall Assessment (skill content)

This is a well-structured skill: its complete, executable code examples make it highly actionable, and its TDD workflows include clear validation checkpoints. However, it is more verbose than it needs to be, with unnecessary framing ('You are an expert...'), and it would benefit from better progressive disclosure: the detailed implementations should move to reference files, as the skill's own 'See references/ for detailed implementations' note already suggests.

Suggestions

Remove the 'You are an expert' framing and the 'You excel at' lists; Claude doesn't need this context about itself.
Move the full SecureTTSEngine implementation (Pattern 1) and the detailed test file to referenced files, keeping only concise examples in the main skill.
Consolidate the Performance Patterns section with Implementation Patterns to reduce redundancy between the streaming and caching examples.

Dimension	Score	Reasoning
Conciseness	2/3	The skill contains some unnecessary verbosity, particularly in the overview section explaining what Claude excels at and the 'You are an expert' framing. The content could be tightened by removing redundant explanations and consolidating similar patterns.
Actionability	3/3	Provides fully executable Python code examples throughout, including complete test files, implementation patterns with BAD/GOOD comparisons, and copy-paste ready commands for verification. The SecureTTSEngine implementation is comprehensive and runnable.
Workflow Clarity	3/3	Clear TDD workflow with explicit steps (write test → implement → refactor → verify), comprehensive checklists with validation checkpoints, and explicit verification commands. The pre-implementation checklist provides clear feedback loops for error recovery.
Progressive Disclosure	2/3	The file mentions 'See references/ for detailed implementations' but then includes extensive inline content that could be split. The 400+ line document contains full implementations that would be better organized in separate reference files, making the main skill harder to scan.

Overall Assessment (description)

The description identifies the technology stack (Kokoro TTS, JARVIS) and general capability areas but lacks the explicit trigger guidance essential for skill selection. It uses appropriate third-person voice but relies on category labels rather than concrete actions, and omits common user phrasings that would help Claude match this skill to requests.

Suggestions

Add a 'Use when...' clause with explicit triggers like 'Use when the user asks to convert text to speech, generate voice audio, or mentions Kokoro, TTS, or JARVIS voice features'.
Include common user phrasings and variations such as 'TTS', 'speak text', 'voice output', 'speech synthesis', and file extensions like '.wav' or '.mp3'.
Replace generic category terms with specific actions like 'convert text strings to audio files', 'select from multiple voice profiles', or 'stream real-time speech output'.
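
To make the three suggestions above concrete, here is a small sketch that pairs a possible rewritten description with a check for the user phrasings listed in the second suggestion. The rewritten wording is hypothetical and not the maintainer's text, the helper `missing_triggers` is invented for this example, and plain case-insensitive substring matching is an assumption rather than how skill selection actually works.

```python
from typing import List, Sequence

# Hypothetical rewrite that follows the suggestions: concrete actions first,
# then an explicit "Use when..." clause with common user phrasings.
REVISED_DESCRIPTION = (
    "Converts text strings to audio files, selects voice profiles, and streams "
    "real-time speech output with Kokoro TTS for the JARVIS voice assistant. "
    "Use when the user asks to convert text to speech, generate voice audio, "
    "speak text, or produce voice output, or mentions Kokoro, TTS, speech "
    "synthesis, .wav or .mp3 files, or JARVIS voice features."
)

# Phrasings and file extensions named in the review's suggestions.
TRIGGER_TERMS = ["TTS", "speak text", "voice output", "speech synthesis", ".wav", ".mp3"]


def missing_triggers(description: str, terms: Sequence[str] = TRIGGER_TERMS) -> List[str]:
    """Return the terms that do not occur in the description (case-insensitive)."""
    lowered = description.lower()
    return [term for term in terms if term.lower() not in lowered]


def has_use_when_clause(description: str) -> bool:
    """Report whether the description contains an explicit 'Use when' phrase."""
    return "use when" in description.lower()
```

A check like this only confirms that the wording is present; whether the description actually improves skill selection would still need to be measured with eval scenarios.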

Dimension	Score	Reasoning
Specificity	2/3	Names the domain (text-to-speech, Kokoro TTS) and lists some actions (voice synthesis, audio generation, performance optimization), but these are somewhat generic categories rather than concrete specific actions like 'convert text to WAV files' or 'adjust speech rate and pitch'.
Completeness	1/3	Describes what the skill does but completely lacks a 'Use when...' clause or any explicit trigger guidance. There is no indication of when Claude should select this skill over others.
Trigger Term Quality	2/3	Includes relevant keywords like 'text-to-speech', 'Kokoro TTS', 'voice synthesis', 'audio generation', and 'JARVIS', but misses common user variations like 'TTS', 'speech synthesis', 'voice output', 'speak text', or 'generate speech'.
Distinctiveness Conflict Risk	2/3	The mention of 'Kokoro TTS' and 'JARVIS voice assistant' provides some distinctiveness, but 'text-to-speech' and 'audio generation' are broad enough to potentially overlap with other audio or voice-related skills.

Repository: github.com/martinholovsky/claude-skills-generator
Last updated: about 4 hours ago
Created: about 4 hours ago
