CtrlK
BlogDocsLog inGet started
Tessl Logo

deepgram-python-text-to-speech

Use when writing or reviewing Python code in this repo that calls Deepgram Text-to-Speech v1 (`/v1/speak`) for audio synthesis. Covers one-shot REST (`client.speak.v1.audio.generate`) and streaming WebSocket (`client.speak.v1.connect`). Also covers the in-repo `deepgram.helpers.TextBuilder` for incremental text assembly before synthesis. Use `deepgram-python-voice-agent` when you need full-duplex STT + LLM + TTS with barge-in. Triggers include "TTS", "speak", "synthesize voice", "aura", "text to speech", "speak.v1", "TextBuilder".

75

Quality

92%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, well-structured skill that provides concrete, executable guidance for both REST and WebSocket TTS workflows. The gotchas section adds significant value by documenting non-obvious failure modes. Minor verbosity in the introductory sections and the cross-promotion footer slightly reduce token efficiency, but overall the skill is highly actionable and well-organized.

DimensionReasoningScore

Conciseness

The skill is mostly efficient with good code examples and minimal fluff, but some sections could be tightened — e.g., the 'When to use this product' section restates what's already clear from the quick starts, and the 'Central product skills' section at the bottom adds tokens for cross-promotion that aren't directly actionable for TTS usage. The gotchas section is well-justified though.

2 / 3

Actionability

Provides fully executable, copy-paste-ready code for REST, WebSocket, async, and TextBuilder usage patterns. Specific model names, encoding formats, import paths, and method signatures are all concrete and directly usable.

3 / 3

Workflow Clarity

The two primary workflows (REST one-shot and WebSocket streaming) are clearly sequenced with explicit ordering constraints (e.g., 'send all text + flush + close BEFORE calling start_listening'). The gotchas section serves as validation checkpoints, calling out common failure modes like missing flush before close and auth header format.

3 / 3

Progressive Disclosure

Content is well-structured with a clear overview, quick starts for both modes, then progressively deeper sections (TextBuilder, async, parameters, API references). References to example files, reference.md, and external docs are one level deep and clearly signaled. No bundle files were provided, but the skill is self-contained enough that this is appropriate.

3 / 3

Total

11

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly defines its scope (Deepgram TTS v1), lists concrete API methods and patterns, provides abundant natural trigger terms, and explicitly delineates its boundary from a related skill. The description is concise yet comprehensive, and uses proper third-person voice throughout.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: one-shot REST audio generation, streaming WebSocket connection, TextBuilder for incremental text assembly. Also names specific API paths and method signatures like `client.speak.v1.audio.generate` and `client.speak.v1.connect`.

3 / 3

Completeness

Clearly answers both 'what' (Deepgram TTS v1 for audio synthesis via REST and WebSocket, plus TextBuilder) and 'when' (explicit 'Use when writing or reviewing Python code that calls Deepgram TTS'). Also includes a helpful negative boundary directing to another skill for full-duplex scenarios.

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms explicitly listed: 'TTS', 'speak', 'synthesize voice', 'aura', 'text to speech', 'speak.v1', 'TextBuilder'. These cover both casual user language and technical terms users would naturally use.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive with a clear niche (Deepgram TTS v1 specifically), and explicitly differentiates itself from a related skill (`deepgram-python-voice-agent` for full-duplex STT + LLM + TTS), reducing conflict risk significantly.

3 / 3

Total

12

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
deepgram/deepgram-python-sdk
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.