deepgram-js-text-to-speech

Use when writing or reviewing JavaScript/TypeScript in this repo that calls Deepgram Text-to-Speech v1 (`/v1/speak`) for audio synthesis. Covers one-shot REST via `client.speak.v1.audio.generate` and streaming WebSocket via `client.speak.v1.createConnection()` / `connect()`. Use `deepgram-js-voice-agent` when you need full-duplex STT + LLM + TTS instead of one-way synthesis. Triggers include "TTS", "text to speech", "speak", "aura", "streaming TTS", and "speak.v1".

71

Quality

86%

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Passed

No known issues

Quality

Content

72%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, well-organized skill that provides concrete, executable examples for both REST and WebSocket TTS paths. The gotchas section is particularly valuable, capturing non-obvious SDK behaviors. The main weaknesses are minor verbosity in a few sections and the lack of error-handling/validation guidance for the WebSocket workflow, which is a multi-step process prone to connection and encoding issues.

Suggestions

Add error handling and validation checkpoints to the WebSocket workflow (e.g., handling connection errors from `connect()`, checking `waitForOpen()` timeout, verifying audio payload type before processing).

Trim the 'Central product skills' section to 1-2 lines or move it to a footer — it's tangential to the TTS task and consumes tokens.
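
One way to act on the first suggestion is sketched below in TypeScript. It is not taken from the skill or the SDK: `withTimeout` and `toAudioBytes` are hypothetical helpers, and the assumption that audio arrives as a `Uint8Array` (including Node `Buffer`) or an `ArrayBuffer` should be checked against the SDK's actual message types.

```typescript
// Hypothetical helpers for illustration; neither is part of the Deepgram SDK.

/** Rejects if `promise` does not settle within `ms` milliseconds. */
function withTimeout<T>(promise: Promise<T>, ms: number, label: string): Promise<T> {
  return new Promise<T>((resolve, reject) => {
    const timer = setTimeout(
      () => reject(new Error(`${label} timed out after ${ms} ms`)),
      ms,
    );
    promise.then(
      (value) => { clearTimeout(timer); resolve(value); },
      (err) => { clearTimeout(timer); reject(err); },
    );
  });
}

/** Returns the payload as bytes if it is binary, otherwise null. */
function toAudioBytes(payload: unknown): Uint8Array | null {
  if (payload instanceof Uint8Array) return payload; // also matches Node Buffer
  if (payload instanceof ArrayBuffer) return new Uint8Array(payload);
  return null; // assumed to be a control message (JSON object or string)
}
```

A caller could bound the open wait with `await withTimeout(connection.waitForOpen(), 5000, "waitForOpen")`, assuming `waitForOpen()` returns a promise, and skip any message for which `toAudioBytes` returns `null`.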

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | Generally efficient but includes some unnecessary explanation (e.g., the 'Central product skills' section at the bottom, the 'When to use this product' section partially restates what Claude could infer). The gotchas section is valuable but a couple items border on obvious. Overall mostly lean but could be tightened. | 2 / 3 |
| Actionability | Provides fully executable, copy-paste-ready code examples for both REST and WebSocket paths, with specific method names, event types, and structured message formats. The gotchas section adds concrete, specific guidance that prevents real mistakes. | 3 / 3 |
| Workflow Clarity | The REST path is a single call and is clear. The WebSocket path shows the correct sequence (create → register handlers → connect → waitForOpen → sendText → sendFlush) but lacks explicit validation/error-handling checkpoints. For WebSocket operations where connection failures and audio payload mismatches are common, there are no feedback loops or error recovery steps mentioned. | 2 / 3 |
| Progressive Disclosure | Well-structured with clear sections progressing from quick starts to parameters to gotchas. References to example files, in-repo source paths, and external docs are clearly signaled and one level deep. The layered API reference section is particularly well-organized for discovery. | 3 / 3 |
| Total | | 10 / 12 |

Passed
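
The WebSocket sequence named under Workflow Clarity could carry a fail-fast checkpoint along the following lines. This is a sketch only: `SpeakSocketLike` and `speakOnce` are hypothetical, the method names are copied from the sequence in the review, and the `{ type: "Speak" }` / `{ type: "Flush" }` message shapes are an assumption that should be confirmed against the SDK before use.

```typescript
// Hypothetical interface: names follow the reviewed sequence, but the real
// SDK signatures and message shapes may differ.
interface SpeakSocketLike {
  on(event: "message" | "error", handler: (payload: unknown) => void): void;
  connect(): void;
  waitForOpen(): Promise<void>;
  sendText(message: { type: "Speak"; text: string }): void;
  sendFlush(message: { type: "Flush" }): void;
}

async function speakOnce(
  socket: SpeakSocketLike,
  text: string,
  onMessage: (payload: unknown) => void,
): Promise<void> {
  // 1. Register handlers before connecting so early events are not missed.
  const errors: unknown[] = [];
  socket.on("error", (err) => { errors.push(err); });
  socket.on("message", onMessage);

  // 2. Connect and wait for the open event; surface failures with context.
  try {
    socket.connect();
    await socket.waitForOpen();
  } catch (err) {
    throw new Error(`TTS connection failed: ${String(err)}`);
  }

  // 3. Checkpoint: do not send if an error was reported while opening.
  if (errors.length > 0) {
    throw new Error(`TTS socket reported an error: ${String(errors[0])}`);
  }

  // 4. Send the text, then flush so buffered text is synthesized.
  socket.sendText({ type: "Speak", text });
  socket.sendFlush({ type: "Flush" });
}
```

Keeping the send calls after the checkpoint means a failed open never produces a half-sent request, which is the kind of feedback loop the review finds missing.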

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly specifies concrete actions (one-shot REST and streaming WebSocket TTS), provides explicit 'Use when' guidance, lists natural trigger terms, and proactively distinguishes itself from a related skill. It is concise, uses third-person voice, and covers all dimensions thoroughly.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Lists multiple specific concrete actions: one-shot REST via `client.speak.v1.audio.generate`, streaming WebSocket via `client.speak.v1.createConnection()` / `connect()`, and references the specific API endpoint `/v1/speak`. Also distinguishes from a related skill (`deepgram-js-voice-agent`). | 3 / 3 |
| Completeness | Clearly answers both 'what' (calls Deepgram TTS v1 for audio synthesis via REST and WebSocket) and 'when' (explicit 'Use when' clause at the start, plus explicit trigger terms and a boundary condition distinguishing from the voice agent skill). | 3 / 3 |
| Trigger Term Quality | Explicitly lists natural trigger terms users would say: 'TTS', 'text to speech', 'speak', 'aura', 'streaming TTS', and 'speak.v1'. These cover common variations and natural language a user would use when requesting text-to-speech functionality. | 3 / 3 |
| Distinctiveness Conflict Risk | Highly distinctive: scoped to a specific repo, a specific API (Deepgram TTS v1), specific SDK methods, and explicitly differentiates from the related `deepgram-js-voice-agent` skill for full-duplex scenarios. Very unlikely to conflict with other skills. | 3 / 3 |
| Total | | 12 / 12 |

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: deepgram/deepgram-js-sdk
Reviewed
