Name: deepgram-python-speech-to-text
Rating: 80 (1 review)
Author: deepgram

deepgram-python-speech-to-text

Use when writing or reviewing Python code in this repo that calls Deepgram Speech-to-Text v1 (`/v1/listen`) for prerecorded or live audio transcription. Covers `client.listen.v1.media.transcribe_url` / `transcribe_file` (REST) and `client.listen.v1.connect` (WebSocket). Use this skill for basic ASR; use `deepgram-python-audio-intelligence` for summarize/sentiment/topics/diarize overlays, `deepgram-python-conversational-stt` for turn-taking v2/Flux, and `deepgram-python-voice-agent` for full-duplex assistants. Triggers include "transcribe", "live transcription", "speech to text", "STT", "listen endpoint", "nova-3", "listen.v1".

Quality

100%

Does it follow best practices?

Security

Low

Low-risk findings worth noting

Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body delivers dense, executable, Deepgram-specific guidance with genuinely non-obvious API semantics (flag states, async vs deferred, Bearer-vs-Token auth) and clear workflow sequencing, while keeping references one level deep and well-signaled. It is token-efficient for the complexity it covers.

Dimension	Reasoning	Score
Conciseness	The body is lean and code-driven; prose (interim/final flag semantics, the async-vs-deferred distinction, auth-scheme gotcha) covers non-obvious Deepgram-specific behavior Claude would not already know, so every section earns its tokens.	3 / 3
Actionability	Provides fully executable, copy-paste-ready examples for REST URL, REST file, WebSocket streaming, async, and webhook-callback patterns, plus specific response-extraction paths like response.results.channels[0].alternatives[0].transcript.	3 / 3
Workflow Clarity	The WebSocket flow is clearly sequenced (connect -> register handlers -> start_listening in a thread -> send chunks -> send_finalize -> close) with the send_finalize() clean-close checkpoint and a gotchas checklist covering failure modes; this is not a destructive/batch operation requiring a validate-fix-retry loop.	3 / 3
Progressive Disclosure	No bundle files exist in references/scripts/assets, so disclosure is judged on the body's organization: well-structured sections with one-level-deep, clearly signaled pointers (in-repo reference.md, OpenAPI/AsyncAPI URLs, Context7, product docs, and example files) and a layered API-reference navigation block.	3 / 3
	Total	12 / 12 Passed
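
The WebSocket sequence named in the Workflow Clarity row (connect -> register handlers -> start_listening in a thread -> send chunks -> send_finalize -> close) can be sketched as below. This is a minimal illustration under stated assumptions, not the SDK's code: `FakeConnection` and `stream_chunks` are hypothetical, and the `on` and `send_media` method names are assumptions standing in for "register handlers" and "send chunks". Only `start_listening` and `send_finalize` are named in the review itself. The stand-in records calls so the ordering can be checked without a network connection or API key.

```python
import threading


class FakeConnection:
    """Hypothetical stand-in for a live connection; it only records calls."""

    def __init__(self):
        self.calls = []
        self._closed = threading.Event()

    def on(self, event, handler):
        # Assumed name for handler registration.
        self.calls.append("on")

    def start_listening(self):
        # Blocks until close() is called, like a receive loop would.
        self.calls.append("start_listening")
        self._closed.wait(timeout=5)

    def send_media(self, chunk):
        # Assumed name for sending one audio chunk.
        self.calls.append("send_media")

    def send_finalize(self):
        self.calls.append("send_finalize")

    def close(self):
        self.calls.append("close")
        self._closed.set()


def stream_chunks(connection, chunks):
    """Run the sequence: handlers, listener thread, chunks, finalize, close."""
    connection.on("message", lambda message: None)
    # The receive loop blocks, so it runs in its own thread.
    listener = threading.Thread(target=connection.start_listening, daemon=True)
    listener.start()
    for chunk in chunks:
        connection.send_media(chunk)
    # Finalize before closing so buffered audio is flushed first.
    connection.send_finalize()
    connection.close()
    listener.join(timeout=5)


conn = FakeConnection()
stream_chunks(conn, [b"chunk-1", b"chunk-2"])
# The listener thread runs concurrently, so its entry is filtered out
# before checking the order of the main-thread calls.
main_thread_calls = [c for c in conn.calls if c != "start_listening"]
print(main_thread_calls)
```

Running the listener in a separate thread keeps the sending loop free to push audio while messages arrive; calling `send_finalize` before `close` matches the clean-close checkpoint the review describes.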

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, trigger-rich, complete, and clearly distinct from sibling Deepgram skills, all in concise third-person voice. It is a strong model of the 'Use when... + concrete capabilities + triggers' pattern.

Dimension	Reasoning	Score
Specificity	Lists multiple concrete actions — 'prerecorded or live audio transcription' — and names specific SDK methods 'transcribe_url / transcribe_file (REST) and connect (WebSocket)', matching the 'multiple specific concrete actions' anchor.	3 / 3
Completeness	Explicitly answers both what (Deepgram STT v1 transcription via REST/WebSocket) and when ('Use when writing or reviewing Python code in this repo that calls Deepgram Speech-to-Text v1') with an explicit triggers list.	3 / 3
Trigger Term Quality	'Triggers include "transcribe", "live transcription", "speech to text", "STT", "listen endpoint", "nova-3", "listen.v1"' gives good coverage of natural terms a user would actually say.	3 / 3
Distinctiveness Conflict Risk	Explicitly carves a niche ('basic ASR') and disambiguates three sibling skills (audio-intelligence, conversational-stt, voice-agent), making wrong-skill triggering unlikely.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository: deepgram/deepgram-python-sdk
Path: .agents/skills/deepgram-python-speech-to-text/SKILL.md
Commit: 7da415d

Reviewed: 1 day ago