CtrlK
BlogDocsLog inGet started
Tessl Logo

whisper

Transcribe audio files to text using OpenAI Whisper

67

Quality

58%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./examples/skill/skills/whisper/SKILL.md
SKILL.md
Quality
Evals
Security

Whisper Audio Transcription Skill

Transcribe audio files to text using OpenAI Whisper.

Capabilities

  • Transcribe audio files (MP3, WAV, M4A, FLAC, OGG, etc.) to text
  • Support for 90+ languages with auto-detection
  • Optional timestamp generation
  • Multiple model sizes (tiny/base/small/medium/large)
  • Output in plain text or JSON format

Usage

Basic Transcription

python3 scripts/transcribe.py <audio_file> <output_file>

With Options

# Specify model size (default: base)
python3 scripts/transcribe.py audio.mp3 transcript.txt --model medium

# Specify language (improves accuracy)
python3 scripts/transcribe.py audio.mp3 transcript.txt --language zh

# Include timestamps
python3 scripts/transcribe.py audio.mp3 transcript.txt --timestamps

# JSON output with metadata
python3 scripts/transcribe.py audio.mp3 output.json --format json

Parameters

  • audio_file (required): Path to input audio file
  • output_file (required): Path to output text/JSON file
  • --model: Whisper model size (tiny/base/small/medium/large, default: base)
  • --language: Language code (e.g., en, zh, es, fr, auto for detection)
  • --timestamps: Include word-level timestamps in output
  • --format: Output format (text/json, default: text)

Model Sizes

ModelParametersSpeedAccuracyMemory
tiny39M~32xGood~1GB
base74M~16xBetter~1GB
small244M~6xGreat~2GB
medium769M~2xExcellent~5GB
large1.5B1xBest~10GB

Supported Audio Formats

MP3, WAV, M4A, FLAC, OGG, AAC, WMA, and more (via FFmpeg)

Dependencies

  • Python 3.8+
  • openai-whisper
  • ffmpeg

Installation

pip install openai-whisper
sudo apt-get install ffmpeg  # Ubuntu/Debian
Repository
trpc-group/trpc-agent-go
Last updated
Created

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.