jbaruch/speaker-toolkit

Two-skill presentation system: analyze your speaking style into a rhetoric knowledge vault, then create new presentations that match your documented patterns. Includes an 88-entry Presentation Patterns taxonomy for scoring, brainstorming, and go-live preparation.

Download Commands Reference

Name: jbaruch/speaker-toolkit
Rating: 0.9604 (1 reviews)
Author: jbaruch

A. Download Transcript

Primary: yt-dlp

yt-dlp --write-auto-sub --sub-lang en --skip-download --sub-format vtt \
  -o "{vault_root}/transcripts/{youtube_id}" "https://www.youtube.com/watch?v={youtube_id}"

VTT Cleanup (required after yt-dlp download)

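yt-dlp writes the captions to {vault_root}/transcripts/{youtube_id}.en.vtt. The file contains a WEBVTT header, timestamps, cue position markers, inline timing tags, and duplicate lines. Clean it before analysis:

Strip the header lines (WEBVTT, Kind:, Language:)
Strip all timestamp lines (lines matching \d{2}:\d{2}.*-->)
Strip cue position markers (lines like align:start position:0%)
Strip inline tags (anything in angle brackets, such as <c> or <00:00:01.000>)
Remove blank lines
Deduplicate consecutive identical lines
Save the cleaned text as {vault_root}/transcripts/{youtube_id}.txt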
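
The cleanup steps above can be sketched as a small shell function. This is a minimal sketch, not the toolkit's own implementation: clean_vtt is a hypothetical name, and the handling of the WEBVTT header lines and inline timing tags assumes the shape that YouTube auto-captions typically have. It uses [0-9] instead of \d because sed's extended regex syntax does not portably support \d.

```shell
# Hypothetical helper: clean a yt-dlp .vtt file into plain transcript text.
# Usage: clean_vtt input.vtt output.txt
clean_vtt() {
  sed -E \
    -e '/^WEBVTT/d' \
    -e '/^(Kind|Language):/d' \
    -e '/[0-9]{2}:[0-9]{2}.*-->/d' \
    -e '/align:start position:/d' \
    -e 's/<[^>]*>//g' \
    -e 's/^[[:space:]]+//; s/[[:space:]]+$//' \
    "$1" |
    # Drop blank lines, then drop lines identical to the previous kept line
    awk 'NF && $0 != prev { print; prev = $0 }' > "$2"
}
```

Blank lines are removed before the duplicate check, so repeated captions separated only by an empty cue still collapse into one line.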

Fallback: youtube-transcript-api

Use if yt-dlp fails (e.g., no auto-captions available):

"{python_path}" -c "
from youtube_transcript_api import YouTubeTranscriptApi
transcript = YouTubeTranscriptApi.get_transcript('{youtube_id}', languages=['en'])
for entry in transcript:
    print(entry['text'])
" > "{vault_root}/transcripts/{youtube_id}.txt"

Where {python_path} is config.python_path from the tracking database.

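This produces clean text directly, so no VTT cleanup is needed. Note that get_transcript is the static interface of youtube-transcript-api releases before 1.0; later releases appear to replace it with an instance method (YouTubeTranscriptApi().fetch), so pin an older version or adapt the call if it raises an AttributeError.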
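
One way to decide when the fallback is needed is to check whether the primary command actually wrote a subtitle file, since yt-dlp may exit successfully even when no captions exist. The sketch below is an illustration under assumptions: needs_fallback is a hypothetical helper, and it assumes yt-dlp names subtitle files {youtube_id}.{lang}.vtt.

```shell
# Hypothetical helper: exit 0 when no non-empty subtitle file exists for the ID,
# meaning the youtube-transcript-api fallback should run.
# Usage: needs_fallback "{vault_root}/transcripts" "{youtube_id}"
needs_fallback() {
  transcripts_dir="$1"
  youtube_id="$2"
  for f in "$transcripts_dir/$youtube_id".*.vtt; do
    # An unmatched glob stays literal, so test for a real, non-empty file
    [ -s "$f" ] && return 1
  done
  return 0
}
```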

B. Download Slides PDF

"{python_path}" -m gdown "https://drive.google.com/uc?id={google_drive_id}" \
  -O "{vault_root}/slides/{google_drive_id}.pdf"

Where {python_path} is config.python_path from the tracking database.

Then read the PDF to understand slide content and visual structure.
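
Before reading the file, it can help to confirm that the download really is a PDF, because a Drive download can leave something else on disk (for example an HTML error page, or a file in another format). A minimal sketch, assuming a hypothetical is_pdf helper that checks the leading "%PDF-" bytes every PDF starts with:

```shell
# Hypothetical check: succeed only if the file begins with the PDF magic bytes.
# Usage: is_pdf "{vault_root}/slides/{google_drive_id}.pdf"
is_pdf() {
  [ "$(head -c 5 "$1" 2>/dev/null)" = "%PDF-" ]
}
```

A missing file also fails the check, so a single call covers both a failed download and a wrong file type.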

C. Download Video (for slide extraction)

When no PDF or PPTX is available, download the video to extract slides from frames.

mkdir -p "{vault_root}/slides-rebuild/{youtube_id}"
yt-dlp -f "bestvideo[height<=720][ext=mp4]+bestaudio[ext=m4a]/best[height<=720][ext=mp4]/best[height<=720]" \
  --merge-output-format mp4 \
  -o "{vault_root}/slides-rebuild/{youtube_id}/{youtube_id}.mp4" \
  "https://www.youtube.com/watch?v={youtube_id}"

After download, run the extraction script from references/video-slide-extraction.md. The script extracts frames, detects the slide region, deduplicates, and produces a PDF at slides/{youtube_id}.pdf. Delete the video after extraction to save space.
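
Deleting the video is safest once the extracted PDF is confirmed to exist. The sketch below guards the deletion that way. It is an illustration, not part of the toolkit: remove_video_if_extracted is a hypothetical name, and it assumes slides/{youtube_id}.pdf is relative to {vault_root}.

```shell
# Hypothetical guard: delete the downloaded video only when the slide PDF
# exists and is non-empty; otherwise keep the video and report why.
# Usage: remove_video_if_extracted "{vault_root}" "{youtube_id}"
remove_video_if_extracted() {
  vault_root="$1"
  youtube_id="$2"
  pdf="$vault_root/slides/$youtube_id.pdf"
  video="$vault_root/slides-rebuild/$youtube_id/$youtube_id.mp4"
  if [ -s "$pdf" ]; then
    rm -f "$video"
  else
    echo "Keeping $video: $pdf is missing or empty" >&2
    return 1
  fi
}
```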

D. Batch Video Download

For processing many playlist talks at once, download videos in parallel:

# Download up to 3 videos concurrently (wait -n requires bash 4.3 or later)
for yt_id in ID1 ID2 ID3 ...; do
  (
    # Report success only when yt-dlp exits cleanly
    if yt-dlp -f "bestvideo[height<=720][ext=mp4]+bestaudio[ext=m4a]/best[height<=720]" \
      --merge-output-format mp4 \
      -o "{vault_root}/slides-rebuild/${yt_id}/${yt_id}.mp4" \
      "https://www.youtube.com/watch?v=${yt_id}" 2>/dev/null; then
      echo "Downloaded: ${yt_id}"
    else
      echo "Failed: ${yt_id}" >&2
    fi
  ) &
  # Limit concurrency: block until one running job finishes
  [ "$(jobs -r -p | wc -l)" -ge 3 ] && wait -n
done
wait

Then run the extraction script on each downloaded video sequentially.
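
The sequential pass can be sketched as a loop that skips IDs whose download failed. This is a sketch under assumptions: run_extractions is a hypothetical helper, and the extraction command is passed in as an argument because the actual script name comes from references/video-slide-extraction.md and is not shown here.

```shell
# Hypothetical helper: run an extraction command on each downloaded video,
# one at a time, skipping IDs that have no video file.
# Usage: run_extractions "{vault_root}" <extract-command> ID1 ID2 ID3
run_extractions() {
  vault_root="$1"
  extract_cmd="$2"   # assumption: a command that takes the video path as its only argument
  shift 2
  for yt_id in "$@"; do
    video="$vault_root/slides-rebuild/$yt_id/$yt_id.mp4"
    if [ ! -f "$video" ]; then
      echo "Skipping $yt_id: no video at $video" >&2
      continue
    fi
    "$extract_cmd" "$video" || echo "Extraction failed: $yt_id" >&2
  done
}
```

A failed extraction is reported but does not stop the loop, so one bad video does not block the rest of the batch.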

Install with Tessl CLI

npx tessl i jbaruch/speaker-toolkit@0.6.2
