jbaruch/speaker-toolkit

Five-skill presentation system: ingest talks into a rhetoric vault, run interactive clarification, generate a speaker profile, create presentations that match your documented patterns, and produce the deck illustrations + thumbnail visual layer. Includes a 102-entry Presentation Patterns taxonomy (91 observable, 11 unobservable go-live items) for scoring, brainstorming, and go-live preparation.

93

1.43x
Quality

94%

Does it follow best practices?

Impact

93%

1.43x

Average score across 21 eval scenarios

Security by Snyk

Advisory

Suggest reviewing before use

evals/scenario-21/task.md

Video Slide Extraction Quality Diagnostics

Problem/Feature Description

Six conference talk recordings have been processed through the video-slide-extraction pipeline. The extraction results vary widely.

Analyze the extraction results and produce a structured diagnostics report for each case.

Setup

Download the fixed extraction results:

curl -sLO https://github.com/jbaruch/speaker-toolkit/raw/main/eval-resources/video-extraction-diagnostics/extraction_results.json

Task

Analyze extraction_results.json (6 recordings, identified as r1 through r6 in the fixture) and produce diagnostics_report.json containing one diagnostic entry per recording.
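
As a starting point, the fixture can be loaded into a list of per-recording records. This is a minimal sketch only: the task does not document the fixture's schema, so the top-level shape handled here (a bare list, or an object wrapping the list under a hypothetical "recordings" key) is an assumption to verify against the downloaded file.

```python
import json
from pathlib import Path


def load_recordings(path):
    """Return a list of per-recording dicts from the fixture file.

    Assumption: the top level is either a list of records or an object
    that wraps that list under a "recordings" key (hypothetical name).
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if isinstance(data, dict):
        data = data.get("recordings", [])
    return list(data)
```

Calling load_recordings("extraction_results.json") should then yield six records if the assumed shape holds; inspect the file first if it returns an empty list.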

For each recording, examine the available extraction metadata (frame counts, unique-frame counts, hash threshold, slide-region detection, transcript source, transcript language, detected vs expected speaker, word counts) and produce:

  • a recording-type classification
  • a dedup quality assessment, with remediations if dedup appears to have failed
  • a transcript quality assessment, including handling of missing transcripts, language-vs-expectation mismatches, and any quality issues you detect in the transcript content
  • a speaker identity check, since the playlist's expected speaker is known
  • a clean pass-through verdict for recordings that show no anomalies
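
One way to approach the dedup assessment is to compare the unique-frame count with the total frame count: if most sampled frames survive as "unique", deduplication probably did not collapse repeated slides. The sketch below is illustrative only. The 0.5 cutoff, the status labels, and the remediation wording are assumptions, not values taken from the toolkit, and it assumes the hash threshold is a maximum distance under which two frames count as the same slide.

```python
def assess_dedup(total_frames, unique_frames, hash_threshold, max_unique_ratio=0.5):
    """Flag dedup as suspect when too many frames survive as unique.

    max_unique_ratio is an illustrative cutoff, not a documented value.
    """
    if total_frames <= 0:
        # Nothing was extracted, so dedup quality cannot be judged.
        return {"status": "no_frames", "remediations": ["re-run frame extraction"]}

    ratio = round(unique_frames / total_frames, 3)
    if ratio > max_unique_ratio:
        return {
            "status": "suspect",
            "unique_ratio": ratio,
            "remediations": [
                # Assumes a higher threshold merges more near-identical frames.
                f"raise the hash threshold above {hash_threshold}",
                "crop to the detected slide region before hashing",
            ],
        }
    return {"status": "ok", "unique_ratio": ratio, "remediations": []}
```

For example, assess_dedup(1000, 900, 8) reports a suspect result with a unique ratio of 0.9, while assess_dedup(1000, 40, 8) passes.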

The expected speaker for this playlist is Baruch Sadogursky. The fixture's recordings include a mix of healthy and unhealthy cases; do not assume which is which from the IDs — derive each verdict from the metadata.
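
The speaker identity check can be sketched as a normalized string comparison against the expected name. The function names and the three result labels below are hypothetical. The comparison is deliberately strict, so abbreviated or reordered names would be reported as a mismatch and may need manual review.

```python
import unicodedata

EXPECTED_SPEAKER = "Baruch Sadogursky"


def normalize_name(name):
    """Case-fold, strip accents, and collapse whitespace for comparison."""
    folded = unicodedata.normalize("NFKD", name or "")
    folded = "".join(ch for ch in folded if not unicodedata.combining(ch))
    return " ".join(folded.casefold().split())


def check_speaker(detected, expected=EXPECTED_SPEAKER):
    """Return "match", "mismatch", or "unknown" (illustrative labels)."""
    if not detected or not detected.strip():
        return "unknown"
    if normalize_name(detected) == normalize_name(expected):
        return "match"
    return "mismatch"
```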

Output Specification

Produce diagnostics_report.json containing a structured per-recording diagnostic record (one entry per recording).

Also produce recommendations_log.txt — a human-readable summary of all flagged issues and recommendations, suitable for a production operator to review.
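
Writing the two output files could look like the sketch below. The task names the files but does not fix their internal schema, so the entry shape used here (an "id" plus a list of "issues" strings) is a hypothetical placeholder for whatever per-recording structure the report ends up using.

```python
import json
from pathlib import Path


def write_outputs(entries, out_dir="."):
    """Write diagnostics_report.json and recommendations_log.txt.

    Each entry is assumed to carry "id" and an optional "issues" list
    (hypothetical field names).
    """
    out = Path(out_dir)
    report = json.dumps(entries, indent=2, ensure_ascii=False) + "\n"
    (out / "diagnostics_report.json").write_text(report, encoding="utf-8")

    lines = []
    for entry in entries:
        issues = entry.get("issues", [])
        if not issues:
            lines.append(f"{entry['id']}: clean, no action needed")
            continue
        lines.append(f"{entry['id']}: {len(issues)} issue(s)")
        lines.extend(f"  - {issue}" for issue in issues)
    (out / "recommendations_log.txt").write_text("\n".join(lines) + "\n", encoding="utf-8")
```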
