CtrlK
BlogDocsLog inGet started
Tessl Logo

catalan-adobe/demo-narrate

Use when the user wants to narrate a demo, add voice-over to a screen recording, or create AI narration for a silent video. End-to-end pipeline that extracts frames, analyzes with parallel subagents, writes a word-budgeted voice-over script, generates TTS audio per act, and merges everything back.

74

Quality

93%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly communicates a specific, well-defined pipeline for adding AI narration to videos. It opens with an explicit 'Use when' clause containing natural trigger terms, then describes the concrete end-to-end workflow. The description is concise yet comprehensive, making it easy to distinguish from other skills.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: extracts frames, analyzes with parallel subagents, writes a word-budgeted voice-over script, generates TTS audio per act, and merges everything back. These are detailed, concrete pipeline steps.

3 / 3

Completeness

Clearly answers both 'what' (end-to-end pipeline that extracts frames, analyzes, writes scripts, generates TTS, merges) and 'when' (explicit 'Use when' clause covering demo narration, voice-over for screen recordings, AI narration for silent videos).

3 / 3

Trigger Term Quality

Includes strong natural trigger terms users would say: 'narrate a demo', 'voice-over', 'screen recording', 'AI narration', 'silent video', 'TTS audio'. These cover multiple natural phrasings a user might use.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche combining video processing with voice-over/narration generation. The specific pipeline (frame extraction → analysis → script writing → TTS → merging) is unlikely to conflict with other skills like generic video editing or text-to-speech alone.

3 / 3

Total

12

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, highly actionable skill with an excellent multi-step workflow that includes proper validation checkpoints, error recovery, and user approval gates. The pipeline is well-sequenced and provides concrete commands, formulas, and decision criteria at every step. The main weakness is moderate verbosity — some explanations could be trimmed and more detail could be pushed to the referenced REFERENCE.md to keep the main file leaner.

Suggestions

Trim the script-locating fallback block — the primary CLAUDE_SKILL_DIR path plus a one-line fallback note would suffice, saving ~10 lines.

Move the detailed rate-fitting algorithm explanation (measurement pass, +3% escalation increments) to REFERENCE.md and keep only the status labels and 'trim if LONG' action in the main file.

DimensionReasoningScore

Conciseness

The skill is fairly long (~300 lines) but most content earns its place with specific commands, formulas, and workflow details. Some sections could be tightened — e.g., the script-locating fallback block is verbose, and explanations like 'changing text is free, re-generating audio costs time' are unnecessary for Claude. The word-budget math is well-presented but slightly over-explained.

2 / 3

Actionability

Highly actionable throughout: every step has concrete, executable bash commands, specific formulas for word budgets (max_audio = act_window - 1s, word_budget = max_audio x 2), exact file formats for timing.txt, and clear subagent prompt templates. The TTS rate-fitting logic and status labels (OK/RATE/LONG) provide precise decision criteria.

3 / 3

Workflow Clarity

The 8-step pipeline is clearly sequenced with explicit validation checkpoints: dependency checking (Step 0), dry-run budget verification (Step 4c), user approval before audio generation (Step 4f), LONG status detection with retry instructions (Step 5), ffprobe verification of output (Step 7), and error recovery for failed subagents (Step 3). Feedback loops are well-defined throughout.

3 / 3

Progressive Disclosure

References to REFERENCE.md for voice options and command details are well-signaled, but the main SKILL.md is quite long with substantial inline detail that could potentially be split out (e.g., the rate-fitting algorithm details, the script-locating fallback). However, without bundle files provided to verify the reference structure, and given that most inline content is directly needed for execution, this is adequate but not optimal.

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents