Name: catalan-adobe/demo-narrate
Rating: 72 (1 reviews)
Author: catalan-adobe

catalan-adobe/demo-narrate

Use when the user wants to narrate a demo, add voice-over to a screen recording, or create AI narration for a silent video. End-to-end pipeline that extracts frames, analyzes with parallel subagents, writes a word-budgeted voice-over script, generates TTS audio per act, and merges everything back.

Quality

90%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, highly actionable skill with an excellent multi-step workflow that includes proper validation checkpoints, error recovery, and user approval gates. The pipeline is well-sequenced and provides concrete commands, formulas, and decision criteria at every step. The main weakness is moderate verbosity — some explanations could be trimmed and more detail could be pushed to the referenced REFERENCE.md to keep the main file leaner.

Suggestions

Trim the script-locating fallback block — the primary CLAUDE_SKILL_DIR path plus a one-line fallback note would suffice, saving ~10 lines.

Move the detailed rate-fitting algorithm explanation (measurement pass, +3% escalation increments) to REFERENCE.md and keep only the status labels and 'trim if LONG' action in the main file.

Dimension	Reasoning	Score
Conciseness	The skill is fairly long (~300 lines) but most content earns its place with specific commands, formulas, and workflow details. Some sections could be tightened — e.g., the script-locating fallback block is verbose, and explanations like 'changing text is free, re-generating audio costs time' are unnecessary for Claude. The word-budget math is well-presented but slightly over-explained.	2 / 3
Actionability	Highly actionable throughout: every step has concrete, executable bash commands, specific formulas for word budgets (max_audio = act_window - 1s, word_budget = max_audio x 2), exact file formats for timing.txt, and clear subagent prompt templates. The TTS rate-fitting logic and status labels (OK/RATE/LONG) provide precise decision criteria.	3 / 3
Workflow Clarity	The 8-step pipeline is clearly sequenced with explicit validation checkpoints: dependency checking (Step 0), dry-run budget verification (Step 4c), user approval before audio generation (Step 4f), LONG status detection with retry instructions (Step 5), ffprobe verification of output (Step 7), and error recovery for failed subagents (Step 3). Feedback loops are well-defined throughout.	3 / 3
Progressive Disclosure	References to REFERENCE.md for voice options and command details are well-signaled, but the main SKILL.md is quite long with substantial inline detail that could potentially be split out (e.g., the rate-fitting algorithm details, the script-locating fallback). However, without bundle files provided to verify the reference structure, and given that most inline content is directly needed for execution, this is adequate but not optimal.	2 / 3
	Total	10 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly communicates a specific, well-defined pipeline for adding AI narration to videos. It opens with an explicit 'Use when' clause containing natural trigger terms, then describes the concrete end-to-end workflow. The description is concise yet comprehensive, making it easy to distinguish from other skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: extracts frames, analyzes with parallel subagents, writes a word-budgeted voice-over script, generates TTS audio per act, and merges everything back. These are detailed, concrete pipeline steps.	3 / 3
Completeness	Clearly answers both 'what' (end-to-end pipeline that extracts frames, analyzes, writes scripts, generates TTS, merges) and 'when' (explicit 'Use when' clause covering demo narration, voice-over for screen recordings, AI narration for silent videos).	3 / 3
Trigger Term Quality	Includes strong natural trigger terms users would say: 'narrate a demo', 'voice-over', 'screen recording', 'AI narration', 'silent video', 'TTS audio'. These cover multiple natural phrasings a user might use.	3 / 3
Distinctiveness Conflict Risk	Highly distinctive niche combining video processing with voice-over/narration generation. The specific pipeline (frame extraction → analysis → script writing → TTS → merging) is unlikely to conflict with other skills like generic video editing or text-to-speech alone.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

about 1 month ago

Table of Contents

Discovery Implementation Validation