Generates professional slide deck images from content. Creates outlines with style instructions, then generates individual slide images. Use when user asks to "create slides", "make a presentation", "generate deck", "slide deck", or "PPT".
Transform content into professional slide deck images. The deck is designed for reading and sharing (self-explanatory slides, logical scroll flow, social-media-friendly) rather than live presentation — that assumption drives every layout and density decision below.
When this skill prompts the user, follow this tool-selection rule (priority order):
- `AskUserQuestion`, `request_user_input`, `clarify`, `ask_user`, or any equivalent.

Concrete `AskUserQuestion` references below are examples; substitute the local equivalent in other runtimes.
When this skill needs to render an image, resolve the backend in this order:
1. If EXTEND.md sets `preferred_image_backend` to a backend available right now, use it.
2. Otherwise (`auto`, unset, or the pinned backend isn't available):
   - If the runtime exposes a native image tool (e.g., `imagegen`, Hermes `image_generate`), use it. Runtime-native tools are preferred by default; agents that know their own tool inventory should surface the native one here.
   - Else if exactly one non-native backend is installed (e.g., `baoyu-imagine`), use it.
3. If multiple non-native backends are present, ask the user which one to use.

Setting `preferred_image_backend: ask` forces the step-3 prompt every run regardless of available backends. Users change the pinned backend via the ## Changing Preferences section below.
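The resolution order above can be sketched as a small pure function. This is only an illustration: `resolveImageBackend`, `BackendInventory`, and `Resolution` are hypothetical names, and the behavior when no backend is available at all (returning an `ask` with no candidates) is an assumption, since the skill text does not cover that case.

```typescript
// Hypothetical sketch of the backend resolution order; not the skill's actual code.
interface BackendInventory {
  native: string[];    // runtime-native tools, e.g. ["imagegen"]
  installed: string[]; // non-native backends, e.g. ["baoyu-imagine"]
}

type Resolution =
  | { kind: "use"; backend: string }
  | { kind: "ask"; candidates: string[] };

function resolveImageBackend(
  pref: string | undefined,
  inv: BackendInventory,
): Resolution {
  const all = [...inv.native, ...inv.installed];

  // `ask` forces the prompt on every run, whatever is available.
  if (pref === "ask") return { kind: "ask", candidates: all };

  // 1. A pinned backend wins if it is available right now.
  if (pref !== undefined && pref !== "auto" && all.includes(pref)) {
    return { kind: "use", backend: pref };
  }

  // 2. auto, unset, or pinned-but-unavailable: prefer a runtime-native tool...
  const [native] = inv.native;
  if (native !== undefined) return { kind: "use", backend: native };

  // ...then the only installed non-native backend.
  const [only] = inv.installed;
  if (inv.installed.length === 1 && only !== undefined) {
    return { kind: "use", backend: only };
  }

  // 3. Several candidates (or, by assumption, none): ask the user.
  return { kind: "ask", candidates: inv.installed };
}
```

Returning a tagged result rather than prompting inside the function keeps the decision testable and leaves the actual question to whichever user-input tool the runtime provides.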
Prompt file requirement (hard): write each image's full, final prompt to a standalone file under prompts/ (naming: NN-slide-[slug].md) BEFORE invoking any backend. The file is the reproducibility record and lets you switch backends without regenerating prompts.
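As a minimal sketch of the `NN-slide-[slug].md` naming, the helpers below build the prompt path and the matching image name from a 1-based slide index. The function names are hypothetical, and two-digit zero padding is an assumption drawn from the `NN` placeholder.

```typescript
// Hypothetical helpers for the NN-slide-{slug} naming convention.
function slideNumber(index: number): string {
  if (!Number.isInteger(index) || index < 1) {
    throw new RangeError(`Slide index must be a positive integer, got ${index}`);
  }
  // Assumption: NN means zero-padded to two digits (01, 02, ..., 30).
  return String(index).padStart(2, "0");
}

// Path of the prompt file that must exist before any backend is invoked.
function promptPath(index: number, slug: string): string {
  return `prompts/${slideNumber(index)}-slide-${slug}.md`;
}

// Name of the image generated from that prompt.
function imageName(index: number, slug: string): string {
  return `${slideNumber(index)}-slide-${slug}.png`;
}
```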
Concrete tool names (imagegen, image_generate, baoyu-imagine) above are examples — substitute the local equivalents under the same rule.
Default behavior: confirm before generation.
Treat EXTEND.md defaults as recommendation inputs only. None of them authorizes skipping confirmation.

Respond in the user's language across questions, progress reports, error messages, and the completion summary. Keep technical tokens (style names, file paths, code) in English.
{baseDir} = this SKILL.md's directory. Resolve ${BUN_X}: prefer bun; else npx -y bun; else suggest brew install oven-sh/bun/bun.
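The `${BUN_X}` fallback chain can be expressed as a short function. This is a sketch under assumptions: `resolveBunX` is a hypothetical name, the command lookup is injected as `hasCommand` so the logic stays independent of any particular shell or runtime, and "else `npx -y bun`" is read as "if `npx` is on the PATH".

```typescript
// Hypothetical sketch: pick the command prefix used to run the merge scripts.
// `hasCommand` is supplied by the caller (for example, a PATH lookup).
function resolveBunX(hasCommand: (name: string) => boolean): string | null {
  if (hasCommand("bun")) return "bun";        // preferred
  if (hasCommand("npx")) return "npx -y bun"; // fallback through npx
  // Neither is available: the caller should suggest
  // `brew install oven-sh/bun/bun` instead of running anything.
  return null;
}
```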
| Script | Purpose |
|---|---|
| `scripts/merge-to-pptx.ts` | Merge slides into PowerPoint |
| `scripts/merge-to-pdf.ts` | Merge slides into PDF |
| Option | Description |
|---|---|
| `--style <name>` | Preset (see Presets below), custom, or custom style name |
| `--audience <type>` | beginners / intermediate / experts / executives / general |
| `--lang <code>` | Output language (en, zh, ja, ...) |
| `--slides <N>` | Target slide count (8-25 recommended, max 30) |
| `--ref <files...>` | Reference images applied per slide (style / palette / composition / subject) |
| `--outline-only` | Stop after outline |
| `--prompts-only` | Stop after prompts (skip image generation) |
| `--images-only` | Skip to Step 7; requires existing prompts/ |
| `--regenerate <N>` | Regenerate specific slide(s): 3 or 2,5,8 |
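The `--regenerate` value accepts either a single number or a comma-separated list. A possible parser is sketched below; `parseRegenerate` is a hypothetical helper, and rejecting out-of-range numbers, deduplicating, and sorting are assumptions rather than documented behavior.

```typescript
// Hypothetical parser for values such as "3" or "2,5,8".
function parseRegenerate(value: string, totalSlides: number): number[] {
  const numbers = value.split(",").map((part) => {
    const token = part.trim();
    if (!/^\d+$/.test(token)) {
      throw new Error(`Invalid slide number: "${part}"`);
    }
    return Number(token);
  });

  for (const n of numbers) {
    if (n < 1 || n > totalSlides) {
      throw new RangeError(`Slide ${n} is outside 1..${totalSlides}`);
    }
  }

  // Assumption: duplicates are collapsed and slides are processed in order.
  return Array.from(new Set(numbers)).sort((a, b) => a - b);
}
```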
17 presets covering technical / educational / lifestyle / editorial use cases. Every preset is a combination of four dimensions (texture / mood / typography / density). If the user picks "Custom dimensions" in Round 1, Round 2 of the confirmation asks one question per dimension — options and verbatim copy live in references/confirmation.md.
| Preset | Dimensions | Best For |
|---|---|---|
| `blueprint` (Default) | grid + cool + technical + balanced | Architecture, system design |
| `chalkboard` | organic + warm + handwritten + balanced | Education, tutorials |
| `corporate` | clean + professional + geometric + balanced | Investor decks, proposals |
| `minimal` | clean + neutral + geometric + minimal | Executive briefings |
| `sketch-notes` | organic + warm + handwritten + balanced | Educational, tutorials |
| `hand-drawn-edu` | organic + macaron + handwritten + balanced | Educational diagrams, process explainers |
| `watercolor` | organic + warm + humanist + minimal | Lifestyle, wellness |
| `dark-atmospheric` | clean + dark + editorial + balanced | Entertainment, gaming |
| `notion` | clean + neutral + geometric + dense | Product demos, SaaS |
| `bold-editorial` | clean + vibrant + editorial + balanced | Product launches, keynotes |
| `editorial-infographic` | clean + cool + editorial + dense | Tech explainers, research |
| `fantasy-animation` | organic + vibrant + handwritten + minimal | Educational storytelling |
| `intuition-machine` | clean + cool + technical + dense | Technical docs, academic |
| `pixel-art` | pixel + vibrant + technical + balanced | Gaming, developer talks |
| `scientific` | clean + cool + technical + dense | Biology, chemistry, medical |
| `vector-illustration` | clean + vibrant + humanist + balanced | Creative, children's content |
| `vintage` | paper + warm + editorial + balanced | Historical, heritage |
Per-preset specs: references/styles/<preset>.md. Preset → dimension mapping: references/dimensions/presets.md.
| Dimension | Options | Purpose |
|---|---|---|
| Texture | clean, grid, organic, pixel, paper | Background treatment |
| Mood | professional, warm, cool, vibrant, dark, neutral, macaron | Color temperature |
| Typography | geometric, humanist, handwritten, editorial, technical | Headline/body styling |
| Density | minimal, balanced, dense | Information per slide |
Full per-dimension specs: references/dimensions/*.md.
Match content signals to a preset. Pick the first row whose signal keywords appear in the source; fall back to blueprint if nothing matches.
| Signals in source | Preset |
|---|---|
| tutorial, learn, education, guide, beginner | sketch-notes |
| hand-drawn, infographic, diagram, process, onboarding | hand-drawn-edu |
| classroom, teaching, school, chalkboard | chalkboard |
| architecture, system, data, analysis, technical | blueprint |
| creative, children, kids, cute | vector-illustration |
| briefing, academic, research, bilingual | intuition-machine |
| executive, minimal, clean, simple | minimal |
| saas, product, dashboard, metrics | notion |
| investor, quarterly, business, corporate | corporate |
| launch, marketing, keynote, magazine | bold-editorial |
| entertainment, music, gaming, atmospheric | dark-atmospheric |
| explainer, journalism, science communication | editorial-infographic |
| story, fantasy, animation, magical | fantasy-animation |
| gaming, retro, pixel, developer | pixel-art |
| biology, chemistry, medical, scientific | scientific |
| history, heritage, vintage, expedition | vintage |
| lifestyle, wellness, travel, artistic | watercolor |
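The first-match rule over the signal table can be sketched as follows. The rows are copied from the table in order; `pickPreset` is a hypothetical name, and matching on whole words, case-insensitively, is an assumption (so "learning" would not trigger the `learn` keyword in this sketch).

```typescript
// Signal rows in table order; earlier rows win when several match.
const SIGNAL_ROWS: ReadonlyArray<readonly [string, readonly string[]]> = [
  ["sketch-notes", ["tutorial", "learn", "education", "guide", "beginner"]],
  ["hand-drawn-edu", ["hand-drawn", "infographic", "diagram", "process", "onboarding"]],
  ["chalkboard", ["classroom", "teaching", "school", "chalkboard"]],
  ["blueprint", ["architecture", "system", "data", "analysis", "technical"]],
  ["vector-illustration", ["creative", "children", "kids", "cute"]],
  ["intuition-machine", ["briefing", "academic", "research", "bilingual"]],
  ["minimal", ["executive", "minimal", "clean", "simple"]],
  ["notion", ["saas", "product", "dashboard", "metrics"]],
  ["corporate", ["investor", "quarterly", "business", "corporate"]],
  ["bold-editorial", ["launch", "marketing", "keynote", "magazine"]],
  ["dark-atmospheric", ["entertainment", "music", "gaming", "atmospheric"]],
  ["editorial-infographic", ["explainer", "journalism", "science communication"]],
  ["fantasy-animation", ["story", "fantasy", "animation", "magical"]],
  ["pixel-art", ["gaming", "retro", "pixel", "developer"]],
  ["scientific", ["biology", "chemistry", "medical", "scientific"]],
  ["vintage", ["history", "heritage", "vintage", "expedition"]],
  ["watercolor", ["lifestyle", "wellness", "travel", "artistic"]],
];

// Hypothetical matcher: first row with any whole-word keyword hit wins.
function pickPreset(source: string): string {
  const text = source.toLowerCase();
  for (const [preset, keywords] of SIGNAL_ROWS) {
    if (keywords.some((k) => new RegExp(`\\b${k}\\b`).test(text))) {
      return preset;
    }
  }
  return "blueprint"; // documented fallback when nothing matches
}
```

Because order decides ties, a source that mentions both "retro" and "gaming" resolves to `dark-atmospheric` here: its row lists `gaming` and sits above `pixel-art`.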
| Source length | Recommended slides |
|---|---|
| < 1000 words | 5-10 |
| 1000-3000 words | 10-18 |
| 3000-5000 words | 15-25 |
| > 5000 words | 20-30 (consider splitting) |
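The length heuristic maps directly onto a lookup. In this sketch `recommendSlides` and `countWords` are hypothetical names; the table's ranges touch at 3000 words, so assigning exactly 3000 to the second band and exactly 5000 to the third is an assumption. Whitespace-based word counting is also only a rough proxy and does not suit languages written without spaces, such as Chinese or Japanese.

```typescript
interface SlideRange {
  min: number;
  max: number;
  considerSplitting: boolean; // true only for sources above 5000 words
}

// Rough word count: whitespace-separated tokens.
function countWords(text: string): number {
  const trimmed = text.trim();
  return trimmed === "" ? 0 : trimmed.split(/\s+/).length;
}

// Hypothetical mapping from source length to the recommended slide range.
function recommendSlides(wordCount: number): SlideRange {
  if (wordCount < 1000) return { min: 5, max: 10, considerSplitting: false };
  if (wordCount <= 3000) return { min: 10, max: 18, considerSplitting: false };
  if (wordCount <= 5000) return { min: 15, max: 25, considerSplitting: false };
  return { min: 20, max: 30, considerSplitting: true };
}
```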
Users may supply reference images to guide style, palette, layout, or subject.
Intake: Accept via --ref <files...> or when the user provides file paths / pastes images in conversation.
- `{slide-deck-dir}/refs/NN-ref-{slug}.{ext}`

Usage modes (per reference):
| Usage | Effect |
|---|---|
| `direct` | Pass the file to the backend as a reference image for each slide |
| `style` | Extract style traits (line treatment, texture, mood) and append to every slide's prompt body |
| `palette` | Extract hex colors and append to every slide's prompt body |
Record refs in each slide's prompt frontmatter:
```yaml
references:
  - ref_id: 01
    filename: 01-ref-brand.png
    usage: direct
```

At generation time, verify files exist. If `usage: direct` and the backend accepts refs (e.g., `baoyu-imagine --ref`), pass the file on every slide. Otherwise embed extracted style/palette traits in the prompt text.
```
slide-deck/{topic-slug}/
├── source-{slug}.{ext}
├── outline.md
├── prompts/NN-slide-{slug}.md
├── NN-slide-{slug}.png
├── {topic-slug}.pptx
└── {topic-slug}.pdf
```

Slug: 2-4 words, kebab-case, extracted from topic. "Introduction to Machine Learning" → `intro-machine-learning`.
Backup rule (applies across steps): if a file about to be written already exists, rename it to <name>-backup-YYYYMMDD-HHMMSS.<ext> before writing the new one. This protects user edits and enables rollback.
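A small helper can compute the backup name from the pattern above. `backupName` is a hypothetical function that takes a bare file name (no directory part) and uses local time; the skill text does not say which time zone the timestamp uses, so that part is an assumption.

```typescript
// Hypothetical helper: "outline.md" -> "outline-backup-20240105-090307.md".
function backupName(fileName: string, now: Date): string {
  const pad = (n: number, width = 2): string => String(n).padStart(width, "0");

  // YYYYMMDD-HHMMSS in local time (assumption).
  const stamp =
    `${pad(now.getFullYear(), 4)}${pad(now.getMonth() + 1)}${pad(now.getDate())}` +
    `-${pad(now.getHours())}${pad(now.getMinutes())}${pad(now.getSeconds())}`;

  // A dot at position 0 marks a dotfile, not an extension.
  const dot = fileName.lastIndexOf(".");
  if (dot <= 0) return `${fileName}-backup-${stamp}`;

  return `${fileName.slice(0, dot)}-backup-${stamp}${fileName.slice(dot)}`;
}
```

The caller would rename the existing file to this name first and only then write the new content, so a failed write never destroys the previous version.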
Copy this checklist and check off items as you complete them:
- [ ] Step 1: Setup & analyze
- [ ] Step 2: Confirmation ⚠️ REQUIRED (Round 1; Round 2 only if "Custom dimensions")
- [ ] Step 3: Generate outline
- [ ] Step 4: Review outline (conditional)
- [ ] Step 5: Generate prompts
- [ ] Step 6: Review prompts (conditional)
- [ ] Step 7: Generate images
- [ ] Step 8: Merge to PPTX/PDF
- [ ] Step 9: Output summary

1.1 Load EXTEND.md: check these paths in order; first hit wins:
| Path | Scope |
|---|---|
| `.baoyu-skills/baoyu-slide-deck/EXTEND.md` | Project |
| `${XDG_CONFIG_HOME:-$HOME/.config}/baoyu-skills/baoyu-slide-deck/EXTEND.md` | XDG |
| `$HOME/.baoyu-skills/baoyu-slide-deck/EXTEND.md` | User home |
If found, read, parse, and print a summary (style / audience / language / review). If not, proceed with defaults — first-time setup is not blocking for this skill. Schema: references/config/preferences-schema.md.
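The lookup order for EXTEND.md can be sketched as a list of candidate paths plus a first-hit search. Both function names are hypothetical, the environment and the existence check are passed in so the sketch makes no file-system calls of its own, and the project path is left relative to the working directory.

```typescript
// Hypothetical: candidate EXTEND.md paths in priority order.
function extendMdCandidates(env: Record<string, string | undefined>): string[] {
  const home = env["HOME"] ?? "";
  // Mirrors ${XDG_CONFIG_HOME:-$HOME/.config}: unset or empty falls back.
  const xdg = env["XDG_CONFIG_HOME"] || `${home}/.config`;
  return [
    ".baoyu-skills/baoyu-slide-deck/EXTEND.md",            // project
    `${xdg}/baoyu-skills/baoyu-slide-deck/EXTEND.md`,      // XDG
    `${home}/.baoyu-skills/baoyu-slide-deck/EXTEND.md`,    // user home
  ];
}

// First hit wins; undefined means "proceed with defaults".
function firstExisting(
  paths: string[],
  exists: (path: string) => boolean,
): string | undefined {
  return paths.find(exists);
}
```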
1.2 Analyze content: follow references/analysis-framework.md to classify content, detect language, note signals for style selection, estimate slide count from length (see the Slide Count Heuristic in Style System above), and generate the topic slug. Save source as source.md (honor the backup rule if the file already exists).
1.3 Check existing output ⚠️ REQUIRED before Step 2. If slide-deck/{topic-slug}/ exists, ask how to proceed — four options (regenerate outline / regenerate images / backup and regenerate / exit), verbatim copy in references/confirmation.md.
Save findings to analysis.md: topic, audience, signals, recommended style and slide count, language detection.
Hard gate: this step is mandatory per the Confirmation Policy. Steps 3+ cannot start until the user confirms here (or explicitly opts out with "直接生成" ("generate directly") or equivalent wording in the current request).
Round 1 (always) — batch five questions in one AskUserQuestion call: style, audience, slide count, review-outline?, review-prompts?. Verbatim options in references/confirmation.md.
Summary displayed before the questions:
Round 2 (only if "Custom dimensions" in Round 1) — batch four questions: texture, mood, typography, density. Verbatim options in references/confirmation.md. The four answers replace the preset.
After confirmation: update analysis.md with final choices and store skip_outline_review / skip_prompt_review flags from Q4/Q5.
Resolve style: preset → references/styles/{preset}.md; custom dimensions → combine files in references/dimensions/. Build STYLE_INSTRUCTIONS from the resolved style, apply confirmed audience + language + slide count, follow references/outline-template.md, and save as outline.md.
Stop here if --outline-only. Skip Step 4 if skip_outline_review.
Display a slide-by-slide table (# | Title | Type | Layout) along with total count and resolved style. Ask: proceed / edit outline first / regenerate — verbatim in references/confirmation.md.
On "Edit outline first", tell the user to edit outline.md and ask again when ready. On "Regenerate outline", return to Step 3.
For each slide in outline:
1. Start from references/base-prompt.md
2. Add STYLE_INSTRUCTIONS from the outline (don't re-read the style file)
3. If `Layout:` is specified, include guidance from references/layouts.md
4. Save to prompts/NN-slide-{slug}.md (backup rule applies)

Stop here if --prompts-only. Skip Step 6 if skip_prompt_review.
Display the prompts index (# | Filename | Slide Title) and ask: proceed / edit prompts first / regenerate — verbatim in references/confirmation.md. Branches mirror Step 4.
1. Verify that prompts/NN-slide-{slug}.md exists for every slide (hard requirement; prompt files are the reproducibility record regardless of backend).
2. Use session ID slides-{topic-slug}-{timestamp}; pass it to the backend only if it supports sessions.
3. Generate each slide in order and report progress as "Generated X/N". Auto-retry once on failure before reporting an error.

--regenerate N jumps to this step for the named slides only. --images-only starts here with existing prompts.
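The retry-once and progress behavior of this step can be sketched as a sequential loop. Everything here is hypothetical: `generateAll` is an illustrative name, and `generate` stands in for whichever backend call the runtime actually provides. Continuing to the next slide after a second failure, instead of aborting the run, is an assumption.

```typescript
// Hypothetical backend call: resolves when the image for this prompt is written.
type Generate = (promptFile: string) => Promise<void>;

// Generates slides in order, retrying each failure once.
// Returns the prompt files that still failed after the retry.
async function generateAll(
  promptFiles: string[],
  generate: Generate,
  report: (line: string) => void,
): Promise<string[]> {
  const failed: string[] = [];
  let done = 0;

  for (const file of promptFiles) {
    try {
      await generate(file);
    } catch {
      try {
        await generate(file); // single automatic retry
      } catch {
        failed.push(file);    // report the error only after the retry fails
        continue;
      }
    }
    done += 1;
    report(`Generated ${done}/${promptFiles.length}`);
  }

  return failed;
}
```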
```bash
${BUN_X} {baseDir}/scripts/merge-to-pptx.ts <slide-deck-dir>
${BUN_X} {baseDir}/scripts/merge-to-pdf.ts <slide-deck-dir>
```

```
Slide Deck Complete!

Topic: [topic]
Style: [preset or "custom: texture+mood+typography+density"]
Location: [directory]
Slides: N

- 01-slide-cover.png
- ...
- NN-slide-back-cover.png

Outline: outline.md
PPTX: {topic-slug}.pptx
PDF: {topic-slug}.pdf
```

| Action | How |
|---|---|
| Edit | Update prompts/NN-slide-{slug}.md first, then --regenerate N |
| Add | Create new prompt at position, generate image, renumber subsequent NN (slugs unchanged), update outline.md, re-merge |
| Delete | Remove PNG + prompt, renumber subsequent, update outline.md, re-merge |
Always update the prompt file before regenerating the image — this keeps the prompts directory as the source of truth and makes changes reproducible. Only NN changes on renumber; slugs stay stable so references remain valid.
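The renumbering rule (only `NN` changes, slugs stay put) can be sketched as a function that computes a rename plan from the ordered list of remaining slide files. `renumber` is a hypothetical helper; it only plans the renames and leaves applying them, in an order that avoids overwriting existing files, to the caller.

```typescript
interface Rename {
  from: string;
  to: string;
}

// Hypothetical: given slide file names in their new order, return the
// renames needed so numbering runs 01, 02, ... with slugs unchanged.
function renumber(files: string[]): Rename[] {
  const renames: Rename[] = [];

  files.forEach((name, i) => {
    const match = /^(\d+)-slide-(.+)$/.exec(name);
    if (match === null) {
      throw new Error(`Not a slide file name: "${name}"`);
    }
    // Keep everything after "NN-slide-" (slug and extension) untouched.
    const to = `${String(i + 1).padStart(2, "0")}-slide-${match[2]}`;
    if (to !== name) renames.push({ from: name, to });
  });

  return renames;
}
```

The same plan applies to both the PNG files and the files under prompts/, since they share the `NN-slide-{slug}` stem.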
See references/modification-guide.md for full details.
| File | Content |
|---|---|
| `references/confirmation.md` | Verbatim AskUserQuestion option copy for every confirmation |
| `references/analysis-framework.md` | Content analysis framework |
| `references/outline-template.md` | Outline structure |
| `references/base-prompt.md` | Base prompt body for image generation |
| `references/layouts.md` | Layout options |
| `references/design-guidelines.md` | Audience, typography, color selection |
| `references/content-rules.md` | Content guidelines |
| `references/modification-guide.md` | Edit/add/delete workflows |
| `references/styles/<preset>.md` | Per-preset specifications |
| `references/dimensions/*.md` | Per-dimension specifications |
| `references/config/preferences-schema.md` | EXTEND.md schema |
EXTEND.md lives at the first matching path listed in Step 1.1. Two ways to change it:
- Schema: references/config/preferences-schema.md.
- `preferred_image_backend: auto`: default; runtime-native tool wins, falls back to the only installed backend, asks only if multiple non-native are present.
- `preferred_image_backend: codex-imagegen`: pin to Codex's built-in.
- `preferred_image_backend: baoyu-imagine`: pin to the baoyu-imagine skill.
- `preferred_image_backend: ask`: confirm backend every run.
- `preferred_style: blueprint`, `preferred_audience: experts`, `language: zh`.