Two-skill presentation system: analyze your speaking style into a rhetoric knowledge vault, then create new presentations that match your documented patterns. Includes an 88-entry Presentation Patterns taxonomy for scoring, brainstorming, and go-live preparation.
96
Quality
96%
Does it follow best practices?
Impact
96%
1.57xAverage score across 15 eval scenarios
{
"context": "Tests whether the agent produces a complete rhetorical review: a guardrail check covering all 9 required items with correct PASS/FAIL/WARN labeling (including WARN at the 5%-boundary), plus a 4-tier Pattern Strategy with antipatterns labeled [RECURRING] or [CONTEXTUAL] and a pattern score projection.",
"type": "weighted_checklist",
"checklist": [
{
"name": "All 9 guardrail items present",
"description": "The review output contains a guardrail check section with at least 9 distinct labeled check items covering: slide budget, Act 1 ratio, branding/conference elements, profanity, data attribution, time-sensitive content, closing completeness, cut lines, and anti-patterns — each as a separate line or block, not lumped into prose",
"max_score": 15
},
{
"name": "PASS/FAIL/WARN labels used",
"description": "Guardrail check items use the literal labels [PASS], [FAIL], or [WARN] (or PASS/FAIL/WARN in equivalent format) — not just descriptive prose without labels",
"max_score": 8
},
{
"name": "WARN for near-limit Act 1",
"description": "The Act 1 ratio check produces [WARN] (not [PASS] and not [FAIL]) because the ratio is within 5% of the configured limit — specifically Act 1 is slides 8-33 (26 slides out of 60 total = 43.3%), which is under the 45% limit but within the 5-percentage-point warn threshold",
"max_score": 10
},
{
"name": "4-tier Pattern Strategy present",
"description": "The review output includes a Pattern Strategy section organized into exactly four labeled tiers: Signature (YOUR TOOLKIT or equivalent), Contextual (WORTH CONSIDERING or equivalent), New to You (NEW TO YOU or equivalent), and Shake It Up (SHAKE IT UP / WILD CARD or equivalent)",
"max_score": 15
},
{
"name": "Shake It Up has 1-2 items",
"description": "The Shake It Up / Wild Card tier contains exactly 1 or 2 pattern suggestions — not 0 and not 3 or more",
"max_score": 8
},
{
"name": "[RECURRING] tag used",
"description": "At least one antipattern detection is labeled [RECURRING] (matching a pattern in the speaker profile's antipattern_frequency history), not just listed as a generic warning",
"max_score": 10
},
{
"name": "[CONTEXTUAL] tag used",
"description": "At least one antipattern detection is labeled [CONTEXTUAL] (detected in the current outline but not in the speaker's historical profile), not just listed as a generic warning",
"max_score": 10
},
{
"name": "Pattern score projection",
"description": "The review includes a pattern score projection or estimated score (e.g., 'Pattern score projection: ~8') — not absent or replaced only by a list",
"max_score": 8
},
{
"name": "Profile slide budgets used",
"description": "The slide budget check uses the value from the speaker profile's guardrail_sources.slide_budgets (68 max for a 45-minute talk) — not the generic default of 1.5 × duration or a hardcoded number different from the profile",
"max_score": 8
},
{
"name": "Recurring issue flagged",
"description": "The anti-patterns section references at least one of the speaker's known recurring_issues from the profile (specifically 'Act 1 overrun' or 'meme accretion') — not just generic antipatterns",
"max_score": 8
}
]
}Install with Tessl CLI
npx tessl i jbaruch/speaker-toolkit@0.6.2evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
skills
presentation-creator
references
patterns
build
deliver
prepare
rhetoric-knowledge-vault