Architect, build, and debug Kafka Streams apps (JVM-embedded stream processing). Use when user mentions KStream, KTable, topology, TopologyTestDriver, StreamsBuilder, interactive queries, GlobalKTable, joins/windows/aggregations, or debugging issues (rebalancing, state stores, lag, deserialization errors). Also use when user wants to optimize Kafka Streams for WarpStream or tune Kafka Streams client configuration for WarpStream. Do NOT trigger for Flink, connectors, CDC, or plain producer/consumer.
75
92%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that hits all the marks. It provides specific concrete actions, comprehensive trigger terms covering both class names and debugging scenarios, explicit 'Use when' and 'Do NOT trigger' clauses, and clear boundaries that distinguish it from related Kafka or stream processing skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'Architect, build, and debug Kafka Streams apps', mentions specific concepts like JVM-embedded stream processing, joins/windows/aggregations, interactive queries, state stores, and optimization for WarpStream. | 3 / 3 |
Completeness | Clearly answers both 'what' (architect, build, debug Kafka Streams apps) and 'when' with an explicit 'Use when...' clause listing specific trigger scenarios. Also includes a 'Do NOT trigger' clause for exclusions, which adds further clarity. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms users would say: KStream, KTable, topology, TopologyTestDriver, StreamsBuilder, interactive queries, GlobalKTable, joins, windows, aggregations, rebalancing, state stores, lag, deserialization errors, WarpStream. Also includes negative triggers (Flink, connectors, CDC) to reduce false matches. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche (Kafka Streams specifically, not general Kafka). The explicit exclusion of Flink, connectors, CDC, and plain producer/consumer significantly reduces conflict risk with adjacent skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
85%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a high-quality, well-architected skill that excels at actionability, workflow clarity, and progressive disclosure. The three-mode structure (Architect/Build/Debug) with clear routing is effective, and the invariant checklist provides strong guardrails. The main weakness is moderate verbosity — particularly around WarpStream-specific guidance that's repeated across sections and the lengthy Build Mode Step 6 environment branching — though this verbosity does serve clarity for a genuinely complex domain.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is well-structured and avoids explaining basic Kafka concepts, but it's quite lengthy (~250 lines) with some repetition — e.g., WarpStream caveats are restated across the preamble, Build Mode Step 6, and Debug Mode. The lazy-load preamble and environment confirmation sections add useful but somewhat verbose framing. Some sections (like the full Build Mode Step 6 branching for CC/WarpStream/Local) could be tightened. | 2 / 3 |
Actionability | The skill provides highly concrete, actionable guidance: specific commands (docker compose up, gradle wrapper, gradlew run), exact log strings to look for ('State transition from REBALANCING to RUNNING'), specific config properties, exact serde class names, and clear decision trees. The invariant checklist is particularly actionable with specific property names and patterns. Code generation steps are explicit about what files to produce. | 3 / 3 |
Workflow Clarity | Each mode (Architect, Build, Debug) has clearly numbered steps with explicit sequencing. Build Mode Step 6 is exemplary — it includes validation checkpoints (confirm RUNNING state within ~30s), error recovery loops (read stack trace → diagnose via debugging.md → fix → restart → re-verify), and honest handling of cases where verification isn't possible (CC/WarpStream without creds). The Debug Mode has a clear classify → gather → diagnose flow with a symptom-to-category routing table. | 3 / 3 |
Progressive Disclosure | The skill is an excellent overview that consistently points to specific sections of reference files on-demand (e.g., 'references/topology-patterns.md § Joins Decision Tree'). The lazy-load preamble with concrete examples is a strong pattern. References are one level deep, clearly signaled with section anchors, and the full reference index at the bottom provides easy navigation. Content is appropriately split between the main skill and 11 reference files. | 3 / 3 |
Total | 11 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
8b85616
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.