CtrlK
BlogDocsLog inGet started
Tessl Logo

golikovichev/pytest-conversational

Test chat bots, voice assistants, and IVR menus with pytest using a small Conversation object and a callable bot adapter. Use when the user wants to write rule-based assertions over multi-turn dialogue without bringing in an LLM dependency, when they have a chatbot reachable as a Python callable or HTTP webhook, when they need to keep per-conversation state across turns and assert on slot filling, when they want pytest-native fixtures and a printable transcript on failure, or when they mention voice-assistant testing, IVR menu testing, conversational AI testing, LLM bot testing (used as the target under test, not as the matcher), expect matchers for bot replies, or multi-turn dialogue tests.

99

1.56x
Quality

100%

Does it follow best practices?

Impact

97%

1.56x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is an excellent skill file that demonstrates strong conciseness, actionability, and organization. It provides complete, executable examples for each key feature (basic test, multi-turn state, HTTP adapter, matchers) while keeping the content lean and deferring detailed API documentation to REFERENCE.md. The security considerations for HTTP webhooks are appropriately included without being verbose.

DimensionReasoningScore

Conciseness

The content is lean and efficient. It assumes Claude's competence with Python, pytest, and HTTP concepts. Every section earns its place—no unnecessary explanations of what chatbots are, how pytest works, or what HTTP webhooks do. The security note is appropriately brief but present.

3 / 3

Actionability

All code examples are fully executable and copy-paste ready: the bot adapter, the test function, the slot-filling example, the HTTP webhook adapter, and the matchers. Installation commands are concrete. The progression from simple to multi-turn to HTTP is immediately usable.

3 / 3

Workflow Clarity

The quick start provides a clear 4-step sequence (install → write adapter → write test → run). The multi-turn state section demonstrates the slot-filling workflow with a complete test. For a testing library skill, the workflow is straightforward (no destructive operations requiring validation checkpoints), and the single-task nature of each test is unambiguous.

3 / 3

Progressive Disclosure

The SKILL.md provides a concise overview with quick start and key features, then clearly signals one-level-deep references to REFERENCE.md for the full API, matcher details, adapter contract, error reference, and security notes. Content is appropriately split between the overview and the reference file.

3 / 3

Total

12

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly defines a specific niche (pytest-based conversational AI testing), lists concrete capabilities, and provides comprehensive trigger guidance. The 'Use when...' clause is thorough, covering multiple scenarios and explicitly clarifying the LLM distinction. The only minor weakness is that the description is somewhat long, but the detail is substantive rather than padded.

DimensionReasoningScore

Specificity

Lists multiple concrete actions and concepts: 'Test chat bots, voice assistants, and IVR menus with pytest', 'Conversation object and a callable bot adapter', 'rule-based assertions over multi-turn dialogue', 'per-conversation state across turns', 'assert on slot filling', 'pytest-native fixtures and a printable transcript on failure'.

3 / 3

Completeness

Clearly answers both 'what' (test chat bots/voice assistants/IVR menus with pytest using a Conversation object and bot adapter) and 'when' with an explicit and detailed 'Use when...' clause covering multiple trigger scenarios.

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms users would say: 'chat bots', 'voice assistants', 'IVR menus', 'pytest', 'multi-turn dialogue', 'chatbot', 'HTTP webhook', 'slot filling', 'voice-assistant testing', 'IVR menu testing', 'conversational AI testing', 'LLM bot testing', 'expect matchers', 'bot replies', 'multi-turn dialogue tests'.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche: conversational AI testing with pytest, rule-based assertions, no LLM dependency for matching. The combination of pytest + multi-turn dialogue + bot adapter is very specific and unlikely to conflict with general testing or general chatbot skills.

3 / 3

Total

12

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents