run-smoke-tests

Run Playwright smoke tests, debug failures, and verify fixes

62

Quality: 52%
Does it follow best practices?

Impact: Pending
No eval scenarios have been run

Security by Snyk: Passed
No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./cursor-team-kit/skills/run-smoke-tests/SKILL.md
Quality

Discovery

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description identifies a clear domain (Playwright smoke tests) and lists three high-level actions, but it lacks a 'Use when...' clause which is critical for skill selection. It would benefit from more specific trigger terms covering common user phrasings and explicit guidance on when Claude should activate this skill.

Suggestions

Add a 'Use when...' clause such as 'Use when the user asks to run Playwright tests, debug e2e test failures, or verify browser test fixes.'

Include additional natural trigger terms like 'e2e tests', 'end-to-end tests', 'browser tests', 'test automation', or file extensions like '.spec.ts'.

Expand the specific actions to be more concrete, e.g., 'Run Playwright smoke tests, analyze test failure logs and screenshots, re-run failing tests, and verify fixes pass.'
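
The suggestions above can be checked mechanically before resubmitting a description. The following is a minimal sketch, not part of Tessl or the skill itself: `checkDescription` and `SUGGESTED_TERMS` are hypothetical names, and the plain substring matching is an assumption about what counts as a trigger term being present.

```typescript
// Hypothetical helper: reports whether a skill description contains a
// "Use when" clause and which suggested trigger terms it still lacks.
interface DescriptionCheck {
  hasUseWhen: boolean;
  missingTerms: string[];
}

// Trigger terms copied from the suggestions in this review.
const SUGGESTED_TERMS: string[] = [
  "e2e tests",
  "end-to-end tests",
  "browser tests",
  "test automation",
  ".spec.ts",
];

function checkDescription(
  description: string,
  terms: string[] = SUGGESTED_TERMS,
): DescriptionCheck {
  // Case-insensitive substring matching (an assumption, not Tessl's rubric).
  const lower = description.toLowerCase();
  return {
    hasUseWhen: lower.includes("use when"),
    missingTerms: terms.filter((term) => !lower.includes(term.toLowerCase())),
  };
}

// The description as reviewed, and the same text with the suggested clause appended.
const currentDescription =
  "Run Playwright smoke tests, debug failures, and verify fixes";
const revisedDescription =
  currentDescription +
  ". Use when the user asks to run Playwright tests, debug e2e test failures, or verify browser test fixes.";

console.log(checkDescription(currentDescription));
console.log(checkDescription(revisedDescription));
```

Exact substring matching is deliberately strict: "e2e test failures" does not count as "e2e tests", so the revised description still reports missing terms even though it now has a "Use when" clause.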

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Names the domain (Playwright smoke tests) and lists some actions (run, debug failures, verify fixes), but doesn't elaborate on specific concrete actions like reading test reports, capturing screenshots, or analyzing error logs. | 2 / 3 |
| Completeness | Describes what the skill does (run Playwright smoke tests, debug failures, verify fixes) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per rubric guidelines, missing 'Use when' caps completeness at 2, and since the 'what' is also only moderately detailed, this scores a 1. | 1 / 3 |
| Trigger Term Quality | Includes 'Playwright', 'smoke tests', 'debug failures', and 'verify fixes' which are relevant keywords, but misses common variations like 'e2e tests', 'end-to-end', 'browser tests', 'test automation', or 'npx playwright'. | 2 / 3 |
| Distinctiveness Conflict Risk | Mentioning 'Playwright' specifically helps distinguish it from generic testing skills, but 'debug failures and verify fixes' is broad enough to overlap with other debugging or testing skills. | 2 / 3 |
| Total | | 7 / 12 |

Passed

Implementation

72%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

A concise and well-structured skill that provides useful npm commands and sensible guardrails. However, the workflow steps are too abstract—particularly around debugging failures and inspecting traces—which limits actionability. Adding specific Playwright debugging commands and trace inspection techniques would significantly improve the skill.

Suggestions

Add concrete commands for debugging failures, e.g., `npx playwright show-trace trace.zip` or how to read Playwright's HTML report

Provide a specific example of isolating a root cause (e.g., a sample test failure output and the diagnostic steps taken)

Include explicit validation checkpoints in the workflow, such as 'confirm the build succeeded before running tests' with a specific check command
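
One way to make the checkpoint and "rerun until stable" suggestions concrete is sketched below. It is an illustration under stated assumptions, not the skill's actual workflow: `runSmokeWorkflow`, `RunCommand`, and `requiredPasses` are hypothetical names, `npm run build` is an assumed build script, and "stable" is interpreted here as a fixed number of consecutive green runs. The command runner is injected so the logic can be exercised without a real project.

```typescript
// Hypothetical runner type: executes a shell command and returns its exit code.
type RunCommand = (command: string) => number;

interface SmokeResult {
  stage: "build" | "test"; // where the workflow stopped
  passed: boolean;
  testRuns: number; // how many times the test command was executed
}

function runSmokeWorkflow(
  run: RunCommand,
  buildCommand: string,
  testCommand: string,
  requiredPasses: number = 2, // assumption: "stable" means 2 consecutive passes
): SmokeResult {
  // Checkpoint: confirm the build succeeded before running any tests.
  if (run(buildCommand) !== 0) {
    return { stage: "build", passed: false, testRuns: 0 };
  }
  // Require consecutive green runs; stop at the first failure so it can be
  // debugged (traces, logs) instead of being masked by further reruns.
  for (let runIndex = 1; runIndex <= requiredPasses; runIndex++) {
    if (run(testCommand) !== 0) {
      return { stage: "test", passed: false, testRuns: runIndex };
    }
  }
  return { stage: "test", passed: true, testRuns: requiredPasses };
}

// Usage with a fake runner that always succeeds.
const alwaysGreen: RunCommand = () => 0;
console.log(runSmokeWorkflow(alwaysGreen, "npm run build", "npx playwright test"));
```

In a real project the injected runner could wrap Node's `child_process.spawnSync` and return its `status` field. A failed test run is where the debugging commands mentioned above, such as `npx playwright show-trace trace.zip`, would come in.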

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The content is lean and efficient. No unnecessary explanations of what smoke tests are or how Playwright works. Every section earns its place and respects Claude's intelligence. | 3 / 3 |
| Actionability | Provides concrete npm commands which are helpful, but the workflow steps (1-4) are fairly generic and abstract: 'Build prerequisites for the target app' and 'inspect traces/logs and isolate the root cause' lack specific commands or techniques. No executable code for debugging or trace inspection. | 2 / 3 |
| Workflow Clarity | The workflow has a clear sequence (build → run → debug → fix → rerun) with an implicit feedback loop in step 4, but lacks explicit validation checkpoints and concrete guidance on how to inspect traces/logs or determine root cause. The 'rerun until stable' step is vague. | 2 / 3 |
| Progressive Disclosure | For a simple, short skill with no bundle files, the content is well-organized into clear sections (Trigger, Workflow, Example Commands, Guardrails, Output) that are easy to scan and navigate. | 3 / 3 |
| Total | | 10 / 12 |

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 11 / 11 passed

Validation for skill structure

No warnings or errors.

Repository: cursor/plugins
Reviewed
