run-smoke-tests

Run Playwright smoke tests, debug failures, and verify fixes

Quality

80%

Does it follow best practices?

Security

Passed

No findings from the security scan

Fix and improve this skill with Tessl

tessl review fix ./cursor-team-kit/skills/run-smoke-tests/SKILL.md

Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is concise, actionable, and well-structured with executable commands and an explicit run-inspect-fix-rerun feedback loop, scoring at the top of the scale across all content dimensions.

Dimension	Reasoning	Score
Conciseness	The body is lean with no concept explanations or padding, every section earns its place, and it assumes Claude's competence, matching the score-3 'lean and efficient' anchor rather than the slightly padded score-2 example.	3 / 3
Actionability	It provides fully executable, copy-paste-ready commands such as 'npm run smoketest -- path/to/test.spec.ts' and 'npm run smoketest-no-compile -- path/to/test.spec.ts', matching the score-3 anchor for concrete, executable guidance.	3 / 3
Workflow Clarity	The four-step workflow has a clear sequence with an explicit feedback loop (step 3 inspect traces/logs on failure, step 4 'Apply a minimal fix and rerun until stable'), matching the score-3 anchor for clear sequence with error-recovery feedback loops.	3 / 3
Progressive Disclosure	The skill is under 50 lines, needs no external references, and is organized into clearly signaled sections (Trigger, Workflow, Example Commands, Guardrails, Output), which the rubric notes can score 3 for simple skills with well-organized sections.	3 / 3
	Total	12 / 12 Passed

Description

60%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific and action-oriented with a clear niche, but it omits an explicit 'Use when...' trigger clause and broader natural-language keyword coverage, which together cap several dimensions at 2.

Suggestions

Append an explicit trigger clause such as 'Use when the user asks to run, debug, or verify Playwright smoke/e2e tests before or after changes.'

Broaden trigger-term coverage with natural variations users say, e.g. 'smoke tests, e2e/end-to-end tests, flaky tests, test failures'.

Clarify distinctiveness from generic test-running skills by naming Playwright specifically in the trigger (e.g. 'Use when running Playwright e2e/smoke suites').
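Taken together, the three suggestions could be applied in the skill's frontmatter roughly as follows. This is only a sketch: the `name` value is assumed from the skill's directory name, the wording after the first sentence is illustrative and pieced together from the suggestions above, and the maintainers' actual frontmatter may contain other fields.

```yaml
---
# Assumed from the path cursor-team-kit/skills/run-smoke-tests/SKILL.md
name: run-smoke-tests
# First sentence is the existing description; the "Use when" clause is a
# hypothetical addition combining the trigger clause and extra keywords.
description: >-
  Run Playwright smoke tests, debug failures, and verify fixes. Use when the
  user asks to run, debug, or verify Playwright smoke or e2e/end-to-end tests,
  or to investigate flaky tests and test failures, before or after changes.
---
```

Naming Playwright inside the trigger clause itself, rather than only in the opening sentence, is what addresses the overlap with generic test-running skills.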

Dimension	Reasoning	Score
Specificity	The description 'Run Playwright smoke tests, debug failures, and verify fixes' lists three distinct concrete actions, matching the score-3 anchor of multiple specific concrete actions rather than the partial coverage of score 2.	3 / 3
Completeness	It clearly states what the skill does but lacks any 'Use when...' clause or explicit trigger guidance, and the rubric caps completeness at 2 when that trigger guidance is missing.	2 / 3
Trigger Term Quality	It includes natural terms like 'Playwright' and 'smoke tests' but omits common variations users would say ('e2e/end-to-end tests', 'flaky tests', 'test suite'), fitting score 2's 'some relevant keywords but missing common variations' rather than the broad coverage of score 3.	2 / 3
Distinctiveness / Conflict Risk	'Playwright smoke tests' is a recognizable niche, but it could overlap with sibling test-running skills and lacks explicit disambiguating triggers, matching score 2 rather than the clear, distinct-trigger profile of score 3.	2 / 3
	Total	9 / 12 Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository: cursor/plugins
Path: cursor-team-kit/skills/run-smoke-tests/SKILL.md
Commit: fe77e77

Reviewed: 2 days ago
