playwright

Browser automation and E2E testing with Playwright. Auto-detects dev servers, writes clean test scripts. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use for cross-browser testing, visual regression, API testing, component testing in TypeScript/JavaScript and Python projects.

Quality

71%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

Fix and improve this skill with Tessl

tessl review fix ./plugins/playwright/skills/playwright/SKILL.md

Quality

Content

42%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill is highly actionable with excellent executable examples covering diverse browser automation scenarios, but it severely suffers from verbosity and poor progressive disclosure. The majority of the content (selectors, assertions, actions, network mocking, POM, visual testing, auth patterns) is standard Playwright knowledge that Claude already possesses and should either be omitted or moved to the referenced API_REFERENCE.md file. The core workflow is present but lacks validation checkpoints after execution.

Suggestions

Move all API reference content (Selectors, Assertions, Actions, Network Mocking, Visual Testing, Authentication State, Page Object Model sections) into `references/API_REFERENCE.md` and keep only the critical workflow, quick start, and 1-2 key automation patterns in SKILL.md.

Remove content Claude already knows — best practices like 'trust auto-waiting', 'each test gets fresh context', 'run tests in parallel' are standard Playwright knowledge and waste tokens.

Add validation/verification steps to the workflow: after executing a test script, check exit code, verify screenshots were created, and provide guidance on interpreting failures and retrying.

Trim the troubleshooting and tips sections to only include skill-specific issues (e.g., $SKILL_DIR resolution, run.js wrapper usage) rather than generic Playwright debugging advice.

Dimension	Reasoning	Score
Conciseness	The skill is extremely verbose at ~350+ lines. It includes extensive API reference material (selectors, assertions, actions, network mocking, visual testing, authentication state, page object model) that Claude already knows well. The configuration example, best practices, tips, and troubleshooting sections contain significant redundancy. Much of this content belongs in reference files, not the main SKILL.md.	1 / 3
Actionability	The skill provides fully executable, copy-paste ready code examples throughout — responsive testing, login flows, broken link checking, inline execution, and E2E test patterns. Commands are specific with exact syntax for setup, execution, and troubleshooting.	3 / 3
Workflow Clarity	The critical workflow section provides a clear 5-step sequence for browser automation tasks, including server detection logic with branching. However, there are no validation/verification checkpoints after test execution — no guidance on what to do if scripts fail, no feedback loops for error recovery, and the workflow mixes automation and E2E testing patterns without clear delineation.	2 / 3
Progressive Disclosure	The skill is a monolithic wall of content. Extensive API reference material (selectors, assertions, actions, network mocking, visual testing, auth state, page object model) is inlined rather than placed in the referenced `references/API_REFERENCE.md`. The 'When to Load References' section at the bottom lists topics that are already covered inline, creating redundancy. No bundle files were provided to verify the reference structure.	1 / 3
	Total	7 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that covers specific capabilities comprehensively, includes abundant natural trigger terms, and clearly delineates both what the skill does and when to use it. The Playwright-specific framing provides clear distinctiveness. The only minor weakness is that 'automate any browser task' is slightly vague, but it's well-supported by the surrounding concrete examples.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task.' Also mentions auto-detecting dev servers and writing clean test scripts.	3 / 3
Completeness	Clearly answers 'what' (browser automation, E2E testing, screenshots, form filling, etc.) and 'when' with an explicit 'Use for' clause covering cross-browser testing, visual regression, API testing, and component testing in specific language ecosystems.	3 / 3
Trigger Term Quality	Excellent coverage of natural terms users would say: 'Playwright', 'E2E testing', 'browser automation', 'screenshots', 'responsive design', 'login flows', 'cross-browser testing', 'visual regression', 'API testing', 'component testing', 'TypeScript', 'JavaScript', 'Python'. These are terms users would naturally use when requesting these capabilities.	3 / 3
Distinctiveness Conflict Risk	Clearly scoped to Playwright-based browser automation and E2E testing, which is a distinct niche. The mention of Playwright as the specific tool, combined with browser-specific actions, makes it unlikely to conflict with other skills like general testing or non-browser automation skills.	3 / 3
	Total	12 / 12 Passed

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 9 / 11 Passed

Validation for skill structure

Criteria	Description	Result
skill_md_line_count	SKILL.md is long (508 lines); consider splitting into references/ and linking	Warning
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning

	Total	9 / 11 Passed

Repository: secondsky/claude-skills
Commit: 5e92b71

Reviewed: about 2 months ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.