webapp-testing

Test local web applications with Playwright. Use when asked to verify frontend functionality, debug UI behavior, capture browser screenshots, or inspect browser logs.

Quality

100%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong, well-crafted description that concisely communicates what the skill does and when to use it. It uses third person voice, lists specific concrete actions, includes natural trigger terms, and has an explicit 'Use when' clause. The Playwright-specific focus makes it clearly distinguishable from other testing or web development skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'Test local web applications', 'verify frontend functionality', 'debug UI behavior', 'capture browser screenshots', 'inspect browser logs'.	3 / 3
Completeness	Clearly answers both 'what' (test local web applications with Playwright) and 'when' (explicit 'Use when' clause covering verify frontend functionality, debug UI behavior, capture browser screenshots, inspect browser logs).	3 / 3
Trigger Term Quality	Includes strong natural trigger terms users would say: 'Playwright', 'frontend functionality', 'UI behavior', 'browser screenshots', 'browser logs', 'web applications'. These cover common variations of how users would describe browser testing tasks.	3 / 3
Distinctiveness Conflict Risk	Clearly scoped to Playwright-based browser testing of local web applications. The mention of Playwright, browser screenshots, browser logs, and UI behavior creates a distinct niche unlikely to conflict with general testing or non-browser skills.	3 / 3
	Total	12 / 12 Passed

Implementation

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted skill that efficiently teaches web application testing with Playwright. It excels at providing a clear decision framework, executable examples, and appropriate progressive disclosure through helper scripts and example files. The content respects Claude's intelligence while providing the specific, non-obvious guidance needed (server lifecycle management, networkidle waiting, reconnaissance pattern).

Dimension	Reasoning	Score
Conciseness	The content is lean and efficient. It assumes Claude knows Python, Playwright, and web concepts without explaining them. Every section serves a purpose—decision tree, examples, pitfalls—with no padding or unnecessary exposition.	3 / 3
Actionability	Provides fully executable bash commands for server management and complete Python code for Playwright automation. The decision tree gives concrete guidance for each scenario, and examples are copy-paste ready.	3 / 3
Workflow Clarity	The decision tree clearly sequences the approach based on context. The reconnaissance-then-action pattern provides explicit ordered steps (inspect → identify → execute). The common pitfall section serves as a validation checkpoint for the critical networkidle wait.	3 / 3
Progressive Disclosure	The skill provides a clear overview with well-organized sections, references helper scripts with --help discovery, and points to examples/ directory with specific file descriptions for deeper patterns. Content is appropriately split without deep nesting.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: warpdotdev/oz-skills
Commit: 6c08c49

Reviewed: about 1 month ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.