CtrlK
BlogDocsLog inGet started
Tessl Logo

develop-web-game

Use when Codex is building or iterating on a web game (HTML/JS) and needs a reliable development + testing loop: implement small changes, run a Playwright-based test script with short input bursts and intentional pauses, inspect screenshots/text, and review console errors with render_game_to_text.

86

2.65x
Quality

81%

Does it follow best practices?

Impact

93%

2.65x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Discovery

85%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a well-structured description that clearly defines both what the skill does and when to use it, with a strong 'Use when' clause upfront. Its main weakness is that some trigger terms are overly technical (e.g., 'render_game_to_text', 'Playwright-based') and it could benefit from including more natural user-facing language. The description is distinctive and unlikely to conflict with other skills due to its specific niche.

Suggestions

Add more natural user-facing trigger terms like 'browser game', 'test my game', 'debug game', 'game not working' alongside the technical terms to improve discoverability.

Consider rephrasing to use third-person voice throughout (e.g., 'Implements small changes and runs Playwright-based tests...') rather than the imperative/instructional tone currently used.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: implement small changes, run Playwright-based test scripts with short input bursts and intentional pauses, inspect screenshots/text, and review console errors with render_game_to_text.

3 / 3

Completeness

Clearly answers both what (implement changes, run Playwright tests, inspect screenshots/text, review console errors) and when ('Use when Codex is building or iterating on a web game (HTML/JS) and needs a reliable development + testing loop').

3 / 3

Trigger Term Quality

Includes some relevant terms like 'web game', 'HTML/JS', 'Playwright', 'screenshots', 'console errors', but uses somewhat technical jargon ('render_game_to_text', 'Playwright-based test script') and misses common user phrases like 'browser game', 'test my game', 'debug game'. The term 'Codex' is very specific but may not be what users naturally say.

2 / 3

Distinctiveness Conflict Risk

Highly distinctive niche combining web game development, Playwright testing, screenshot inspection, and the specific render_game_to_text tool. Unlikely to conflict with general coding, testing, or game design skills.

3 / 3

Total

11

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, actionable skill with a well-defined workflow and clear validation checkpoints for an iterative game development loop. Its main weakness is verbosity—several sections repeat instructions (progress.md handling, screenshot inspection emphasis) and some content like Core Game Guidelines could be split into a reference file. The actionability and workflow clarity are excellent, with concrete code examples and explicit feedback loops.

Suggestions

Move 'Core Game Guidelines' and 'Playwright Prerequisites' into separate reference files to reduce the main skill's token footprint and improve progressive disclosure.

Consolidate the duplicated progress.md instructions (mentioned in both the workflow step 5 and the 'Progress Tracking' section) into a single location.

DimensionReasoningScore

Conciseness

The skill is fairly detailed and mostly relevant, but includes some unnecessary elaboration (e.g., explaining what progress.md should contain multiple times, repeating the screenshot inspection mandate verbosely in 'Test Artifacts to Review'). Some sections like Playwright Prerequisites explain basic npm concepts Claude already knows.

2 / 3

Actionability

The skill provides concrete, executable commands (node invocations with flags), copy-paste-ready code patterns (render_game_to_text, advanceTime), specific JSON action payload examples, and clear environment variable setup. Guidance is specific and directly usable.

3 / 3

Workflow Clarity

The 14-step workflow is clearly sequenced with explicit validation checkpoints (inspect screenshots, check errors, verify controls, reset between scenarios). It includes feedback loops (fix and rerun in a tight loop) and error recovery steps (fix the first new error before continuing). The iterate-with-small-deltas step explicitly references repeating steps 7-13.

3 / 3

Progressive Disclosure

The skill references external files ($WEB_GAME_CLIENT, $WEB_GAME_ACTIONS) appropriately, but the main document itself is quite long with inline content that could be split out (e.g., Core Game Guidelines, Playwright Prerequisites). The structure uses headers well but the document is somewhat monolithic for its length.

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
openai/skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.