test-ui

Test UI system (PanelUI, ScreenSpace) against the poke example using the iwsdk CLI.

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is a well-engineered test runbook: executable commands, precise assertions, explicit validation checkpoints, and a recovery loop. It is concise and self-contained with no wasted tokens or concept exposition.

Dimension	Reasoning	Score
Conciseness	The body is lean operational guidance—concrete commands, assertions, and guardrails—assuming Claude's knowledge of ECS/ScreenSpace with no concept padding; every section earns its place.	3 / 3
Actionability	It provides fully executable `npx iwsdk` commands with concrete JSON inputs and exact expected values (e.g. `maxWidth = 0.5`, `height = "50%"`), making the suites copy-paste ready.	3 / 3
Workflow Clarity	A clear Step 1–5 sequence with explicit validation checkpoints (server-ready polling, connectivity check, log assertions) and a recovery feedback loop (stop→restart→retry, one retry per suite).	3 / 3
Progressive Disclosure	No bundle files exist; the runbook is self-contained and well-organized into clearly headed sections with no nested references and nothing that needs splitting, fitting the self-contained carve-out.	3 / 3
	Total	12 / 12 Passed

Description

57%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific and clearly distinct, but it only states what the skill does without any explicit when-to-use trigger guidance, capping completeness. Adding a 'Use when…' clause naming natural user triggers would raise the score.

Suggestions

Append an explicit trigger clause, e.g. 'Use when the user asks to test or verify the UI system, PanelUI, ScreenSpace, or the poke example.'

Broaden natural keyword coverage with phrasings a user would actually say (e.g. 'test the UI', 'verify panels', 'check ScreenSpace') rather than relying on component/CLI jargon.

Optionally enumerate the concrete actions (e.g. 'load panels, verify ScreenSpace values, check system/component registration, and run stability checks') to lift specificity to a 3.

Dimension	Reasoning	Score
Specificity	Names the concrete domain and targets ('PanelUI, ScreenSpace', 'poke example', 'iwsdk CLI') but under a single action ('Test'), so it does not list multiple distinct concrete actions as a 3 requires.	2 / 3
Completeness	It clearly states what the skill does but provides no 'Use when…' or equivalent explicit trigger guidance, so per the judging guideline completeness is capped at 2.	2 / 3
Trigger Term Quality	Terms like 'Test UI system', 'poke example', and 'iwsdk CLI' are domain-relevant but fairly technical and lack common natural-language variations a user would say, fitting the 2 anchor rather than the broad coverage of a 3.	2 / 3
Distinctiveness Conflict Risk	The scope is a narrow, well-defined niche (PanelUI/ScreenSpace tested against the poke example via iwsdk CLI) with distinct triggers unlikely to fire for unrelated skills.	3 / 3
	Total	9 / 12 Passed

Validation

93%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 15 / 16 Passed

Validation for skill structure

Criteria	Description	Result
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	15 / 16 Passed

Repository: facebook/immersive-web-sdk
Commit: 42af866

Reviewed: 6 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.