
test-ui

Test UI system (PanelUI, ScreenSpace) against the poke example using the iwsdk CLI.

65

Quality

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Passed

No known issues


Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is a well-engineered test runbook: executable commands, precise assertions, explicit validation checkpoints, and a recovery loop. It is concise and self-contained with no wasted tokens or concept exposition.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The body is lean operational guidance (concrete commands, assertions, and guardrails), assuming Claude's knowledge of ECS/ScreenSpace with no concept padding; every section earns its place. | 3 / 3 |
| Actionability | It provides fully executable `npx iwsdk` commands with concrete JSON inputs and exact expected values (e.g. `maxWidth = 0.5`, `height = "50%"`), making the suites copy-paste ready. | 3 / 3 |
| Workflow Clarity | A clear Step 1–5 sequence with explicit validation checkpoints (server-ready polling, connectivity check, log assertions) and a recovery feedback loop (stop→restart→retry, one retry per suite). | 3 / 3 |
| Progressive Disclosure | No bundle files exist; the runbook is self-contained and well-organized into clearly headed sections with no nested references and nothing that needs splitting, fitting the self-contained carve-out. | 3 / 3 |
| Total | | 12 / 12 |

Passed
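
The runbook itself is not reproduced on this page, so the following is only a generic sketch of the recovery pattern the Workflow Clarity row describes (stop→restart→retry, one retry per suite). Every name here (`run_with_one_retry`, `run_suite`, `restart_server`, and the `suite-*` labels) is hypothetical and does not come from the skill or the iwsdk CLI.

```python
from typing import Callable, Dict, Iterable, List


def run_with_one_retry(
    suites: Iterable[str],
    run_suite: Callable[[str], bool],
    restart_server: Callable[[], None],
) -> Dict[str, str]:
    """Run each suite; on failure, restart the server and retry exactly once."""
    results: Dict[str, str] = {}
    for name in suites:
        if run_suite(name):
            results[name] = "passed"
            continue
        restart_server()  # stop -> restart before the single allowed retry
        results[name] = "passed-after-retry" if run_suite(name) else "failed"
    return results


# Demo with stand-in callables (assumptions, not real suites or commands).
attempts: Dict[str, int] = {}
restart_log: List[str] = []


def fake_run(name: str) -> bool:
    attempts[name] = attempts.get(name, 0) + 1
    if name == "suite-b":          # flaky: fails once, then passes
        return attempts[name] > 1
    return name != "suite-c"       # suite-c always fails


def fake_restart() -> None:
    restart_log.append("restart")


results = run_with_one_retry(["suite-a", "suite-b", "suite-c"], fake_run, fake_restart)
print(results)
```

Capping retries at one per suite keeps a persistently failing suite from looping forever while still absorbing a transient server problem.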

Description

57%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific and clearly distinct, but it only states what the skill does without any explicit when-to-use trigger guidance, capping completeness. Adding a 'Use when…' clause naming natural user triggers would raise the score.

Suggestions

Append an explicit trigger clause, e.g. 'Use when the user asks to test or verify the UI system, PanelUI, ScreenSpace, or the poke example.'

Broaden natural keyword coverage with phrasings a user would actually say (e.g. 'test the UI', 'verify panels', 'check ScreenSpace') rather than relying on component/CLI jargon.

Optionally enumerate the concrete actions (e.g. 'load panels, verify ScreenSpace values, check system/component registration, and run stability checks') to lift specificity to a 3.
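
As a rough illustration of the first suggestion, the sketch below appends the proposed trigger clause to the current description and checks for it with a simple pattern. The function `has_trigger_clause` and the regular expression are hypothetical; the reviewer's completeness score is a judgment call, not a string match, so treat this as a lint-style approximation only.

```python
import re

# Hypothetical lint, not the reviewer's actual rubric: it only checks whether
# a description carries an explicit "Use when ..." trigger clause.
TRIGGER_PATTERN = re.compile(r"\buse when\b", re.IGNORECASE)


def has_trigger_clause(description: str) -> bool:
    """Return True if the description contains an explicit 'Use when' clause."""
    return bool(TRIGGER_PATTERN.search(description))


# Current description, as quoted at the top of this review.
current = (
    "Test UI system (PanelUI, ScreenSpace) against the poke example "
    "using the iwsdk CLI."
)

# Current description plus the trigger clause suggested above.
revised = current + (
    " Use when the user asks to test or verify the UI system, PanelUI, "
    "ScreenSpace, or the poke example."
)

print(has_trigger_clause(current))  # False
print(has_trigger_clause(revised))  # True
```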

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Names the concrete domain and targets ('PanelUI, ScreenSpace', 'poke example', 'iwsdk CLI') but under a single action ('Test'), so it does not list multiple distinct concrete actions as a 3 requires. | 2 / 3 |
| Completeness | It clearly states what the skill does but provides no 'Use when…' or equivalent explicit trigger guidance, so per the judging guideline completeness is capped at 2. | 2 / 3 |
| Trigger Term Quality | Terms like 'Test UI system', 'poke example', and 'iwsdk CLI' are domain-relevant but fairly technical and lack common natural-language variations a user would say, fitting the 2 anchor rather than the broad coverage of a 3. | 2 / 3 |
| Distinctiveness Conflict Risk | The scope is a narrow, well-defined niche (PanelUI/ScreenSpace tested against the poke example via iwsdk CLI) with distinct triggers unlikely to fire for unrelated skills. | 3 / 3 |
| Total | | 9 / 12 |

Passed

Validation

93%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 15 / 16 Passed

Validation for skill structure

| Criteria | Description | Result |
| --- | --- | --- |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 15 / 16 |

Passed

Repository
facebook/immersive-web-sdk
Reviewed
