CtrlK
BlogDocsLog inGet started
Tessl Logo

arn-spark-clickable-prototype

This skill should be used when the user says "clickable prototype", "arn clickable prototype", "interactive prototype", "test interactions", "validate UX", "user journeys", "test navigation", "make it clickable", "prototype interactions", "test the prototype", "build the screens", "create the UI", "screen mockups", or wants to generate a clickable interactive prototype with linked screens and validate it through iterative build-review cycles with Playwright-based interaction testing, per-criterion scoring, an independent judge verdict, and versioned output.

53

Quality

59%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/arn-spark/skills/arn-spark-clickable-prototype/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

72%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description provides strong trigger term coverage and occupies a distinctive niche, but suffers from poor structure—it's a single run-on sentence that front-loads trigger phrases and then crams the capability description into a subordinate clause. The 'what it does' portion would benefit from being broken into clearly listed concrete actions, and the overall readability would improve with separation of the 'what' and 'when' sections.

Suggestions

Restructure into two clear parts: first list specific capabilities (e.g., 'Generates clickable interactive prototypes with linked screens. Validates UX through Playwright-based interaction testing. Provides per-criterion scoring and independent judge verdicts. Produces versioned output.'), then add a 'Use when...' clause with the trigger terms.

Reduce the trigger phrase list to the most essential natural terms and group synonyms, rather than exhaustively listing every variation in a comma-separated block.

DimensionReasoningScore

Specificity

The description mentions some concrete actions like 'generate a clickable interactive prototype with linked screens', 'Playwright-based interaction testing', 'per-criterion scoring', 'independent judge verdict', and 'versioned output', but these are crammed into a single long clause rather than clearly listed as distinct capabilities. The specificity is moderate—it names the domain and several actions but reads more like a run-on sentence than a structured capability list.

2 / 3

Completeness

The description opens with 'This skill should be used when...' which addresses the 'when' aspect through trigger phrases, and the latter half describes 'what' it does (generate prototypes, test with Playwright, score, judge, version output). However, the 'what' portion is not clearly separated or explicitly structured—it's embedded in a single long sentence. The 'when' is present but is essentially just a list of trigger phrases rather than a clear 'Use when...' clause describing scenarios. It partially answers both but neither is cleanly articulated.

2 / 3

Trigger Term Quality

The description includes an extensive list of natural trigger phrases users would say: 'clickable prototype', 'interactive prototype', 'test interactions', 'validate UX', 'user journeys', 'make it clickable', 'screen mockups', 'create the UI', 'build the screens', etc. These cover many natural variations a user might use.

3 / 3

Distinctiveness Conflict Risk

The skill has a very clear niche: clickable interactive prototypes with Playwright-based testing, scoring, and judge verdicts. The combination of prototyping + automated interaction testing + scoring is highly distinctive and unlikely to conflict with generic UI or design skills.

3 / 3

Total

10

/

12

Passed

Implementation

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill has excellent workflow clarity with well-defined steps, validation checkpoints, and error recovery paths, but is severely undermined by extreme verbosity. At 400+ lines, it explains far more than Claude needs — repeating fallback logic across workflow steps and error handling, over-specifying conditional paths for prerequisites, and including conversational templates that inflate token usage. The content would benefit enormously from aggressive condensation and splitting detailed reference material into separate files.

Suggestions

Reduce the prerequisites section to a concise checklist with file paths, moving the conditional logic (e.g., fresh design asset fetching flow) to a separate reference file.

Eliminate duplication between the workflow steps and the Error Handling / Agent Invocation Guide sections — keep the canonical logic in one place and reference it.

Extract the conversational templates (user-facing messages with tables and options) into a separate templates reference file, leaving only the intent and key parameters inline.

Add concrete executable examples for key operations like starting the prototype server, invoking agents via the Task tool, and running Playwright scripts — currently these are described abstractly.

DimensionReasoningScore

Conciseness

Extremely verbose at ~400+ lines. Extensively explains orchestration details, agent invocation patterns, and fallback logic that Claude could infer from shorter instructions. Many sections repeat information (e.g., error handling duplicates fallback logic already stated in workflow steps). The prerequisites section alone is massive with conditional logic that could be condensed significantly.

1 / 3

Actionability

The skill provides detailed step-by-step orchestration instructions and clear agent invocation patterns, but contains no executable code examples — all commands are described abstractly (e.g., 'Start the prototype in the background via Bash' without showing how). The workflow is concrete in structure but relies on external agents and templates without showing their actual invocation syntax.

2 / 3

Workflow Clarity

The multi-step workflow is clearly sequenced with numbered steps, explicit validation checkpoints (expert review scoring, judge review, user review), feedback loops (failing criteria feed back into next build cycle), and clear decision points with branching logic. Error recovery is thoroughly addressed with fallback paths for each failure mode.

3 / 3

Progressive Disclosure

The skill references external templates and guides (journey-template.md, review-report-template.md, clickable-prototype-criteria.md, showcase-capture-guide.md) which is good progressive disclosure, but the main SKILL.md itself is a monolithic wall of text that inlines enormous amounts of detail that could be split into separate reference files. The agent invocation guide and error handling sections duplicate workflow content and could be separate files.

2 / 3

Total

8

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
AppsVortex/arness
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.