examples

Use when working in the examples/ directory, running an example with wrangler dev, adding a new example, or answering questions about EXPOSE directives and the local Docker dev loop. (project)

67

Quality

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Passed

No known issues

Quality

Content

87%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is concise, highly actionable, and well-organized with concrete commands and a clear table of available examples. The main gap is workflow clarity: the new-example checklist lacks an explicit build/validate feedback loop.

Suggestions

Add an explicit validation checkpoint to 'Adding a New Example' — e.g., step 4 as 'Run npm run dev and confirm it starts; if the image is stale, run npm run docker:rebuild and retry' — to turn the build check into a verify-fix-retry loop.

Consider a short 'Verify your changes' note covering both the build and a clean-checkout smoke test, so the workflow guards against the stale-image failure mode already mentioned in the tips section.
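
The verify-fix-retry loop suggested above could be sketched as a small shell helper. This is a minimal illustration, not part of the reviewed skill: the function name `verify_with_fix` is hypothetical, and the check command is left to the caller because the skill does not define a one-shot command that confirms `npm run dev` has started.

```shell
# Hypothetical helper: run a check, and if it fails, run a fix once and retry.
# Usage: verify_with_fix "<check command>" "<fix command>"
verify_with_fix() {
  check_cmd=$1
  fix_cmd=$2

  # First attempt: if the check passes, there is nothing to fix.
  if sh -c "$check_cmd"; then
    echo "check passed"
    return 0
  fi

  # The check failed, so apply the fix (for example, rebuilding a stale image).
  echo "check failed; running fix: $fix_cmd" >&2
  sh -c "$fix_cmd" || { echo "fix failed" >&2; return 1; }

  # Retry the check exactly once after the fix.
  if sh -c "$check_cmd"; then
    echo "check passed after fix"
    return 0
  fi

  echo "check still failing after fix" >&2
  return 1
}

# Example call (the check command is a placeholder you would supply):
#   verify_with_fix "<command that confirms the example starts>" "npm run docker:rebuild"
```

Limiting the loop to a single retry keeps a persistent failure visible instead of masking it behind repeated rebuilds.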

Dimension | Reasoning | Score

Conciseness

The body is lean and assumes Claude's competence — it gives commands and facts ('npm run dev', 'npm run docker:rebuild') without explaining what Docker or wrangler are, and every section earns its place.

3 / 3

Actionability

Guidance is concrete and executable: exact commands ('npm run dev', 'npm run docker:rebuild', 'docker images | grep sandbox'), a copyable starting-point instruction ('Copy examples/minimal/'), and specific edit targets (package.json name, wrangler.jsonc bindings).

3 / 3

Workflow Clarity

The 'Adding a New Example' section is a clear numbered sequence, but it has no validation checkpoint — step 4 says 'Make sure the example builds and npm run dev works' as an instruction rather than a verify-and-fix loop, and the local-dev tips lack an explicit validate-then-proceed pattern.

2 / 3

Progressive Disclosure

Content is well-organized into focused sections with no external bundle files to reference; for a short single-purpose skill, the sectioned overview (Running, Available Examples, EXPOSE, Adding, Tips) is appropriately structured and easy to navigate.

3 / 3

Total: 11 / 12

Passed

Description

72%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has strong, natural trigger terms and a clear distinct niche, but it leads with 'Use when...' and never states what the skill actually is or does, leaving the 'what' half unanswered. Tightening it to lead with a capability statement would lift completeness and specificity.

Suggestions

Lead with a short capability statement naming what the skill is (e.g., 'Documents and guides work with the sample apps in the examples/ directory'), then keep the 'Use when...' clause so both 'what' and 'when' are explicit.

Reframe some trigger use-cases as concrete actions (e.g., 'running examples with wrangler dev, adding new examples, clarifying EXPOSE directives') to strengthen specificity.

Dimension | Reasoning | Score

Specificity

The description names a concrete domain and several specific triggers ('running an example with wrangler dev', 'adding a new example', 'EXPOSE directives', 'the local Docker dev loop') but frames them as use-cases rather than capability actions, so it is not a comprehensive list of what the skill does.

2 / 3

Completeness

The 'when' is explicit and strong ('Use when working in the examples/ directory...'), but the 'what' is never stated — the description does not say what the skill contains or does, so one half of the required answer is missing.

2 / 3

Trigger Term Quality

Natural, developer-likely terms are well covered — 'examples/', 'wrangler dev', 'EXPOSE directives', and 'Docker dev loop' are exactly the phrases a user would say when needing this skill.

3 / 3

Distinctiveness Conflict Risk

The niche is distinct (sample apps in examples/, wrangler dev, EXPOSE directives, Docker dev loop) with triggers unlikely to overlap with other skills.

3 / 3

Total: 10 / 12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository
cloudflare/sandbox-sdk
Reviewed
