agent-sandbox

Agent skill for sandbox - invoke with $agent-sandbox

4.65x

Quality

47%

Does it follow best practices?

Impact

93%

4.65x

Average score across 3 eval scenarios

Securityby

High

Do not use without reviewing

Fix and improve this skill with Tessl

tessl review fix ./.agents/skills/agent-sandbox/SKILL.md

Quality

Content

72%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is actionable and well-structured, with concrete executable MCP calls and a clear sequence of steps, but it could be slightly tighter and lacks validation/verification checkpoints for its destructive sandbox operations. Progressive disclosure is appropriate for a compact single-file skill.

Suggestions

Trim restated framing (the 'Your expertise lies in...' sentence and the closing restatement of quality standards) to improve token efficiency.

Add explicit validation checkpoints to the workflow, e.g. verify sandbox_status before executing code and confirm cleanup with sandbox_status after stop/delete.

Include a brief error-handling/retry note for failed execution or creation steps to close the feedback loop on destructive operations.

Dimension	Reasoning	Score
Conciseness	The body is largely lean with a concrete code toolkit and short lists, but it includes restatements such as 'Your expertise lies in creating secure, scalable development environments...' and a closing sentence that reiterates the quality standards, so it could be tightened.	2 / 3
Actionability	It provides concrete, executable MCP calls (sandbox_create, sandbox_execute, sandbox_upload, status/stop/delete) with real parameters and inline option comments, making the toolkit copy-paste ready.	3 / 3
Workflow Clarity	The six-step deployment approach is clearly sequenced, but it lacks validation or verification checkpoints for batch/destructive operations like sandbox execution and deletion, which caps workflow clarity at 2 per the rubric.	2 / 3
Progressive Disclosure	This is a single-purpose skill with no bundle files, modest length, and well-organized sections (toolkit, deployment approach, templates, quality standards); per the simple-skill note, well-organized sections with no nested references earn a 3.	3 / 3
	Total	10 / 12 Passed

Description

22%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is essentially a label with an invocation hint: it states a domain but no concrete capabilities and gives no usage triggers, leaving both 'what' and 'when' underspecified. It is too vague to clearly distinguish this skill from other sandbox skills or to reliably surface it for the right requests.

Suggestions

Replace the label-style description with concrete actions, e.g. 'Creates, configures, and manages E2B sandboxes for isolated code execution and testing.'

Add an explicit trigger clause such as 'Use when the user needs an isolated environment to run, test, or develop code, or mentions E2B sandboxes.'

Include natural trigger variations (sandbox, isolated environment, E2B, code execution environment) so the skill surfaces for the right requests.

Dimension	Reasoning	Score
Specificity	The description 'Agent skill for sandbox - invoke with $agent-sandbox' names only a domain and an invocation hint, with no concrete actions like 'creates, configures, or manages sandboxes', matching the vague/abstract anchor rather than score 2 which requires named actions.	1 / 3
Completeness	Both the 'what' ('Agent skill for sandbox') and the 'when' are weak or absent — there is no 'Use when...' clause or explicit trigger guidance, and the guidelines cap completeness at 2 only when 'what' is clear, which it is not.	1 / 3
Trigger Term Quality	It contains the natural term 'sandbox' but offers no common variations and pairs it with the technical invocation syntax '$agent-sandbox', so it has some relevant keywords while missing the broader coverage expected for a 3.	2 / 3
Distinctiveness Conflict Risk	The term 'sandbox' provides some specificity but no distinct triggers, so it could overlap with other sandbox-related skills; it is not generic enough for a 1 nor distinctive enough for a 3.	2 / 3
	Total	6 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository: ruvnet/ruflo
Path: .agents/skills/agent-sandbox/SKILL.md
Commit: 26c35b5

Reviewed: about 6 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.