Name: benchmark-sandbox
Rating: 80.8 (1 review)
Author: vercel

benchmark-sandbox

Run vercel-plugin eval scenarios in Vercel Sandboxes instead of local WezTerm panels. Provisions ephemeral microVMs with Claude Code + plugin pre-installed, runs benchmark prompts, extracts hook artifacts, and produces coverage reports.

Quality: 72% (Does it follow best practices?)

Impact: 92%, 2.09x (Average score across 3 eval scenarios)

Security: Advisory (Suggest reviewing before use)

Fix and improve this skill with Tessl

tessl review fix ./.claude/skills/benchmark-sandbox/SKILL.md

1 medium-severity finding. This skill can be installed, but you should review this finding before use.

Medium

W013: Attempt to modify system services in skill instructions.

What this means

The skill prompts the agent to compromise the security or integrity of the user’s machine by modifying system-level services or configurations, such as obtaining elevated privileges, altering startup scripts, or changing system-wide settings.

Why it was flagged

The prompt instructs the agent to bypass agent-level permission checks (using --dangerously-skip-permissions), write auth tokens, install global packages, and run commands that change the sandbox filesystem. However, it does not request sudo, modify system-level files that require root, or create new OS users.

Repository: vercel/vercel-plugin
Path: .claude/skills/benchmark-sandbox/SKILL.md
Commit: 19606ac

Audited: 2 months ago