
docker-sandbox

Create, manage, and execute agent tools (claude, codex) inside Docker sandboxes for isolated code execution. Use when running agent loops, spawning tool subprocesses, or any task requiring process isolation. Triggers on "sandbox", "isolated execution", "docker sandbox", "safe agent execution", or when working on agent loop infrastructure.

92 · 3.22x

Quality: 88%. Does it follow best practices?

Impact: 100% (3.22x). Average score across 3 eval scenarios.

Security by Snyk: Risky. Do not use without reviewing.


Evaluation results

100%

33%

Agent Loop Sandbox Infrastructure

Agent loop sandbox lifecycle management

| Criteria | Without context | With context |
| --- | --- | --- |
| Sandbox naming convention | 41% | 100% |
| createLoopSandbox function | 25% | 100% |
| execInSandbox function | 62% | 100% |
| destroyLoopSandbox function | 87% | 100% |
| Pre-warm create at loop start | 100% | 100% |
| Reuse across stories | 100% | 100% |
| Destroy at loop end | 100% | 100% |
| spawnTool sandbox check | 20% | 100% |
| Host-mode fallback | 100% | 100% |
| Working dir flag | 75% | 100% |
| Exec overhead documented | 40% | 100% |

100%

100%

Agent Authentication Setup for Docker Sandboxes

Auth token storage and sandbox injection

| Criteria | Without context | With context |
| --- | --- | --- |
| Claude token command | 0% | 100% |
| Claude secret name | 0% | 100% |
| Secrets lease TTL flag | 0% | 100% |
| CLAUDE_CODE_OAUTH_TOKEN env var | 0% | 100% |
| Codex secret name | 0% | 100% |
| Codex auth source | 0% | 100% |
| Codex sandbox injection path | 0% | 100% |
| Codex portability noted | 0% | 100% |
| Claude token validity documented | 0% | 100% |
| Refresh procedures documented | 0% | 100% |

100%

72%

Hardened Sandbox Environment for Regulated Workloads

Network restriction and custom sandbox templates

| Criteria | Without context | With context |
| --- | --- | --- |
| Bun absent from base template | 41% | 100% |
| Network deny policy | 0% | 100% |
| Allow-host for API endpoints | 0% | 100% |
| Both required hosts allowed | 40% | 100% |
| Save template command | 0% | 100% |
| Template versioning | 87% | 100% |
| Launch from template (-t flag) | 0% | 100% |
| Template reuse rationale | 87% | 100% |
| Bun install method | 62% | 100% |

Repository: joelhooks/joelclaw

Evaluated

Agent: Claude Code
Model: Claude Sonnet 4.6
