docker-sandbox

Create, manage, and execute agent tools (claude, codex) inside Docker sandboxes for isolated code execution. Use when running agent loops, spawning tool subprocesses, or any task requiring process isolation. Triggers on "sandbox", "isolated execution", "docker sandbox", "safe agent execution", or when working on agent loop infrastructure.

Quality: 88% (Does it follow best practices?)

Impact: 100%, 3.22x (average score across 3 eval scenarios)

Security: Risky. Do not use without reviewing.

Evaluation results

100%

33%

Agent Loop Sandbox Infrastructure

Agent loop sandbox lifecycle management

Criteria (without context → with context)

Sandbox naming convention: 41% → 100%
createLoopSandbox function: 25% → 100%
execInSandbox function: 62% → 100%
destroyLoopSandbox function: 87% → 100%
Pre-warm create at loop start: 100%
Reuse across stories: 100%
Destroy at loop end: 100%
spawnTool sandbox check: 20% → 100%
Host-mode fallback: 100%
Working dir flag: 75% → 100%
Exec overhead documented: 40% → 100%
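The lifecycle criteria listed above (create once at loop start, reuse across stories, destroy at loop end, fall back to host mode when no sandbox is available) could be sketched roughly as follows. This is only an illustration under stated assumptions: the `SandboxBackend` interface, the `runLoop` function, the `loop-<id>` name, and the `["agent", story]` command are all hypothetical placeholders. They are not the skill's actual `createLoopSandbox`, `execInSandbox`, or `destroyLoopSandbox` signatures, and they are not Docker CLI calls.

```typescript
// Hypothetical abstraction over whatever actually creates, runs in, and
// removes a sandbox. The real skill's helpers may look quite different.
interface SandboxBackend {
  create(name: string): void;
  exec(name: string, command: string[]): string;
  destroy(name: string): void;
}

// Runs every story of one loop. With a backend, a single sandbox is created
// up front, reused for each story, and destroyed even if a story throws.
// Without a backend, each command runs on the host instead.
function runLoop(
  loopId: string,
  stories: string[],
  runOnHost: (command: string[]) => string,
  backend?: SandboxBackend,
): string[] {
  const results: string[] = [];

  if (backend === undefined) {
    // Host-mode fallback: no isolation, same commands.
    for (const story of stories) {
      results.push(runOnHost(["agent", story])); // placeholder command
    }
    return results;
  }

  const name = `loop-${loopId}`; // assumed naming scheme, for illustration only
  backend.create(name); // pre-warm once at loop start
  try {
    for (const story of stories) {
      // Reuse the same sandbox for every story.
      results.push(backend.exec(name, ["agent", story]));
    }
    return results;
  } finally {
    backend.destroy(name); // always clean up at loop end
  }
}
```

Putting the destroy call in `finally` means a failing story still tears the sandbox down, and keeping the backend optional makes the host-mode path explicit rather than implicit.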

Agent Authentication Setup for Docker Sandboxes

Auth token storage and sandbox injection

Criteria (without context → with context)

Claude token command: 100%
Claude secret name: 100%
Secrets lease TTL flag: 100%
CLAUDE_CODE_OAUTH_TOKEN env var: 100%
Codex secret name: 100%
Codex auth source: 100%
Codex sandbox injection path: 100%
Codex portability noted: 100%
Claude token validity documented: 100%
Refresh procedures documented: 100%

72%

Hardened Sandbox Environment for Regulated Workloads

Network restriction and custom sandbox templates

Criteria (without context → with context)

Bun absent from base template: 41% → 100%
Network deny policy: 100%
Allow-host for API endpoints: 100%
Both required hosts allowed: 40% → 100%
Save template command: 100%
Template versioning: 87% → 100%
Launch from template (-t flag): 100%
Template reuse rationale: 87% → 100%
Bun install method: 62% → 100%

Repository: joelhooks/joelclaw
Commit: 2ca3686

Evaluated: 4 months ago
Agent: Claude Code
Model: Claude Sonnet 4.6

