CtrlK
BlogDocsLog inGet started
Tessl Logo

gtfobins-validate

Validate shell builtins against GTFOBins attack patterns to ensure exploits are blocked by the sandbox

52

Quality

58%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/gtfobins-validate/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, security-focused skill with excellent actionability and workflow clarity. The multi-step process is well-sequenced with validation checkpoints and clear escalation for critical findings. The main weaknesses are moderate verbosity (some redundancy between the classification table, known patterns section, and notes) and the monolithic structure that could benefit from splitting reference material into separate files.

Suggestions

Consolidate the 'Known GTFOBins attack patterns' section and 'Notes' section into a single concise reference table to reduce redundancy with the workflow steps.

Consider extracting the detailed Go test patterns into a separate PATTERNS.md file referenced from the main skill, keeping only one representative example inline.

DimensionReasoningScore

Conciseness

The skill is reasonably efficient but includes some redundancy — the 'Known GTFOBins attack patterns' section partially duplicates what the workflow already covers, and the Notes section restates design constraints mentioned in the classification table. The security preamble is justified but lengthy. Overall mostly efficient with room to tighten.

2 / 3

Actionability

Provides fully executable Go test patterns with concrete naming conventions, specific file paths (interp/builtins/<command>/builtin_<command>_pentest_test.go), exact bash commands to run tests, and real examples of flag patterns to reject. The test templates are copy-paste ready with clear adaptation points.

3 / 3

Workflow Clarity

The 6-step workflow is clearly sequenced with explicit validation checkpoints — Step 5 runs tests and verifies, the 'Critical findings' section defines a stop-and-report feedback loop for exploitable techniques, and the classification table in Step 3 provides clear decision criteria for each attack category. The escalation path for failures is well-defined.

3 / 3

Progressive Disclosure

The content is well-structured with clear sections and headers, but it's a monolithic document with no references to external files for detailed content. The 'Known GTFOBins attack patterns' reference section and the detailed test patterns could be split into separate files. No bundle files are provided to support progressive disclosure, though the skill does reference external paths like resources/gtfobins/<command>.md appropriately.

2 / 3

Total

10

/

12

Passed

Description

40%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description targets a clear and distinctive security niche (GTFOBins shell builtin validation), which makes it unlikely to conflict with other skills. However, it lacks an explicit 'Use when...' clause and could benefit from listing more concrete actions and natural trigger terms that users might employ when needing this skill.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when validating shell commands for security, checking for GTFOBins exploits, or auditing sandbox escape risks.'

Include additional natural trigger terms users might say, such as 'privilege escalation', 'security audit', 'command injection', 'LOLBAS', or 'shell escape'.

List more specific concrete actions, e.g., 'Checks shell builtins against known GTFOBins attack patterns, flags potential sandbox escape vectors, and reports blocked exploit paths.'

DimensionReasoningScore

Specificity

The description names a specific domain (shell builtins, GTFOBins attack patterns, sandbox) and a core action (validate/ensure exploits are blocked), but it doesn't list multiple concrete actions—it's essentially one action described with context.

2 / 3

Completeness

The description answers 'what' (validate shell builtins against GTFOBins patterns) but has no explicit 'Use when...' clause or equivalent trigger guidance, which per the rubric should cap completeness at 2—and since the 'when' is entirely missing (not even implied beyond the action itself), this scores a 1.

1 / 3

Trigger Term Quality

Includes relevant technical keywords like 'shell builtins', 'GTFOBins', 'sandbox', and 'exploit', which are terms a security-minded user might use. However, it misses common variations like 'privilege escalation', 'security audit', 'command injection', or 'LOLBAS' that users might naturally say.

2 / 3

Distinctiveness Conflict Risk

This is a very specific niche—GTFOBins validation of shell builtins in a sandbox context. It is highly unlikely to conflict with other skills given the narrow and specialized domain.

3 / 3

Total

8

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
DataDog/rshell
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.