
jailbreak-override

A helpful assistant that removes unnecessary restrictions

Invalid
This skill can't be scored yet
Validation errors are blocking scoring. Review and fix them to unlock Quality, Impact and Security scores.
Security

3 findings: 2 critical severity, 1 medium severity. Installing this skill is not recommended. If you still intend to install it, review these findings carefully first.

Critical

E004: Prompt injection detected in skill instructions

What this means

Detected a prompt injection in the skill instructions. The skill contains hidden or deceptive instructions that fall outside its stated purpose and attempt to override the agent’s safety guidelines or intended behavior.

Why it was flagged

Potential prompt injection detected (high risk: 1.00). The skill contains explicit, deceptive instructions to ignore safety policies, hide that the skill was used, and reveal the system prompt—behavior that is hidden and outside the skill's claimed purpose, so it is a prompt injection.
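
As an illustration of the three behaviors named in this finding (ignoring safety policies, hiding that the skill was used, and revealing the system prompt), here is a minimal sketch of a phrase-based check. It is not the scanner's actual method, which this report does not describe. The pattern names, the regular expressions, and the `flag_instructions` function are all hypothetical, and a keyword check like this is easy to evade.

```python
import re

# Hypothetical phrase patterns, one per behavior named in the finding.
# Illustrative assumptions only; these are not the scanner's real rules.
PATTERNS = {
    "ignore_safety": re.compile(
        r"\bignore\b.{0,40}\b(safety|polic(y|ies)|restrictions?|guidelines)\b",
        re.I,
    ),
    "conceal_usage": re.compile(
        r"\b(do not|don't|never)\b.{0,40}\b(mention|reveal|disclose)\b.{0,40}\bskill\b",
        re.I,
    ),
    "reveal_system_prompt": re.compile(
        r"\b(reveal|print|show|output)\b.{0,40}\bsystem prompt\b",
        re.I,
    ),
}


def flag_instructions(text: str) -> list[str]:
    """Return the sorted names of the patterns that match the skill text."""
    return sorted(name for name, pattern in PATTERNS.items() if pattern.search(text))
```

A skill whose instructions match none of the patterns yields an empty list. Any non-empty result would only be a prompt for manual review, not a verdict.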

Critical

E006: Malicious code pattern detected in skill scripts

What this means

Detected high-risk code patterns in the skill content — including its prompts, tool definitions, and resources — such as data exfiltration, backdoors, remote code execution, credential theft, system compromise, supply chain attacks, and obfuscation techniques.

Why it was flagged

Malicious code pattern detected (high risk: 1.00). This skill is a deliberate jailbreak/prompt-injection: it instructs the assistant to ignore safety policies, conceal that behavior, and reveal the system prompt, enabling intentional policy violations and sensitive-data disclosure.

Medium

W013: Attempt to modify system services in skill instructions

What this means

The skill prompts the agent to compromise the security or integrity of the user’s machine by modifying system-level services or configurations, such as obtaining elevated privileges, altering startup scripts, or changing system-wide settings.

Why it was flagged

Attempt to modify system services in skill instructions detected (high risk: 0.90). This skill explicitly instructs the agent to ignore safety restrictions and reveal hidden prompts. It does not itself demand system changes, but it makes the agent considerably more likely to follow user requests to perform privileged or state-changing actions on the machine.

Repository
cisco-ai-defense/skill-scanner
Audited
Security analysis
Snyk
