CtrlK
BlogDocsLog inGet started
Tessl Logo

ainativedev/latest-aidevcon-speakers-london-2026

AI Native DevCon 2026 London — all conference sessions as interactive skills

71

Quality

89%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Risky

Do not use without reviewing

Overview
Quality
Evals
Security
Files

transcript.mdtalk-tal-skills-security/

Safety-Redacted Transcript - Skills Security

This public transcript is intentionally redacted. The source talk contained live demonstrations and concrete examples of unsafe skill behavior. Those sections have been replaced with non-operational summaries so the talk can be used for defensive education without publishing harmful mechanics.

Speaker And Context

Liran Tal, speaking from a security research perspective, discusses the emerging security model around agent skills. The introducer presents him as a secure-coding expert with a background in JavaScript and developer security.

Core Thesis

Tal argues that agent skills should be treated as supply-chain components, not harmless documentation. A skill can contain instructions, references, supporting files, and natural-language behavior that an agent may trust once it is installed in a workspace.

The talk compares the early skill ecosystem to early package-registry growth: fast adoption, easy publishing, weak review habits, and limited integrity controls. Tal's concern is that natural-language artifacts can encode unsafe behavior in ways that traditional pattern matching may miss.

Risk Model

The talk's central model is the combination of three conditions:

  1. The agent can access private context.
  2. The agent can read untrusted content.
  3. The agent can communicate outside the local task boundary.

Tal warns that any combination of these conditions increases risk, and all three together can create a serious failure mode. Shell access, persistent memory, broad tool permissions, and user approval fatigue can amplify the problem.

Review Habits

Tal challenges the audience to ask whether they actually read installed skills, supporting files, and updates. The point is that trusting a workspace often means the agent inherits trust in the files inside it. If a user approves broad permissions repeatedly, the approval step can stop acting as a meaningful boundary.

Demonstration Sections Redacted

The original talk included several demonstrations of unsafe skill behavior and unsafe agent workflows. The public version does not include:

  • concrete harmful instructions,
  • runnable examples,
  • hidden-instruction techniques,
  • sensitive-data misuse mechanics,
  • third-party delivery details,
  • step-by-step reproduction notes,
  • or exact operational wording from the demonstrations.

The safe lesson from those sections is that skill review must cover both obvious code-like content and natural-language instructions, including bundled references and future updates.

Defensive Takeaways

  • Treat skills as executable influence over an agent, even when they are markdown.
  • Review skill updates, not just first installs.
  • Avoid running agents with broad unchecked permissions.
  • Keep private data away from agents unless the task explicitly requires it.
  • Treat issue text, emails, web pages, and repository content as untrusted input.
  • Add real boundaries: least privilege, isolated execution, outbound controls, logging, and provenance checks.

Limits Of This Bundle

This redacted bundle supports conceptual discussion, security reviews, and policy design. It does not support reproducing the original demos or extracting operational attack details.

README.md

tile.json