CtrlK
BlogDocsLog inGet started
Tessl Logo

common-llm-security

OWASP LLM Top 10 (2025) audit checklist for AI applications, agent tools, RAG pipelines, and prompt construction. Use when performing any security review touching LLM client code, prompt templates, agent tools, or vector stores. (triggers: LLM security, prompt injection, agent security, RAG security, AI security, openai, anthropic, langchain, LLM review)

90

Quality

87%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

OWASP LLM Top 10 Security Checklist (2025)

Priority: P0 (CRITICAL)

Implementation Guidelines

  • Check LLM01 first: Prompt injection is the #1 LLM finding — any user input concatenated directly into a prompt string is an immediate P0.
  • Check LLM06 next: Agent tools with write/delete/execute capabilities without confirmation are P0.
  • Mark each item: ✅ not affected | ⚠️ needs review | 🔴 confirmed finding.
  • P0 finding caps Security score at 40/100 — do not skip any item.
  • See references/owasp-llm.md for full detection signals.

OWASP LLM Top 10 (2025)

IDRiskKey Detection Signal
LLM01Prompt InjectionUser input string-concatenated into prompt. Retrieved docs inserted into system turn.
LLM02Sensitive Information DisclosurePII or credentials passed into prompt context. LLM response logged without redaction.
LLM03Supply ChainUnverified model weights or plugins. Third-party agent added without trust review.
LLM04Data & Model PoisoningUser-controlled data written to training sets or embedding stores without validation.
LLM05Improper Output HandlingLLM output used directly in DOM sink, SQL query, shell command, or redirect URL.
LLM06Excessive AgencyAgent tool with write/delete/network access — no human-in-the-loop confirmation.
LLM07System Prompt LeakageSystem prompt content returned via tool output, error message, or API response.
LLM08Vector & Embedding WeaknessesUser text injected into vector store without sanitization. No tenant namespace isolation.
LLM09MisinformationLLM output used for critical decisions (medical, financial, legal) without verification.
LLM10Unbounded ConsumptionNo max_tokens on LLM call. No rate limit on invocations. Agent loop without depth cap.

Anti-Patterns

  • No prompt concat: Pass user input as a separate user turn, never interpolated into system prompts.
  • No raw LLM output in sinks: Sanitize LLM responses before writing to DOM, queries, or shell.
  • No uncapped agent loops: Every agentic recursion must enforce a max iteration/depth limit.

References

Repository
HoangNguyen0403/agent-skills-standard
Last updated
Created

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.