MUST be used whenever fixing security issues in a Flows app, or before shipping any feature that handles credentials, user input, or external data. This skill finds AND fixes security problems — it does not just report them. Do NOT skip this when the user asks for a security fix, security hardening, or vulnerability remediation — run every step in order. Triggers: security, security fix, security hardening, vulnerability, XSS, injection, credentials, secrets, auth, authentication, authorization, token, sensitive data, input validation, CORS, CSP, dependency audit.
88
83%
Does it follow best practices?
Impact
100%
1.08xAverage score across 3 eval scenarios
Passed
No known issues
Quality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description with excellent trigger term coverage and clear 'when to use' guidance. Its main weakness is that the 'what it does' portion is somewhat vague — it says it 'finds AND fixes security problems' but doesn't enumerate the specific concrete actions or steps it performs. The imperative/instructional tone ('MUST be used', 'Do NOT skip') is unusual but functional for skill selection.
Suggestions
Add specific concrete actions the skill performs, e.g., 'Scans for XSS vulnerabilities, audits dependencies for known CVEs, validates input sanitization, reviews CORS/CSP configurations, checks credential handling patterns.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description mentions 'finds AND fixes security problems' and references handling 'credentials, user input, or external data,' but it doesn't list multiple specific concrete actions (e.g., what specific fixes it performs, what steps are involved). It names the domain and some actions but isn't comprehensive about the concrete capabilities. | 2 / 3 |
Completeness | Clearly answers both 'what' (finds and fixes security problems in a Flows app) and 'when' (when fixing security issues, before shipping features handling credentials/user input/external data, when user asks for security fix/hardening/vulnerability remediation). Explicit trigger guidance is provided. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms including 'security fix', 'vulnerability', 'XSS', 'injection', 'credentials', 'secrets', 'auth', 'authentication', 'authorization', 'token', 'sensitive data', 'input validation', 'CORS', 'CSP', 'dependency audit'. These are terms users would naturally use when requesting security-related work. | 3 / 3 |
Distinctiveness Conflict Risk | Clearly scoped to security issues in a 'Flows app' context, with distinct security-specific triggers. The combination of domain (Flows app security) and extensive security-specific trigger terms makes it unlikely to conflict with non-security skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, highly actionable security audit skill with excellent workflow clarity and concrete, executable guidance at every step. Its main weakness is length — while most content is valuable, the skill could benefit from splitting detailed fix patterns into referenced files to reduce the token footprint of the main SKILL.md. Minor verbosity in explanatory passages could also be trimmed.
Suggestions
Extract detailed fix patterns (e.g., the CDF SDK migration table, Zod validation examples, CSP header templates) into separate referenced files to reduce the main skill's token footprint.
Trim explanatory text that Claude already knows — e.g., remove 'Never pass user-controlled strings to code evaluation' and similar general security advice, keeping only the specific fix instructions.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is fairly long but most content earns its place — the grep commands, fix patterns, and code examples are all actionable. However, some sections include explanatory text that Claude doesn't need (e.g., explaining what CDF is, why secrets are bad, what DOMPurify does). The 'What is acceptable' subsection in Step 2 and some of the table entries could be tightened. | 2 / 3 |
Actionability | Excellent actionability throughout. Every step includes concrete grep/bash commands to find issues, specific code examples showing before/after fixes, exact package install commands (pnpm add dompurify, pnpm add zod), and copy-paste ready configuration blocks (CSP headers, Zod schemas, DOMPurify usage). | 3 / 3 |
Workflow Clarity | The 9-step workflow is clearly sequenced with a logical progression (map surface → fix SDK usage → credentials → DOM → auth → validation → config → dependencies → report). Each step includes both detection and remediation. Step 9 provides a validation/reporting checkpoint with a summary table. The 'find AND fix' mandate with the final report of unfixable items creates a clear feedback loop. | 3 / 3 |
Progressive Disclosure | The content is a single monolithic file with no references to supporting documents, which is understandable given no bundle files exist. However, at ~250+ lines, some sections (like the CDF SDK migration table, the Vite config details, or the Zod validation examples) could benefit from being split into referenced files. The structure within the file is well-organized with clear headers and numbered steps. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
metadata_version | 'metadata.version' is missing | Warning |
Total | 9 / 11 Passed | |
d6af887
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.