Systematically explore and test a web application to find bugs, UX issues, and other problems. Use when asked to "dogfood", "QA", "exploratory test", "find issues", "bug hunt", "test this app/site/platform", or review the quality of a web application. Produces a structured report with full reproduction evidence -- step-by-step screenshots, repro videos, and detailed repro steps for every issue -- so findings can be handed directly to the responsible teams.
87
85%
Does it follow best practices?
Impact
87%
1.97xAverage score across 3 eval scenarios
Risky
Do not use without reviewing
Security
2 findings — 1 high severity, 1 medium severity. You should review these findings carefully before considering using this skill.
The skill handles credentials insecurely by requiring the agent to include secret values verbatim in its generated output. This exposes credentials in the agent’s context and conversation history, creating a risk of data exfiltration.
Insecure credential handling detected (high risk: 1.00). The prompt explicitly instructs the agent to request and insert user credentials and OTPs into commands (e.g., agent-browser ... fill "{PASSWORD}"), which requires the LLM to handle and output secrets verbatim, creating an exfiltration risk.
The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.
Third-party content exposure detected (high risk: 0.90). The SKILL.md explicitly instructs the agent to open and systematically explore an arbitrary Target URL (e.g., "agent-browser --session {SESSION} open {TARGET_URL}" and use snapshot/read commands to read page content and drive navigation), so it fetches and interprets untrusted third‑party web content that can influence subsequent actions.
fa043a4
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.