result-to-claim

Use when experiments complete to judge what claims the results support, what they don't, and what evidence is still missing. A secondary Codex agent evaluates results against intended claims and routes to next action (pivot, supplement, or confirm). Use after experiments finish — before writing the paper or running ablations.

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

1 medium severity finding. This skill can be installed but you should review these findings before use.

Medium

W011: Third-party content exposure detected (indirect prompt injection risk)

What this means

The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.

Why it was flagged

Third-party content exposure detected (medium risk: 0.65). SKILL.md Step 1 collects results from potentially outsider-authored sources (e.g., W&B run history, log files, and legacy docs), then Step 2 pastes those collected results verbatim into the secondary Codex agent’s `message` (free text) at runtime, creating an indirect prompt-injection path.

Report incorrect finding

Repository: wanshuiyin/Auto-claude-code-research-in-sleep
Commit: fe5963c

Audited: about 16 hours ago
Security analysis

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.