Generate eval scenarios from repo commits, configure multi-agent runs, execute baseline + with-context evals, and compare results — the full setup pipeline before improvement begins
94
94%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Security
1 medium severity finding. This skill can be installed but you should review these findings before use.
The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.
Third-party content exposure detected (high risk: 0.90). The skill explicitly fetches and examines repository commits via `git log --oneline --stat --no-merges -50` or `gh api` (references/phase2-select-commits.md) and then downloads and directly reads user-generated scenario files (`task.md`, `criteria.json`) from `tessl scenario download` (references/phase3-generate-scenarios.md` and phase4), so it ingests untrusted, third-party content that the agent is expected to interpret and that can influence its commit selection, scenario edits, and eval actions.