Debugs a failing production call, reproduces the bug with Cekura evaluators, implements a fix, verifies it, runs regression tests, then raises a PR with evidence. Use when the user wants to fix a production call bug, investigate a failing prod call, reproduce and fix a production issue, run regression tests before a PR, or says things like "fix this prod call issue", "debug and fix call ID", "test my fix against prod scenarios", "reproduce this production bug", or "regression test before raising PR".
75
92%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Security
2 findings — 2 medium severity. This skill can be installed but you should review these findings before use.
The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.
Third-party content exposure detected (high risk: 0.85). The workflow ingests the production call transcript (`transcript_object` array of turns with `role + content`) fetched at runtime via `get_call "CALL_ID"` (outsider-authored free text from the caller), and that transcript content is then replayed verbatim into the evaluator’s conditional actions in Phase 2, which the agent/LLM uses as input.
The skill fetches instructions or code from an external URL at runtime, and the fetched content directly controls the agent’s prompts or executes code. This dynamic dependency allows the external source to modify the agent’s behavior without any changes to the skill itself.
Potentially malicious external URL detected (high risk: 0.90). The skill instructs fetching the agent record at runtime via GET /test_framework/v1/ai-agents/{metadata.agent_id}/ to extract the agent "description" (the system prompt) which is then used as the evaluator/system instructions—i.e., remote content fetched at runtime directly controls prompts, so this URL is a required runtime dependency.
7a49e22
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.