Generates images and text via reverse-engineered Gemini Web API. Supports text generation, image generation from prompts, reference images for vision input, and multi-turn conversations. Use when other skills need image generation backend, or when user requests "generate image with Gemini", "Gemini text generation", or needs vision-capable AI generation.
66
82%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Security
1 medium severity finding. This skill can be installed but you should review these findings before use.
The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.
Third-party content exposure detected (high risk: 0.80). The skill makes HTTP requests to public Gemini/Google endpoints (e.g., Endpoint.GENERATE/BATCH_EXEC in scripts/gemini-webapi/constants.ts and client.ts) and parses the returned model/candidate content (client.ts, parsing.ts) including web image URLs which it then downloads (scripts/gemini-webapi/types/image.ts and main.ts), so untrusted third‑party content (model responses, web image URLs and potentially custom "gems") is ingested and directly influences the agent's outputs/behavior.
bd5745f
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.