github.com/answerzhao/agent-skills
Skill | Added | Review |
|---|---|---|
image-generation Implement AI image generation capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to create images from text descriptions, generate visual content, create artwork, design assets, or build applications with AI-powered image creation. Supports multiple image sizes and returns base64 encoded images. Also includes CLI tool for quick image generation. | 69 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed | |
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale. | 95 2.48x Agent success vs baseline Impact 97% 2.48xAverage score across 10 eval scenarios Securityby Risky Do not use without reviewing Reviewed: Version: aad73ed | |
web-search Implement web search capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to search the web, retrieve current information, find relevant content, or build applications with real-time web search functionality. Returns structured search results with URLs, snippets, and metadata. | 69 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: aad73ed | |
frontend-design Transform UI style requirements into production-ready frontend code with systematic design tokens, accessibility compliance, and creative execution. Use when building websites, web applications, React/Vue components, dashboards, landing pages, or any web UI requiring both design consistency and aesthetic quality. | 65 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed | |
web-reader Implement web page content extraction capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to scrape web pages, extract article content, retrieve page metadata, or build applications that process web content. Supports automatic content extraction with title, HTML, and publication time retrieval. | 73 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: aad73ed | |
VLM Implement vision-based AI chat capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to analyze images, describe visual content, or create applications that combine image understanding with conversational AI. Supports image URLs and base64 encoded images for multimodal interactions. | 17 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: aad73ed | |
pptx Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks | 81 1.48x Agent success vs baseline Impact 80% 1.48xAverage score across 10 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: aad73ed | |
canvas-design Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations. | 94 1.76x Agent success vs baseline Impact 99% 1.76xAverage score across 10 eval scenarios Reviewed: Version: aad73ed | |
TTS Implement text-to-speech (TTS) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to convert text into natural-sounding speech, create audio content, build voice-enabled applications, or generate spoken audio files. Supports multiple voices, adjustable speed, and various audio formats. | 17 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed | |
docx Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. When Claude needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks | 92 1.28x Agent success vs baseline Impact 100% 1.28xAverage score across 5 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: aad73ed | |
LLM Implement large language model (LLM) chat completions using the z-ai-web-dev-sdk. Use this skill when the user needs to build conversational AI applications, chatbots, AI assistants, or any text generation features. Supports multi-turn conversations, system prompts, and context management. | 17 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed | |
xlsx Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas | 77 1.35x Agent success vs baseline Impact 76% 1.35xAverage score across 10 eval scenarios Securityby Passed No known issues Reviewed: Version: aad73ed | |
Video Generation Implement AI-powered video generation capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to generate videos from text prompts or images, create video content programmatically, or build applications that produce video outputs. Supports asynchronous task management with status polling and result retrieval. | 17 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed | |
ASR Implement speech-to-text (ASR/automatic speech recognition) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to transcribe audio files, convert speech to text, build voice input features, or process audio recordings. Supports base64 encoded audio files and returns accurate text transcriptions. | 17 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: aad73ed |