Discover and install skills to enhance your AI agent's capabilities.
| Name | Contains | Score |
|---|---|---|
Use when the user wants an adversarial double-check of a code or config change. Run the strongest checks available, try to break the claim, look for edge cases and hidden regressions, and return PASS, PARTIAL, or FAIL with evidence. Good triggers include "poke holes in this", "stress test this change", "double check this fix", and "try to break it". Contains: skeptic-verifier Use when the user wants an adversarial double-check of a code or config change. Run the strongest checks available such as targeted tests, edge-case repros, command runs, or API calls, try to break the claim, look for hidden regressions, and return PASS, PARTIAL, or FAIL with evidence. Good triggers include "poke holes in this", "stress test this change", "double check this fix", and "try to break it". | Skills | 84 1.30x Agent success vs baseline Impact 81% 1.30xAverage score across 8 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.1.2 |
tessl-labs/audit-logs v0.1.11 Collect and normalize agent logs, discover installed verifiers, and dispatch LLM judges to evaluate adherence. Produces per-session verdicts and aggregated reports. Contains: audit-logs Audit agent behavior by collecting logs, discovering verifiers for agent behavior, and dispatching LLM judges to evaluate rule adherence. Produces per-session verdicts and an aggregated summary. Use when you want to evaluate agent compliance, check rule adherence, or measure how well agents follow your skills and docs. create-verifiers Create structured verifiers (eval rubrics) from skills, docs, rules, or any instruction source. Produces checklist-based grading criteria that LLM judges can score against agent session logs. Use when you want to define tests for agent behavior, build assessment criteria, validate compliance, or check quality of agent responses. friction-review Detect friction points in agent sessions — errors, backtracking, user frustration, repeated failures. Dispatches haiku judges to review transcripts and produces an aggregated friction summary. Can run standalone or as part of the audit-logs pipeline. | Skills | 84 3.09x Agent success vs baseline Impact 96% 3.09xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.1.11 |
testing-strategies supercent-io/skills-template Design comprehensive testing strategies for software quality assurance. Use when planning test coverage, implementing test pyramids, or setting up testing infrastructure. Handles unit testing, integration testing, E2E testing, TDD, and testing best practices. | Skills | 84 1.14x Agent success vs baseline Impact 96% 1.14xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.1 |
figure-legend-gen aipoch/medical-research-skills Generate standardized figure legends for scientific charts and graphs. Trigger when user uploads/requesting legend for research figures, academic papers, or data charts. Supports bar charts, line graphs, scatter plots, box plots, heatmaps, and microscopy images. This tool generates text legends only, not visualizations. | Skills | 84 1.14x Agent success vs baseline Impact 96% 1.14xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.1 |
journal-cover-prompter aipoch/medical-research-skills Use when creating journal cover images, generating scientific artwork prompts, or designing graphical abstracts. Creates detailed prompts for AI image generators to produce publication-quality scientific visuals. | Skills | 84 4.00x Agent success vs baseline Impact 96% 4.00xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.1 |
mocking-apis jeremylongshore/claude-code-plugins-plus-skills Generate mock API servers for testing and development with realistic response data. Use when creating mock APIs for development and testing. Trigger with phrases like "create mock API", "generate API mock", or "setup mock server". | Skills | 84 1.23x Agent success vs baseline Impact 96% 1.23xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.2 |
eks itsmostafa/aws-agent-skills AWS EKS Kubernetes management for clusters, node groups, and workloads. Use when creating clusters, configuring IRSA, managing node groups, deploying applications, or integrating with AWS services. | Skills | 84 1.03x Agent success vs baseline Impact 84% 1.03xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: 0.0.1 |
bedrock itsmostafa/aws-agent-skills AWS Bedrock foundation models for generative AI. Use when invoking foundation models, building AI applications, creating embeddings, configuring model access, or implementing RAG patterns. | Skills | 84 1.03x Agent success vs baseline Impact 100% 1.03xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.1 |
agents-v2-py haniakrim21/everything-claude-code Build container-based Foundry Agents using Azure AI Projects SDK with ImageBasedHostedAgentDefinition. Use when creating hosted agents that run custom code in Azure AI Foundry with your own container images. Triggers: "ImageBasedHostedAgentDefinition", "hosted agent", "container agent", "Foundry Agent", "create_version", "ProtocolVersionRecord", "AgentProtocol.RESPONSES", "custom agent image". | Skills | 84 1.31x Agent success vs baseline Impact 100% 1.31xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: ae2cadd |
api-versioning fernandezbaptiste/rails_ai_agents Implements RESTful API design with versioning and request specs. Use when building APIs, adding API endpoints, versioning APIs, or when user mentions REST, JSON API, or API design. | Skills | 84 1.26x Agent success vs baseline Impact 100% 1.26xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 15fdeaf |
mutation-test-suite-optimizer ArabelaTso/Skills-4-SE Optimize test suites using mutation testing to maximize mutation kill rate with minimal tests. Use when you need to reduce test suite size while maintaining quality, identify redundant tests, improve mutation coverage, or validate test effectiveness. The skill analyzes test coverage, execution intervals, and redundancy using mutation operators, selects or generates a minimal subset of tests that maximizes mutation kill rate, and outputs an optimized test suite with detailed reports showing killed and surviving mutants. Supports multiple programming languages and mutation testing frameworks. | Skills | 84 1.96x Agent success vs baseline Impact 100% 1.96xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0f00a4f |
multi-version-behavior-comparator ArabelaTso/Skills-4-SE Compare behavior across multiple versions of programs or repositories. Use when you need to analyze how functionality changes between versions, identify regressions, compare outputs and exceptions, or validate upgrades. The skill compares execution behavior, test results, outputs, exceptions, and observable states across versions, generating detailed reports showing behavioral divergences, potential regressions, added/removed functionality, and areas requiring validation. Supports multiple programming languages and can work with test suites or execution traces. | Skills | 84 1.58x Agent success vs baseline Impact 100% 1.58xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0f00a4f |
agent-school jamesmoss/skills Investigates a problem area in the codebase and generates a new tessl tile (rules, docs, skills) to teach agents how to handle it correctly. Use when agents keep making the same mistakes around a library, design pattern, or convention. | Skills | 84 1.36x Agent success vs baseline Impact 100% 1.36xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: df78c2e |
api-versioning ThibautBaissac/rails_ai_agents Implements RESTful API design with versioning and request specs. Use when building APIs, adding API endpoints, versioning APIs, or when user mentions REST, JSON API, or API design. | Skills | 84 1.26x Agent success vs baseline Impact 100% 1.26xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 15fdeaf |
gemini jdrhyne/agent-skills Use when the user asks to run Gemini CLI for code review, plan review, or big context (>200k) processing. Ideal for comprehensive analysis requiring large context windows. Uses Gemini 3 Pro by default for state-of-the-art reasoning and coding. | Skills | 84 1.66x Agent success vs baseline Impact 100% 1.66xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: fae44e7 |
gemini softaworks/agent-toolkit Use when the user asks to run Gemini CLI for code review, plan review, or big context (>200k) processing. Ideal for comprehensive analysis requiring large context windows. Uses Gemini 3 Pro by default for state-of-the-art reasoning and coding. | Skills | 84 1.66x Agent success vs baseline Impact 100% 1.66xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 3027f20 |
gemini softaworks/agent-toolkit Use when the user asks to run Gemini CLI for code review, plan review, or big context (>200k) processing. Ideal for comprehensive analysis requiring large context windows. Uses Gemini 3 Pro by default for state-of-the-art reasoning and coding. | Skills | 84 1.66x Agent success vs baseline Impact 100% 1.66xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 3027f20 |
metabolomics-workbench-database K-Dense-AI/claude-scientific-skills Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery. | Skills | 84 1.57x Agent success vs baseline Impact 99% 1.57xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: 0.0.1 |
standup-meeting supercent-io/skills-template Conduct effective daily standup meetings for agile teams. Use when facilitating standups, tracking blockers, or improving team synchronization. Handles standup format, time management, and blocker resolution. | Skills | 84 1.20x Agent success vs baseline Impact 99% 1.20xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.1 |
langfuse-hello-world jeremylongshore/claude-code-plugins-plus-skills Create a minimal working Langfuse trace example. Use when starting a new Langfuse integration, testing your setup, or learning basic Langfuse tracing patterns. Trigger with phrases like "langfuse hello world", "langfuse example", "langfuse quick start", "first langfuse trace", "simple langfuse code". | Skills | 84 1.53x Agent success vs baseline Impact 83% 1.53xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.0.2 |
Can't find what you're looking for? Evaluate a missing skill.