Designs and implements production-grade RAG systems by chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality. Use when building RAG systems, vector databases, or knowledge-grounded AI applications requiring semantic search, document retrieval, context augmentation, similarity search, or embedding-based indexing.
98
100%
Does it follow best practices?
Impact
97%
1.08xAverage score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that follows best practices. It uses third person voice, lists comprehensive specific actions, includes an explicit 'Use when...' clause with natural trigger terms, and carves out a distinct technical niche. The description effectively balances technical precision with natural language that users would actually employ.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality' - comprehensive and detailed. | 3 / 3 |
Completeness | Clearly answers both what (designs and implements RAG systems with specific techniques) AND when (explicit 'Use when...' clause with multiple trigger scenarios including RAG systems, vector databases, and knowledge-grounded AI applications). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'RAG systems', 'vector databases', 'semantic search', 'document retrieval', 'context augmentation', 'similarity search', 'embedding-based indexing' - these are terms practitioners naturally use. | 3 / 3 |
Distinctiveness Conflict Risk | Clear niche focused specifically on RAG/retrieval systems with distinct technical triggers like 'vector databases', 'embeddings', 'reranking' - unlikely to conflict with general document processing or other AI skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
100%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is an exemplary skill file that demonstrates excellent RAG architecture guidance. It combines concise, executable code examples with clear workflow sequencing and explicit validation checkpoints. The progressive disclosure through the reference table and the comprehensive MUST DO/MUST NOT DO constraints make this highly actionable for production RAG implementations.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is lean and efficient, assuming Claude's competence with RAG concepts. No unnecessary explanations of what embeddings or vector databases are—it jumps straight to actionable implementation patterns. | 3 / 3 |
Actionability | Provides fully executable Python code for all major operations (chunking, embedding, hybrid search, reranking, evaluation). Code is copy-paste ready with real library imports and complete function implementations. | 3 / 3 |
Workflow Clarity | Clear 5-step workflow with explicit validation checkpoints after each implementation section (assert statements). Includes specific pass/fail thresholds for evaluation metrics before proceeding to LLM integration. | 3 / 3 |
Progressive Disclosure | Well-structured with a reference table pointing to one-level-deep detailed guides for specific topics. Main content provides quick-start implementations while deferring deep dives to clearly signaled reference files. | 3 / 3 |
Total | 12 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
5b76101
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.