Designs and implements production-grade RAG systems by chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality. Use when building RAG systems, vector databases, or knowledge-grounded AI applications requiring semantic search, document retrieval, context augmentation, similarity search, or embedding-based indexing.
95
92%
Does it follow best practices?
Impact
97%
1.08xAverage score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly articulates specific capabilities (chunking, embedding, vector store configuration, hybrid search, reranking, evaluation) and provides explicit trigger guidance via a well-constructed 'Use when...' clause. The description uses proper third-person voice, covers a wide range of natural trigger terms, and occupies a distinct technical niche that minimizes conflict risk with other skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality. These are all distinct, well-defined technical operations. | 3 / 3 |
Completeness | Clearly answers both 'what' (designs and implements production-grade RAG systems with specific sub-tasks) and 'when' (explicit 'Use when...' clause listing multiple trigger scenarios like building RAG systems, vector databases, or knowledge-grounded AI applications). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'RAG systems', 'vector databases', 'semantic search', 'document retrieval', 'context augmentation', 'similarity search', 'embedding-based indexing', 'knowledge-grounded AI applications'. These cover both common and technical variations a user might use. | 3 / 3 |
Distinctiveness Conflict Risk | Occupies a clear niche around RAG/retrieval-augmented generation systems with highly specific triggers like 'vector databases', 'reranking', 'hybrid search pipelines', and 'embedding-based indexing' that are unlikely to conflict with general coding or document processing skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
85%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, well-structured RAG skill that provides executable code examples for every major pipeline component, clear workflow sequencing with validation checkpoints, and excellent progressive disclosure via a reference table. The main weakness is moderate verbosity in the constraints section, where several items state general engineering best practices that Claude already knows. Overall, it's highly actionable and well-organized for production use.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is mostly efficient with good code examples and structured sections, but includes some unnecessary guidance that Claude would already know (e.g., 'Evaluate multiple embedding models on your domain data before committing' and several MUST NOT items are general best practices). The constraints section is somewhat verbose with items that are standard engineering practice rather than RAG-specific operational guidance. | 2 / 3 |
Actionability | Provides fully executable, copy-paste-ready Python code for all major RAG components: chunking with LangChain, embedding with OpenAI, indexing with Qdrant, hybrid search with BM25+vector, reranking with Cohere, and evaluation with RAGAS. Each example includes concrete imports, function signatures, and realistic parameters. | 3 / 3 |
Workflow Clarity | The 5-step core workflow is clearly sequenced with explicit validation checkpoints after each implementation step (assert statements for metadata, deduplication, search results, and metric thresholds). The instruction 'validate before moving on' with concrete checkpoint assertions provides proper feedback loops for this multi-step pipeline. | 3 / 3 |
Progressive Disclosure | Excellent use of a reference table with clear 'Load When' conditions pointing to one-level-deep reference files for vector databases, embedding models, chunking strategies, retrieval optimization, and evaluation. The main skill provides a concise overview with executable examples while deferring detailed guidance to well-signaled reference documents. | 3 / 3 |
Total | 11 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
3d95bb1
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.