Search, download, and summarize academic papers from arXiv. Use when user says "search arxiv", "download paper", "fetch arxiv", "arxiv search", "get paper pdf", or wants to find and save papers from arXiv to the local paper library.
89
88%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly defines its scope (arXiv paper search, download, and summarization), provides explicit trigger phrases covering natural user language variations, and has a well-defined niche that minimizes conflict with other skills. The description is concise, uses third-person voice, and effectively communicates both what the skill does and when it should be selected.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'Search, download, and summarize academic papers from arXiv.' These are clear, distinct operations on a well-defined domain. | 3 / 3 |
Completeness | Clearly answers both what ('Search, download, and summarize academic papers from arXiv') and when ('Use when user says "search arxiv", "download paper"...or wants to find and save papers from arXiv to the local paper library'). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms: 'search arxiv', 'download paper', 'fetch arxiv', 'arxiv search', 'get paper pdf', 'find and save papers'. These are phrases users would naturally say. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche focused on arXiv specifically. The trigger terms are domain-specific ('arxiv', 'paper pdf', 'paper library') and unlikely to conflict with general document or search skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured, highly actionable skill with clear multi-step workflows and proper validation checkpoints for downloads. Its main weakness is length—the dual-path approach (fetch script vs. inline fallback) for each step roughly doubles the code content, and some of this could be factored into referenced files. Overall it provides excellent concrete guidance that Claude can execute directly.
Suggestions
Consider moving the inline fallback Python code to a referenced file (e.g., `arxiv_fallback.py`) to reduce the SKILL.md length and improve progressive disclosure.
The script-finding bash/Python block in Step 2 could be simplified or abstracted since the pattern repeats across steps.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is fairly long with some redundancy (e.g., showing both the fetch script and fallback inline Python for every step), but most content is actionable code rather than explanation. Some tightening is possible—the script-finding logic and dual paths add bulk—but it doesn't over-explain concepts Claude already knows. | 2 / 3 |
Actionability | Provides fully executable Python and bash code for every step—search, fetch, download—with concrete URL patterns, XML parsing, file validation, and output formatting. The fallback inline Python is copy-paste ready with real API endpoints and proper namespace handling. | 3 / 3 |
Workflow Clarity | Clear 7-step sequential workflow with explicit validation checkpoints: file size verification (>10 KB), rate limiting with retry on HTTP 429, skip-if-exists logic, and error reporting. The feedback loop for failed downloads (warn, delete, retry) is well-defined. | 3 / 3 |
Progressive Disclosure | The skill is a single monolithic file with all code inline. While it references an external integration contract and suggests follow-up skills, the inline Python blocks for both the script path and fallback approach make the file quite long. The fallback code could be referenced from a separate file rather than embedded twice. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
700fbe2
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.