Search, download, and summarize academic papers from arXiv. Use when user says "search arxiv", "download paper", "fetch arxiv", "arxiv search", "get paper pdf", or wants to find and save papers from arXiv to the local paper library.
64
77%
Does it follow best practices?
Impact: not yet measured (no eval scenarios have been run)
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/arxiv/SKILL.md
Quality
Discovery
100%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly communicates its purpose, lists concrete actions, and provides explicit trigger guidance with natural user phrases. It uses proper third-person voice and covers both the 'what' and 'when' comprehensively. The arXiv-specific focus makes it highly distinctive and unlikely to conflict with other skills.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific concrete actions: 'Search, download, and summarize academic papers from arXiv.' These are clear, distinct operations on a well-defined domain. | 3 / 3 |
| Completeness | Clearly answers both what ('Search, download, and summarize academic papers from arXiv') and when ('Use when user says "search arxiv", "download paper"...or wants to find and save papers from arXiv to the local paper library'). Explicit 'Use when' clause with specific triggers. | 3 / 3 |
| Trigger Term Quality | Includes excellent natural trigger terms users would actually say: 'search arxiv', 'download paper', 'fetch arxiv', 'arxiv search', 'get paper pdf', plus the broader 'find and save papers from arXiv'. Good coverage of variations. | 3 / 3 |
| Distinctiveness Conflict Risk | Very clear niche focused specifically on arXiv papers. The domain (arXiv), actions (search, download, summarize), and trigger terms are all highly specific and unlikely to conflict with other skills like general web search or document processing. | 3 / 3 |
| Total | | 12 / 12 Passed |
Implementation
55%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
The skill is highly actionable with executable code and a well-structured multi-step workflow with proper validation checkpoints. However, it is excessively verbose—the repeated bash resolution chains, full inline Python fallback scripts, and wiki integration logic bloat the content significantly. The monolithic structure with everything inlined rather than split into supporting files hurts both conciseness and progressive disclosure.
Suggestions
- Extract the inline Python fallback scripts and bash resolution chains into separate bundle files (e.g., `fallback_search.py`, `resolve_tool.sh`) and reference them, reducing the main SKILL.md to ~80 lines.
- Move the wiki integration (Step 6) into a separate file since it's a conditional side-effect, not core functionality; a one-line reference like 'See [WIKI_INTEGRATION.md](WIKI_INTEGRATION.md) when research-wiki/ exists' would suffice.
- Consolidate the two nearly-identical tool resolution bash blocks (ARXIV_FETCHER and WIKI_SCRIPT) into a single shared pattern or reference, eliminating the duplication.
- Remove the explanatory prose around the resolution policy references (e.g., 'Policy D1 — primary + fallback cascade', 'Variant B — warn-and-skip'); Claude can follow the referenced contract directly without the inline commentary.
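As a rough illustration of the first suggestion, an extracted `fallback_search.py` might look something like the sketch below. This is not the skill's actual fallback script, which is not shown in this review. The function names (`build_query_url`, `parse_feed`, `search`) and the `all:` search prefix are assumptions made for the example; the endpoint is arXiv's public Atom query API.

```python
"""Hypothetical fallback_search.py: a minimal arXiv search using only the stdlib."""
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"  # public arXiv query endpoint
ATOM = "{http://www.w3.org/2005/Atom}"           # Atom XML namespace prefix


def build_query_url(query, max_results=5):
    """Build the query URL; searching all fields is an assumption of this sketch."""
    params = {"search_query": f"all:{query}", "start": 0, "max_results": max_results}
    return ARXIV_API + "?" + urllib.parse.urlencode(params)


def parse_feed(xml_data):
    """Turn an Atom feed (str or bytes) into a list of {id, title, url} dicts."""
    root = ET.fromstring(xml_data)
    papers = []
    for entry in root.iter(ATOM + "entry"):
        abs_url = entry.findtext(ATOM + "id", default="").strip()
        # Titles in the feed may wrap across lines; collapse the whitespace.
        title = " ".join(entry.findtext(ATOM + "title", default="").split())
        papers.append({"id": abs_url.rsplit("/abs/", 1)[-1], "title": title, "url": abs_url})
    return papers


def search(query, max_results=5):
    """Perform the network request and parse the response."""
    with urllib.request.urlopen(build_query_url(query, max_results), timeout=30) as resp:
        return parse_feed(resp.read())
```

With a file like this in the bundle, SKILL.md would only need a one-line reference to it instead of an inline heredoc. Keeping URL construction and feed parsing separate from the network call also lets both be checked offline.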
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is extremely verbose at ~200+ lines. It includes lengthy bash resolution chains repeated twice (ARXIV_FETCHER and WIKI_SCRIPT), full inline Python fallback scripts, and extensive boilerplate. Much of this could be condensed significantly: Claude doesn't need the full canonical resolution bash scripts spelled out twice, nor does it need the complete inline Python heredoc when the pattern is straightforward. | 1 / 3 |
| Actionability | The skill provides fully executable code throughout: complete Python scripts for searching and downloading, specific bash commands, concrete output formats, and clear examples of argument parsing. The code is copy-paste ready with real arXiv API endpoints and proper error handling patterns. | 3 / 3 |
| Workflow Clarity | The workflow is clearly sequenced across 7 steps with explicit validation checkpoints: file size verification (>10KB), rate limiting with retry logic, skip-if-exists checks, and clear error recovery paths (fallback scripts, retry on 429, suggest alternative skill if API unreachable). The feedback loop for download validation is well-defined. | 3 / 3 |
| Progressive Disclosure | The skill is a monolithic wall of content with no separation of concerns. The full inline Python fallback scripts, bash resolution chains, and wiki integration logic are all inlined rather than being split into referenced files. It references shared-references files (integration-contract.md, wiki-helper-resolution.md) but no bundle files are provided, and the main content that could be in separate files (fallback scripts, wiki integration) is all inline, making the skill overwhelming. | 1 / 3 |
| Total | | 8 / 12 Passed |
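For readers unfamiliar with the checkpoints credited under Workflow Clarity, the download validation loop (skip-if-exists, a size check above 10KB, retry on HTTP 429) could be sketched roughly as follows. This is an illustrative reconstruction, not the skill's own code: the function names, the retry count, the backoff interval, and the injectable `fetch` and `sleep` parameters are all assumptions of the example.

```python
import os
import time
import urllib.error
import urllib.request

MIN_PDF_BYTES = 10 * 1024  # the review mentions a >10KB size check


def fetch_bytes(url, timeout=30):
    """Default fetcher; replaceable in tests or with a different HTTP client."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_pdf(arxiv_id, dest_dir, fetch=fetch_bytes, max_retries=3,
                 backoff_s=5.0, sleep=time.sleep):
    """Return (path, status) where status is 'skipped' or 'downloaded'."""
    os.makedirs(dest_dir, exist_ok=True)
    path = os.path.join(dest_dir, arxiv_id.replace("/", "_") + ".pdf")

    # Skip-if-exists: a previously saved file that passes the size check is reused.
    if os.path.exists(path) and os.path.getsize(path) > MIN_PDF_BYTES:
        return path, "skipped"

    url = f"https://arxiv.org/pdf/{arxiv_id}"
    for attempt in range(max_retries):
        try:
            data = fetch(url)
        except urllib.error.HTTPError as err:
            # Retry only on rate limiting (429), with a linearly growing pause.
            if err.code == 429 and attempt < max_retries - 1:
                sleep(backoff_s * (attempt + 1))
                continue
            raise
        # Size validation: a tiny response is likely an error page, not a PDF.
        if len(data) <= MIN_PDF_BYTES:
            raise ValueError(f"{arxiv_id}: response too small ({len(data)} bytes)")
        with open(path, "wb") as fh:
            fh.write(data)
        return path, "downloaded"
    raise RuntimeError("no download attempts were made")
```

Validating the payload before writing it means a failed download never leaves a truncated file behind that a later skip-if-exists check could mistake for a good one.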
Validation
81%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | 9 / 11 Passed | |
a425a71