arxiv

Search, download, and summarize academic papers from arXiv. Use when user says "search arxiv", "download paper", "fetch arxiv", "arxiv search", "get paper pdf", or wants to find and save papers from arXiv to the local paper library.

Quality

77%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Security

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/arxiv/SKILL.md

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that clearly communicates its purpose, lists concrete actions, and provides explicit trigger guidance with natural user phrases. It uses proper third-person voice and covers both the 'what' and 'when' comprehensively. The arXiv-specific focus makes it highly distinctive and unlikely to conflict with other skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'Search, download, and summarize academic papers from arXiv.' These are clear, distinct operations on a well-defined domain.	3 / 3
Completeness	Clearly answers both what ('Search, download, and summarize academic papers from arXiv') and when ('Use when user says "search arxiv", "download paper"...or wants to find and save papers from arXiv to the local paper library'). Explicit 'Use when' clause with specific triggers.	3 / 3
Trigger Term Quality	Includes excellent natural trigger terms users would actually say: 'search arxiv', 'download paper', 'fetch arxiv', 'arxiv search', 'get paper pdf', plus the broader 'find and save papers from arXiv'. Good coverage of variations.	3 / 3
Distinctiveness Conflict Risk	Very clear niche focused specifically on arXiv papers. The domain (arXiv), actions (search, download, summarize), and trigger terms are all highly specific and unlikely to conflict with other skills like general web search or document processing.	3 / 3
	Total	12 / 12 Passed

Implementation

55%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill is highly actionable with executable code and a well-structured multi-step workflow with proper validation checkpoints. However, it is excessively verbose—the repeated bash resolution chains, full inline Python fallback scripts, and wiki integration logic bloat the content significantly. The monolithic structure with everything inlined rather than split into supporting files hurts both conciseness and progressive disclosure.

Suggestions

Extract the inline Python fallback scripts and bash resolution chains into separate bundle files (e.g., `fallback_search.py`, `resolve_tool.sh`) and reference them, reducing the main SKILL.md to ~80 lines.

Move the wiki integration (Step 6) into a separate file since it's a conditional side-effect, not core functionality—a one-line reference like 'See [WIKI_INTEGRATION.md](WIKI_INTEGRATION.md) when research-wiki/ exists' would suffice.

Consolidate the two nearly-identical tool resolution bash blocks (ARXIV_FETCHER and WIKI_SCRIPT) into a single shared pattern or reference, eliminating the duplication.

Remove the explanatory prose around the resolution policy references (e.g., 'Policy D1 — primary + fallback cascade', 'Variant B — warn-and-skip')—Claude can follow the referenced contract directly without the inline commentary.
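
The consolidation suggested above might look roughly like the following. This is a minimal sketch, not the skill's actual resolution logic: the review does not show the real search order, so the function names `resolve_tool` and `resolve_or_warn` and every candidate path are hypothetical. It only illustrates how one shared helper could cover both a primary-plus-fallback cascade and a warn-and-skip variant.

```python
from pathlib import Path
from typing import Iterable, Optional


def resolve_tool(candidates: Iterable[Path]) -> Optional[Path]:
    """Return the first candidate that exists as a file, else None.

    The candidate list is an assumption; the real skill defines its own
    lookup order for ARXIV_FETCHER and WIKI_SCRIPT.
    """
    for path in candidates:
        if path.is_file():
            return path
    return None


def resolve_or_warn(name: str, candidates: Iterable[Path]) -> Optional[Path]:
    """Resolve a helper; print a warning and return None if it is missing."""
    found = resolve_tool(candidates)
    if found is None:
        print(f"warning: {name} not found; skipping")
    return found
```

A caller would pass one candidate list per tool, for example a project-local path followed by a user-level path, and fall back to the inline script (or skip the wiki step) when the result is `None`.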

Dimension	Reasoning	Score
Conciseness	The skill is extremely verbose at roughly 200 lines or more. It includes lengthy bash resolution chains that appear twice (ARXIV_FETCHER and WIKI_SCRIPT), full inline Python fallback scripts, and extensive boilerplate. Much of this could be condensed: Claude doesn't need the full canonical resolution bash scripts spelled out twice, nor the complete inline Python heredoc when the pattern is straightforward.	1 / 3
Actionability	The skill provides fully executable code throughout—complete Python scripts for searching and downloading, specific bash commands, concrete output formats, and clear examples of argument parsing. The code is copy-paste ready with real arXiv API endpoints and proper error handling patterns.	3 / 3
Workflow Clarity	The workflow is clearly sequenced across 7 steps with explicit validation checkpoints: file size verification (>10KB), rate limiting with retry logic, skip-if-exists checks, and clear error recovery paths (fallback scripts, retry on 429, suggest alternative skill if API unreachable). The feedback loop for download validation is well-defined.	3 / 3
Progressive Disclosure	The skill is a monolithic wall of content with no separation of concerns. The inline Python fallback scripts, bash resolution chains, and wiki integration logic all sit in the main file rather than in referenced files. The skill does reference shared-references files (integration-contract.md, wiki-helper-resolution.md), but it provides no bundle files of its own, which makes the main file overwhelming.	1 / 3
	Total	8 / 12 Passed
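
The validation checkpoints named in the Workflow Clarity row (skip-if-exists, the >10KB size check, and retry on 429) can be sketched as below. This is an illustration under stated assumptions, not the skill's code: `download_with_checks`, the injected `fetch` callable, the retry count, and the wait times are all hypothetical, and treating an existing file as valid only when it passes the size check is a design choice made here rather than something the review states.

```python
import time
from pathlib import Path
from typing import Callable, Tuple

MIN_PDF_BYTES = 10 * 1024  # the review mentions a >10KB size check


def download_with_checks(
    dest: Path,
    fetch: Callable[[], Tuple[int, bytes]],
    max_retries: int = 3,
    wait_seconds: float = 3.0,
) -> str:
    """Return 'skipped' or 'downloaded'; raise RuntimeError on failure.

    `fetch` is a hypothetical callable returning (http_status, body), so
    the HTTP client stays out of this sketch.
    """
    # Skip-if-exists: assume an existing file is good only if it is large enough.
    if dest.exists() and dest.stat().st_size > MIN_PDF_BYTES:
        return "skipped"
    for attempt in range(max_retries):
        status, body = fetch()
        if status == 429:
            # Rate limited: wait a little longer on each attempt, then retry.
            time.sleep(wait_seconds * (attempt + 1))
            continue
        if status != 200:
            raise RuntimeError(f"unexpected HTTP status {status}")
        if len(body) <= MIN_PDF_BYTES:
            # Nothing is written to disk when the size check fails.
            raise RuntimeError("download too small to be a valid PDF")
        dest.write_bytes(body)
        return "downloaded"
    raise RuntimeError("rate limited: retries exhausted")
```

Passing the fetch step in as a callable keeps the checks testable without network access; a real caller would wrap whatever HTTP client the skill already uses.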

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 9 / 11 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning
	Total	9 / 11 Passed

Repository: wanshuiyin/Auto-claude-code-research-in-sleep
Commit: a425a71

Reviewed: 2 days ago
