extract-from-pdfs

This skill should be used when extracting structured data from scientific PDFs for systematic reviews, meta-analyses, or database creation. Use when working with collections of research papers that need to be converted into analyzable datasets with validation metrics.

Quality

83%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured, highly actionable skill for a complex multi-step pipeline. Its greatest strength is the complete, executable pipeline commands with clear sequencing and validation steps. The main weakness is moderate verbosity—several sections duplicate information already presented (Available Scripts repeats the pipeline, Assets repeats earlier references), and the 'When to Use This Skill' section explains things Claude can infer. Trimming redundant sections and moving reference lists to bundle files would improve token efficiency.

Suggestions

Remove the 'Available Scripts' and 'Assets' sections, which duplicate information already presented in the pipeline and earlier sections—or move them to a reference file.

Trim the 'When to Use This Skill' section to just the 'Do not use for' bullets, as Claude can infer appropriate use cases from the content itself.

Dimension	Reasoning	Score
Conciseness	The skill is mostly efficient but includes some unnecessary sections like 'When to Use This Skill' (Claude can infer this), the 'Available Scripts' section which duplicates the pipeline already shown, and the 'Assets' section which largely repeats earlier references. The cost optimization and quality assurance sections add value but could be tighter.	2 / 3
Actionability	The skill provides fully executable, copy-paste ready commands for every pipeline step with specific flags and arguments. Schema customization, API configuration, and filtering customization all include concrete file paths and actionable instructions. The complete pipeline section is exemplary.	3 / 3
Workflow Clarity	The 6-step pipeline (plus 3 validation steps) is clearly sequenced with explicit step numbers, each with concrete commands. The iterative improvement section provides a feedback loop (extract → validate → analyze → revise → re-extract → compare). The validation workflow serves as an explicit checkpoint before proceeding to publication.	3 / 3
Progressive Disclosure	The skill references four guides in `references/` directory and provides `cat` commands to access them, which is good structure. However, without bundle files to verify these exist, and given that the SKILL.md itself is quite long (~200 lines) with sections like 'Available Scripts' and 'Assets' that could be in reference files, the inline content could be better distributed. Some content is duplicated between sections.	2 / 3
	Total	10 / 12 Passed

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid description with clear 'when' guidance and strong domain-specific trigger terms that would help Claude distinguish it from general PDF or data extraction skills. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed (e.g., extracting study characteristics, effect sizes, sample sizes, parsing tables). The description effectively carves out a distinct niche in scientific literature processing.

Suggestions

Add specific concrete actions like 'extracts study characteristics, effect sizes, sample sizes, and outcome measures from research papers' to improve specificity.

Dimension	Reasoning	Score
Specificity	The description names the domain (scientific PDFs, systematic reviews) and mentions some actions (extracting structured data, converting to analyzable datasets), but doesn't list multiple specific concrete actions like parsing tables, extracting citations, identifying methodology sections, etc.	2 / 3
Completeness	Clearly answers both what ('extracting structured data from scientific PDFs... converted into analyzable datasets with validation metrics') and when ('Use when working with collections of research papers that need to be converted into analyzable datasets'), with explicit trigger guidance via 'Use when' clause.	3 / 3
Trigger Term Quality	Good coverage of natural terms users would say: 'scientific PDFs', 'systematic reviews', 'meta-analyses', 'research papers', 'structured data', 'database creation', 'datasets', 'validation metrics'. These are terms researchers would naturally use when requesting this kind of help.	3 / 3
Distinctiveness Conflict Risk	The combination of scientific PDFs, systematic reviews, meta-analyses, and validation metrics creates a very clear niche that is unlikely to conflict with general PDF extraction skills or generic data processing skills.	3 / 3
	Total	11 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: brunoasm/my_claude_skills
Commit: d69f3ce

Reviewed: about 1 month ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.