paper-figure

Generate publication-quality figures and tables from experiment results. Use when user says "画图", "作图", "generate figures", "paper figures", or needs plots for a paper.

Quality

77%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/paper-figure/SKILL.md

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid description that clearly communicates its purpose and includes explicit trigger guidance with bilingual keywords. Its main weakness is that the 'what' portion could be more specific about the concrete types of figures and tables it can generate. Overall, it performs well for skill selection in a multi-skill environment.

Suggestions

Add more specific concrete actions, e.g., 'Create bar charts, line plots, heatmaps, scatter plots, and formatted statistical tables from experiment results.'

Dimension	Reasoning	Score
Specificity	It names the domain (publication-quality figures and tables from experiment results) and mentions some actions (generate figures, plots), but doesn't list multiple specific concrete actions like 'create bar charts, line plots, heatmaps, format tables with statistical annotations'.	2 / 3
Completeness	Clearly answers both 'what' (generate publication-quality figures and tables from experiment results) and 'when' (explicit 'Use when' clause with specific trigger phrases and a general condition about needing plots for a paper).	3 / 3
Trigger Term Quality	Includes both Chinese ('画图', '作图') and English natural keywords ('generate figures', 'paper figures', 'plots for a paper') that users would naturally say. Good coverage of bilingual trigger terms.	3 / 3
Distinctiveness Conflict Risk	The combination of 'publication-quality', 'experiment results', 'paper figures', and Chinese trigger terms creates a clear niche that is unlikely to conflict with general plotting or data visualization skills.	3 / 3
	Total	11 / 12 Passed

Implementation

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted skill with strong actionability — the executable code examples, LaTeX snippets, and concrete workflow steps make it immediately useful for generating publication-quality figures. Its main weaknesses are moderate verbosity (duplicate reference tables, verbose scope section, acknowledgements) and the lack of explicit error recovery steps in the workflow. The content would benefit from splitting detailed examples into separate reference files given its length.

Suggestions

Add explicit error handling/feedback loops in Step 5: what to do when a script fails, how to debug common matplotlib issues, and a retry pattern.

Remove the duplicate 'Figure Type Reference' table at the bottom since Step 3's decision tree already covers the same information, or consolidate them into one.

Split detailed code examples (bar charts, line plots, tables) into a separate EXAMPLES.md file and keep only one representative example in the main skill body to improve conciseness and progressive disclosure.

Fix the inconsistency between REVIEWER_MODEL constant (gpt-5.5) and the Step 7 text mentioning 'GPT-5.4'.

Dimension	Reasoning	Score
Conciseness	The skill is fairly comprehensive but includes some unnecessary content: the scope table explaining what it can/cannot do is somewhat verbose, the acknowledgements section adds no actionable value, and the figure type reference table at the bottom largely duplicates the decision tree in Step 3. The constants section includes a 'REVIEWER_MODEL = gpt-5.5' but Step 7 references 'GPT-5.4' inconsistently. However, most content is actionable rather than explanatory.	2 / 3
Actionability	The skill provides fully executable Python code for multiple figure types (line plots, bar charts), complete LaTeX snippets for tables and figure includes, a ready-to-use style configuration script, and concrete bash commands for running scripts. Examples are copy-paste ready with realistic data patterns.	3 / 3
Workflow Clarity	The 8-step workflow is clearly sequenced and covers the full pipeline from plan reading to quality review. However, there are no explicit validation/feedback loops for error recovery — Step 5 says 'verify all output files exist' but doesn't specify what to do if they don't. The quality checklist in Step 8 is good but is a post-hoc check rather than an integrated validation checkpoint. For a workflow involving file generation and script execution, the lack of error handling caps this at 2.	2 / 3
Progressive Disclosure	The content is well-structured with clear sections and headers, but it's quite long (~250 lines) and monolithic — the detailed code examples for each figure type, the scope table, and the duplicate reference tables could be split into separate files. No bundle files are provided, so there's no progressive disclosure to external references despite the content length warranting it.	2 / 3
	Total	9 / 12 Passed

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 9 / 11 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	9 / 11 Passed

Repository: wanshuiyin/Auto-claude-code-research-in-sleep
Commit: a425a71

Reviewed: 4 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.