Generate publication-quality figures and tables from experiment results. Use when user says "画图", "作图", "generate figures", "paper figures", or needs plots for a paper.
64
77%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/paper-figure/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a solid description that clearly communicates its purpose and includes explicit trigger guidance with bilingual keywords. Its main weakness is that the 'what' portion could be more specific about the concrete types of figures and tables it can generate. Overall, it performs well for skill selection in a multi-skill environment.
Suggestions
Add more specific concrete actions, e.g., 'Create bar charts, line plots, heatmaps, scatter plots, and formatted statistical tables from experiment results.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | It names the domain (publication-quality figures and tables from experiment results) and mentions some actions (generate figures, plots), but doesn't list multiple specific concrete actions like 'create bar charts, line plots, heatmaps, format tables with statistical annotations'. | 2 / 3 |
Completeness | Clearly answers both 'what' (generate publication-quality figures and tables from experiment results) and 'when' (explicit 'Use when' clause with specific trigger phrases and a general condition about needing plots for a paper). | 3 / 3 |
Trigger Term Quality | Includes both Chinese ('画图', '作图') and English natural keywords ('generate figures', 'paper figures', 'plots for a paper') that users would naturally say. Good coverage of bilingual trigger terms. | 3 / 3 |
Distinctiveness Conflict Risk | The combination of 'publication-quality', 'experiment results', 'paper figures', and Chinese trigger terms creates a clear niche that is unlikely to conflict with general plotting or data visualization skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
64%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-crafted skill with strong actionability — the executable code examples, LaTeX snippets, and concrete workflow steps make it immediately useful for generating publication-quality figures. Its main weaknesses are moderate verbosity (duplicate reference tables, verbose scope section, acknowledgements) and the lack of explicit error recovery steps in the workflow. The content would benefit from splitting detailed examples into separate reference files given its length.
Suggestions
Add explicit error handling/feedback loops in Step 5: what to do when a script fails, how to debug common matplotlib issues, and a retry pattern.
Remove the duplicate 'Figure Type Reference' table at the bottom since Step 3's decision tree already covers the same information, or consolidate them into one.
Split detailed code examples (bar charts, line plots, tables) into a separate EXAMPLES.md file and keep only one representative example in the main skill body to improve conciseness and progressive disclosure.
Fix the inconsistency between REVIEWER_MODEL constant (gpt-5.5) and the Step 7 text mentioning 'GPT-5.4'.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is fairly comprehensive but includes some unnecessary content: the scope table explaining what it can/cannot do is somewhat verbose, the acknowledgements section adds no actionable value, and the figure type reference table at the bottom largely duplicates the decision tree in Step 3. The constants section includes a 'REVIEWER_MODEL = gpt-5.5' but Step 7 references 'GPT-5.4' inconsistently. However, most content is actionable rather than explanatory. | 2 / 3 |
Actionability | The skill provides fully executable Python code for multiple figure types (line plots, bar charts), complete LaTeX snippets for tables and figure includes, a ready-to-use style configuration script, and concrete bash commands for running scripts. Examples are copy-paste ready with realistic data patterns. | 3 / 3 |
Workflow Clarity | The 8-step workflow is clearly sequenced and covers the full pipeline from plan reading to quality review. However, there are no explicit validation/feedback loops for error recovery — Step 5 says 'verify all output files exist' but doesn't specify what to do if they don't. The quality checklist in Step 8 is good but is a post-hoc check rather than an integrated validation checkpoint. For a workflow involving file generation and script execution, the lack of error handling caps this at 2. | 2 / 3 |
Progressive Disclosure | The content is well-structured with clear sections and headers, but it's quite long (~250 lines) and monolithic — the detailed code examples for each figure type, the scope table, and the duplicate reference tables could be split into separate files. No bundle files are provided, so there's no progressive disclosure to external references despite the content length warranting it. | 2 / 3 |
Total | 9 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
a425a71
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.