Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review. Use when the user says "refine my approach", "帮我细化方案", "decompose this problem", "打磨idea", "refine research plan", "细化研究方案", or wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea.
60
72%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/research-refine/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description with excellent trigger term coverage (bilingual), clear completeness with both 'what' and 'when' clauses, and a distinctive niche. The main weakness is that the specificity of concrete actions is somewhat obscured by a string of adjectives ('problem-anchored, elegant, frontier-aware, implementation-oriented') rather than listing discrete capabilities. The mention of 'GPT-5.4 review' is unusual and potentially confusing.
Suggestions
Replace the chain of adjectives with 2-3 concrete action verbs (e.g., 'decompose research problems, identify methodological gaps, draft implementation plans') to improve specificity.
Clarify or remove the 'GPT-5.4 review' reference, which is ambiguous and could confuse skill selection—specify the actual review mechanism instead.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (research planning) and describes the general action ('Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review'), but the specific concrete actions are somewhat buried in adjectives rather than listing distinct steps like 'decompose problems, identify gaps, draft methodology sections'. | 2 / 3 |
Completeness | Clearly answers both 'what' (turn vague research direction into a concrete, implementation-oriented method plan) and 'when' (explicit 'Use when' clause with multiple trigger phrases and a description of the scenario: 'wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea'). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms in both English and Chinese: 'refine my approach', '帮我细化方案', 'decompose this problem', '打磨idea', 'refine research plan', '细化研究方案'. These are phrases users would naturally say, and the bilingual coverage is a strength. | 3 / 3 |
Distinctiveness Conflict Risk | The description carves out a very specific niche: research method refinement with iterative review, targeting academic/frontier research planning. The bilingual triggers and specific phrases like 'top-venue ready' and '打磨idea' make it highly distinctive and unlikely to conflict with general coding or writing skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
55%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill is highly actionable and has excellent workflow clarity with proper checkpoints, feedback loops, and recovery mechanisms. However, it is severely over-long and monolithic — the ~500+ lines include repeated principles, inline templates, and full reviewer prompts that should be split into separate reference files. The content would be far more effective at roughly one-third its current length with templates and prompts externalized.
Suggestions
Extract the large reviewer prompt templates (Phase 2 and Phase 4) into separate reference files (e.g., `reviewer-prompt.md`) and reference them from the main skill, reducing inline verbosity by ~30%.
Move the detailed output file templates (REVIEW_SUMMARY.md, FINAL_PROPOSAL.md, REFINEMENT_REPORT.md, round-N-refinement.md structures) into a single `templates/` reference file, keeping only a brief description and file list in the main skill.
Consolidate the four principles stated in the Overview with the Key Rules section — they substantially overlap (anchor first, one contribution, smallest mechanism, modern leverage appear in both places).
Move the checkpoint recovery logic and state schema into a separate `checkpoint-recovery.md` reference file, keeping only a one-line reference in the main workflow.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~500+ lines. It explains concepts Claude already knows (what a Problem Anchor is, how to parse JSON, how to write markdown files), repeats the same principles multiple times (anchor check, simplicity check, contribution focus appear in overview, constants, workflow, key rules, and templates), and includes massive template structures that could be referenced from separate files. The checkpoint recovery section alone is substantial and could be a separate reference. Every section is padded with explanations that an intelligent model doesn't need. | 1 / 3 |
Actionability | The skill provides highly concrete, executable guidance: exact MCP tool calls with full prompt text, specific JSON schemas for state files, complete markdown templates for every output file, explicit stop conditions with numeric thresholds, and precise file naming conventions. An implementer could follow this step-by-step without ambiguity. | 3 / 3 |
Workflow Clarity | The multi-phase workflow is clearly sequenced (Phase 0→1→2→3→4→loop→5) with explicit checkpoints after each phase, a well-defined stop condition (score >= 9 AND READY AND no drift), checkpoint recovery with a detailed resume table, and feedback loops (Phase 3-4 iterate until threshold or max rounds). Validation is built into every round via the anchor check and simplicity check. | 3 / 3 |
Progressive Disclosure | The skill is a monolithic wall of text with no bundle files to offload content to. The massive reviewer prompt templates, output file templates, checkpoint recovery logic, and detailed proposal structure could all be split into separate reference files. References to shared protocols (output-versioning.md, output-manifest.md, output-language.md) exist but the bulk of the content that should be in separate files is inline. The skill would benefit enormously from splitting templates and prompts into referenced files. | 1 / 3 |
Total | 8 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (743 lines); consider splitting into references/ and linking | Warning |
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
Total | 9 / 11 Passed | |
a425a71
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.