Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review. Use when the user says "refine my approach", "帮我细化方案", "decompose this problem", "打磨idea", "refine research plan", "细化研究方案", or wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea.
76
72%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/research-refine/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description with excellent trigger term coverage (including bilingual terms), clear completeness with both 'what' and 'when' clauses, and a distinctive niche. The main weakness is that the specificity of concrete actions is somewhat obscured by a chain of adjectives ('problem-anchored, elegant, frontier-aware, implementation-oriented') rather than listing discrete capabilities. The mention of 'GPT-5.4 review' is unusual and potentially confusing in a Claude skill context.
Suggestions
Replace the adjective chain with 2-3 concrete action verbs (e.g., 'decompose research problems, identify methodological gaps, draft implementation plans') to improve specificity.
Reconsider the 'GPT-5.4 review' reference—it may confuse skill selection since this is a Claude skill, and could be replaced with a clearer description of the iterative refinement process.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (research planning) and describes the general action ('Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review'), but the specific concrete actions are somewhat buried in adjectives rather than listing distinct steps like 'decompose problems, identify gaps, draft methodology sections'. | 2 / 3 |
Completeness | Clearly answers both 'what' (turn vague research direction into a concrete, implementation-oriented method plan via iterative review) and 'when' (explicit 'Use when...' clause with multiple trigger phrases and a description of the scenario: wanting concrete research methods instead of vague or overbuilt ideas). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms in both English and Chinese: 'refine my approach', '帮我细化方案', 'decompose this problem', '打磨idea', 'refine research plan', '细化研究方案'. These are phrases users would naturally say, and the bilingual coverage is a strength. | 3 / 3 |
Distinctiveness Conflict Risk | The description carves out a very specific niche: research method refinement with iterative AI review, targeting top-venue readiness. The bilingual triggers and specific phrases like 'refine my approach' and '打磨idea' make it highly distinctive and unlikely to conflict with general coding, writing, or other research skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
55%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
The skill is remarkably thorough and actionable with excellent workflow clarity, checkpoint recovery, and concrete MCP tool call templates. However, it is severely over-long for a SKILL.md file — the inline templates, repeated principles, and exhaustive output format specifications make it a monolithic document that would consume enormous context window space. The content would benefit greatly from splitting templates and prompts into referenced files while keeping the SKILL.md as a concise overview.
Suggestions
Extract the reviewer prompt templates (Phase 2 and Phase 4) into a separate file like `refine-prompts.md` and reference them from SKILL.md — these alone account for ~100 lines.
Move the proposal template structure (Phase 1.6), refinement template (Phase 3.2), and all final report templates (Phase 5) into a `refine-templates.md` reference file.
Consolidate the four principles, Key Rules section, and repeated guidance (e.g., 'smallest adequate mechanism', 'anchor first') into a single concise principles block — currently the same ideas appear 3-4 times throughout the document.
Remove or drastically shorten the checkpoint recovery section (Initialization) — the JSON schema and resume table could be in a reference file, with just a one-line instruction to 'check and resume from REFINE_STATE.json per recovery-protocol.md'.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | This skill is extremely verbose at ~500+ lines. It over-explains every phase, includes extensive template structures that could be referenced externally, repeats the same principles multiple times (e.g., 'smallest adequate mechanism' appears in principles, Phase 1, Phase 3, and Key Rules), and includes detailed JSON schemas and markdown templates inline that bloat the content significantly. Much of this could be compressed or split into reference files. | 1 / 3 |
Actionability | The skill provides highly concrete, executable guidance: exact MCP tool calls with full prompt templates, specific JSON schemas for state persistence, complete markdown templates for every output file, explicit stop conditions, and detailed phase-by-phase instructions with specific field values and scoring weights. An implementer could follow this step-by-step. | 3 / 3 |
Workflow Clarity | The multi-step workflow is exceptionally well-sequenced with explicit phase boundaries, checkpoint recovery logic with a clear resume table, stop conditions (score threshold + verdict + drift check), and feedback loops (Phase 3-4 iterate until convergence). Validation is built into every round via anchor checks and simplicity checks before making changes. | 3 / 3 |
Progressive Disclosure | This is a monolithic wall of text with everything inline. The full reviewer prompt templates, all output file templates, the complete proposal structure, checkpoint recovery logic, and detailed phase instructions are all in a single file. The proposal template alone (~40 lines), reviewer prompts (~50+ lines each), and output file templates (~100+ lines) should be in separate referenced files. The skill mentions composing with other skills but doesn't split its own content. | 1 / 3 |
Total | 8 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (736 lines); consider splitting into references/ and linking | Warning |
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
Total | 9 / 11 Passed | |
dc00dfb
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.