
research-refine

Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review. Use when the user says "refine my approach", "帮我细化方案" ("help me refine the plan"), "decompose this problem", "打磨idea" ("polish the idea"), "refine research plan", "细化研究方案" ("refine the research plan"), or wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea.

76

Quality: 72%
Does it follow best practices?

Impact: Pending
No eval scenarios have been run.

Security (by Snyk): Advisory
Reviewing before use is suggested.

Optimize this skill with Tessl:

`npx tessl skill review --optimize ./skills/research-refine/SKILL.md`

Quality

Discovery: 89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description: excellent bilingual trigger-term coverage, complete 'what' and 'when' clauses, and a distinctive niche. Its main weakness is that the concrete actions are obscured by a chain of adjectives ('problem-anchored, elegant, frontier-aware, implementation-oriented') rather than listed as discrete capabilities. The mention of 'GPT-5.4 review' is an unusual and potentially confusing detail.

Suggestions

Replace the adjective chain with 2-3 concrete actions, e.g., 'decomposes research problems, identifies methodological gaps, drafts implementation-ready method plans' to improve specificity.

Clarify or remove the 'GPT-5.4 review' reference, which may confuse skill selection or seem like an over-claim about the underlying process.

Dimension scores:

Specificity (2 / 3): The description names the domain (research planning) and describes the general action ('Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review'), but the specific concrete actions are somewhat buried in adjectives rather than listed as distinct steps like 'decompose problems, identify gaps, draft methodology sections'.

Completeness (3 / 3): Clearly answers both 'what' (turn a vague research direction into a concrete, implementation-oriented method plan via iterative review) and 'when' (an explicit 'Use when...' clause with multiple trigger phrases and a description of the situation: 'wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea').

Trigger Term Quality (3 / 3): Excellent coverage of natural trigger terms in both English and Chinese: 'refine my approach', '帮我细化方案', 'decompose this problem', '打磨idea', 'refine research plan', '细化研究方案'. These are phrases users would naturally say, and the bilingual coverage is a strength.

Distinctiveness / Conflict Risk (3 / 3): Highly distinctive niche: research method refinement with specific qualities (frontier-aware, top-venue ready, iterative GPT-5.4 review). The bilingual triggers and specific research-planning focus make it unlikely to conflict with general coding, writing, or other skills.

Total: 11 / 12 (Passed)

Implementation: 55%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill is remarkably thorough and actionable with excellent workflow clarity, checkpoint recovery, and concrete MCP tool calls. However, it is severely over-long — embedding every template, prompt, and schema inline creates a massive document that wastes context window tokens. The content would score much higher if templates and reviewer prompts were extracted into referenced files, leaving the main skill as a concise orchestration guide.

Suggestions

Extract the proposal template (Step 1.6), reviewer prompt (Phase 2), refinement template (Phase 3), and all final report templates (Phase 5) into separate referenced files (e.g., `templates/proposal-template.md`, `prompts/reviewer-prompt.md`) to dramatically reduce inline length.

Consolidate the repeated principles — 'smallest adequate mechanism', 'one paper one contribution', 'anchor first' appear in the overview, multiple phase descriptions, and Key Rules. State them once in a principles section and reference them.

Move the checkpoint recovery logic and state schema into a separate `CHECKPOINT.md` reference file, keeping only a brief summary inline.

Remove explanatory rationale that Claude already understands (e.g., 'Experiments exist to validate the method, not to dominate the document', 'A long module list is not novelty') — these are judgment calls Claude can make without being told.
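As a rough sketch, the extraction the suggestions describe might produce a layout like the following. The file names follow the review's own examples; `CHECKPOINT.md` and the `prompts/` directory are the reviewer's proposed additions, not files that currently exist in the repository:

```python
from pathlib import Path

# Hypothetical post-extraction layout for the skill, based on the
# suggestions above. SKILL.md shrinks to a concise orchestration guide
# that links out to these referenced files.
root = Path("skills/research-refine")
extracted = [
    root / "templates" / "proposal-template.md",  # proposal template (Step 1.6)
    root / "prompts" / "reviewer-prompt.md",      # reviewer prompt (Phase 2)
    root / "CHECKPOINT.md",                       # recovery logic + state schema
]
for f in extracted:
    f.parent.mkdir(parents=True, exist_ok=True)
    f.touch()
```

Running this only scaffolds empty files; the actual move of each inline block into its referenced file is manual editing work.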

Dimension scores:

Conciseness (1 / 3): This skill is extremely verbose at ~500+ lines. It over-explains every phase, includes extensive template structures that could be referenced externally, repeats the same principles multiple times (e.g., 'smallest adequate mechanism' appears in the principles, Step 1.2, Step 1.3, Step 3.2, and Key Rules), and includes lengthy reviewer prompt templates inline. Much of this content (like the full proposal template, the full reviewer prompt, the checkpoint state schema) could be in referenced files.

Actionability (3 / 3): The skill provides highly concrete, executable guidance: exact MCP tool calls with specific parameters, complete markdown templates for every output file, specific JSON schemas for state persistence, exact reviewer prompts with scoring dimensions and weights, and clear stop conditions. An implementer could follow this step by step.

Workflow Clarity (3 / 3): The multi-phase workflow is exceptionally well sequenced, with explicit checkpoints after every phase, clear stop conditions (score >= 9, MAX_ROUNDS), checkpoint recovery logic with a detailed resume table, feedback loops (Phases 3-4 iterate until threshold), and validation at each step (anchor check, simplicity check, drift warnings). The initialization section handles edge cases like stale checkpoints.

Progressive Disclosure (1 / 3): The skill is a monolithic wall of text with everything inline. The full proposal template (~40 lines), the full reviewer prompt (~50 lines), the refinement template, the final report template, the review summary template, and the refinement report template are all embedded directly. These should be in referenced files; the only external references are to shared protocols at the end. The skill would benefit enormously from splitting templates and prompts into separate files.

Total: 8 / 12 (Passed)
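The review-and-refine loop praised under Workflow Clarity (iterate until the score reaches 9 or MAX_ROUNDS is hit) can be sketched roughly as follows. The function names and the round cap of 5 are illustrative assumptions, not details taken from the skill itself:

```python
SCORE_THRESHOLD = 9  # stop condition named in the review: score >= 9
MAX_ROUNDS = 5       # assumed cap; the skill defines its own MAX_ROUNDS

def refine_until_threshold(proposal, review_fn, refine_fn):
    """Iterate in the Phase 3-4 style: review the proposal, refine it on
    the feedback, and stop once the score meets the threshold or the
    round budget is exhausted."""
    score = None
    for round_no in range(1, MAX_ROUNDS + 1):
        score, feedback = review_fn(proposal)
        if score >= SCORE_THRESHOLD:
            break  # threshold reached; keep the current proposal
        proposal = refine_fn(proposal, feedback)
    return proposal, score, round_no
```

A checkpoint write after each round (as the skill's recovery logic requires) would slot in at the end of the loop body.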

Validation: 81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation for skill structure: 9 / 11 checks passed.

Criteria with warnings:

skill_md_line_count (Warning): SKILL.md is long (743 lines); consider splitting into references/ and linking.

allowed_tools_field (Warning): 'allowed-tools' contains unusual tool name(s).

Total: 9 / 11 (Passed)

Repository: wanshuiyin/Auto-claude-code-research-in-sleep (Reviewed)

