CtrlK
BlogDocsLog inGet started
Tessl Logo

research-refine

Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review. Use when the user says "refine my approach", "帮我细化方案", "decompose this problem", "打磨idea", "refine research plan", "细化研究方案", or wants a concrete research method that stays simple, focused, and top-venue ready instead of a vague or overbuilt idea.

76

Quality

72%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/research-refine/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description with excellent trigger term coverage (including bilingual terms), clear completeness with both 'what' and 'when' clauses, and a distinctive niche. The main weakness is that the specificity of concrete actions is somewhat obscured by a chain of adjectives ('problem-anchored, elegant, frontier-aware, implementation-oriented') rather than listing discrete capabilities. The mention of 'GPT-5.4 review' is unusual and potentially confusing in a Claude skill context.

Suggestions

Replace the adjective chain with 2-3 concrete action verbs (e.g., 'decompose research problems, identify methodological gaps, draft implementation plans') to improve specificity.

Reconsider the 'GPT-5.4 review' reference—it may confuse skill selection since this is a Claude skill, and could be replaced with a clearer description of the iterative refinement process.

DimensionReasoningScore

Specificity

The description names the domain (research planning) and describes the general action ('Turn a vague research direction into a problem-anchored, elegant, frontier-aware, implementation-oriented method plan via iterative GPT-5.4 review'), but the specific concrete actions are somewhat buried in adjectives rather than listing distinct steps like 'decompose problems, identify gaps, draft methodology sections'.

2 / 3

Completeness

Clearly answers both 'what' (turn vague research direction into a concrete, implementation-oriented method plan via iterative review) and 'when' (explicit 'Use when...' clause with multiple trigger phrases and a description of the scenario: wanting concrete research methods instead of vague or overbuilt ideas).

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms in both English and Chinese: 'refine my approach', '帮我细化方案', 'decompose this problem', '打磨idea', 'refine research plan', '细化研究方案'. These are phrases users would naturally say, and the bilingual coverage is a strength.

3 / 3

Distinctiveness Conflict Risk

The description carves out a very specific niche: research method refinement with iterative AI review, targeting top-venue readiness. The bilingual triggers and specific phrases like 'refine my approach' and '打磨idea' make it highly distinctive and unlikely to conflict with general coding, writing, or other research skills.

3 / 3

Total

11

/

12

Passed

Implementation

55%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill is remarkably thorough and actionable with excellent workflow clarity, checkpoint recovery, and concrete MCP tool call templates. However, it is severely over-long for a SKILL.md file — the inline templates, repeated principles, and exhaustive output format specifications make it a monolithic document that would consume enormous context window space. The content would benefit greatly from splitting templates and prompts into referenced files while keeping the SKILL.md as a concise overview.

Suggestions

Extract the reviewer prompt templates (Phase 2 and Phase 4) into a separate file like `refine-prompts.md` and reference them from SKILL.md — these alone account for ~100 lines.

Move the proposal template structure (Phase 1.6), refinement template (Phase 3.2), and all final report templates (Phase 5) into a `refine-templates.md` reference file.

Consolidate the four principles, Key Rules section, and repeated guidance (e.g., 'smallest adequate mechanism', 'anchor first') into a single concise principles block — currently the same ideas appear 3-4 times throughout the document.

Remove or drastically shorten the checkpoint recovery section (Initialization) — the JSON schema and resume table could be in a reference file, with just a one-line instruction to 'check and resume from REFINE_STATE.json per recovery-protocol.md'.

DimensionReasoningScore

Conciseness

This skill is extremely verbose at ~500+ lines. It over-explains every phase, includes extensive template structures that could be referenced externally, repeats the same principles multiple times (e.g., 'smallest adequate mechanism' appears in principles, Phase 1, Phase 3, and Key Rules), and includes detailed JSON schemas and markdown templates inline that bloat the content significantly. Much of this could be compressed or split into reference files.

1 / 3

Actionability

The skill provides highly concrete, executable guidance: exact MCP tool calls with full prompt templates, specific JSON schemas for state persistence, complete markdown templates for every output file, explicit stop conditions, and detailed phase-by-phase instructions with specific field values and scoring weights. An implementer could follow this step-by-step.

3 / 3

Workflow Clarity

The multi-step workflow is exceptionally well-sequenced with explicit phase boundaries, checkpoint recovery logic with a clear resume table, stop conditions (score threshold + verdict + drift check), and feedback loops (Phase 3-4 iterate until convergence). Validation is built into every round via anchor checks and simplicity checks before making changes.

3 / 3

Progressive Disclosure

This is a monolithic wall of text with everything inline. The full reviewer prompt templates, all output file templates, the complete proposal structure, checkpoint recovery logic, and detailed phase instructions are all in a single file. The proposal template alone (~40 lines), reviewer prompts (~50+ lines each), and output file templates (~100+ lines) should be in separate referenced files. The skill mentions composing with other skills but doesn't split its own content.

1 / 3

Total

8

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

skill_md_line_count

SKILL.md is long (736 lines); consider splitting into references/ and linking

Warning

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

Total

9

/

11

Passed

Repository
wanshuiyin/Auto-claude-code-research-in-sleep
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.