research-refine-pipeline

Run an end-to-end workflow that chains `research-refine` and `experiment-plan`. Use when the user wants a one-shot pipeline from vague research direction to focused final proposal plus detailed experiment roadmap, or asks to "串起来", build a pipeline, do it end-to-end, or generate both the method and experiment plan together.

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

A well-structured orchestration skill with a clear phased workflow, explicit gating checkpoints, and exemplary progressive disclosure that delegates detail to sibling skills. The main weakness is mild redundancy across the Core Rule, Key Rules, and summary sections that could be tightened.

Suggestions

Consolidate the overlapping guidance in 'Core Rule' and 'Key Rules' to remove restated constraints and reduce token cost.

Trim the Phase 5 summary block and the PIPELINE_SUMMARY template where they duplicate the 'Default Outputs' and 'Composing with Other Skills' lists.

Consider whether the full inline PIPELINE_SUMMARY.md template is needed in SKILL.md or could be referenced from a template file to keep the overview leaner.

Dimension	Reasoning	Score
Conciseness	Mostly efficient and assumes Claude's competence (no concept-teaching padding), but restates the staged-vs-integrated distinction and overlapping rules across 'Core Rule', 'Key Rules', and the summary sections, so it could be tightened.	2 / 3
Actionability	Provides concrete file paths, a copy-paste-ready PIPELINE_SUMMARY.md template, explicit gate questions, and specific exit-criteria checklists — actionable guidance rather than vague direction (code absence is fine for an orchestration skill).	3 / 3
Workflow Clarity	Phases 0–5 are clearly sequenced with explicit validation checkpoints (Phase 2 Planning Gate), decision branches in Phase 0 triage, and exit-criteria checklists with feedback loops.	3 / 3
Progressive Disclosure	Stays a clear overview and defers stage-specific detail to one-level-deep, clearly signaled sibling skills ('only when needed') and shared-references protocols, with content appropriately split rather than inlined.	3 / 3
	Total	11 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

A strong, specific description that states concrete capabilities, provides explicit 'Use when' triggers with natural user phrasings (including a Chinese trigger), and is clearly distinguishable from the single-stage skills it composes. No meaningful weaknesses.

Dimension	Reasoning	Score
Specificity	Lists multiple concrete actions and deliverables — chaining `research-refine` and `experiment-plan`, producing a focused final proposal and a detailed experiment roadmap — matching the 'lists multiple specific concrete actions' anchor.	3 / 3
Completeness	Clearly answers both 'what' (chains the two workflows into proposal + experiment roadmap) and 'when' with an explicit 'Use when...' clause and multiple triggers.	3 / 3
Trigger Term Quality	Includes natural phrasings a user would actually say ('build a pipeline', 'do it end-to-end', '串起来', 'generate both the method and experiment plan together'), giving good coverage including a non-English natural trigger.	3 / 3
Distinctiveness Conflict Risk	Has a clear niche as the integrated/one-shot pipeline with distinct triggers ('串起来', 'build a pipeline', 'do it end-to-end') that differentiate it from the single-stage sibling skills.	3 / 3
	Total	12 / 12 Passed

Validation

93%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 15 / 16 Passed

Validation for skill structure

Criteria	Description	Result
relative_links	Relative link issues: 3 suspicious	Warning

	Total	15 / 16 Passed

Repository: wanshuiyin/Auto-claude-code-research-in-sleep
Commit: fe5963c

Reviewed: about 16 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.