Orchestrate parallel scientist agents for comprehensive analysis with AUTO mode
25
17%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Risky
Do not use without reviewing
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/sciomc/SKILL.mdQuality
Discovery
7%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description is too vague and jargon-heavy to be effective for skill selection. It fails to specify what concrete actions are performed, what domain it applies to, or when Claude should select it. The use of internal terminology like 'AUTO mode' and 'scientist agents' without explanation makes it nearly unusable for matching user requests.
Suggestions
Add a 'Use when...' clause specifying the scenarios or user requests that should trigger this skill, e.g., 'Use when the user requests multi-faceted scientific analysis, parallel experimentation, or automated research workflows.'
Replace vague phrases like 'comprehensive analysis' with specific concrete actions, e.g., 'Runs multiple independent analysis pipelines in parallel, aggregates results, and synthesizes findings across datasets.'
Include natural trigger terms that users would actually say, such as 'run experiments', 'analyze data in parallel', 'multi-agent analysis', or domain-specific keywords relevant to the skill's purpose.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague language like 'comprehensive analysis' and 'orchestrate parallel scientist agents' without specifying what concrete actions are performed. It doesn't explain what kind of analysis, what the agents do, or what outputs are produced. | 1 / 3 |
Completeness | The description weakly addresses 'what' (orchestrate agents for analysis) but provides no 'when' clause or explicit trigger guidance. There is no 'Use when...' or equivalent, and the 'what' itself is too vague to be useful. | 1 / 3 |
Trigger Term Quality | The terms 'orchestrate', 'parallel scientist agents', and 'AUTO mode' are technical jargon that users would rarely naturally say. There are no common user-facing keywords that would help match this skill to typical user requests. | 1 / 3 |
Distinctiveness Conflict Risk | The mention of 'parallel scientist agents' and 'AUTO mode' provides some distinctiveness since these are unusual terms, but 'comprehensive analysis' is generic enough to potentially overlap with other analysis-related skills. | 2 / 3 |
Total | 5 / 12 Passed |
Implementation
27%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill attempts to document a comprehensive research orchestration system but suffers from extreme verbosity and poor content organization. While the core workflow concept is sound and the Task invocation examples provide some actionable guidance, the skill includes far too much detail inline—full JSON schemas, regex patterns, report templates, and figure embedding protocols that should be in separate reference files. The result is a ~350+ line document that would consume significant context window for information that could be condensed to under 100 lines with proper progressive disclosure.
Suggestions
Extract the report template, state.json schema, regex patterns, and figure embedding protocol into separate reference files (e.g., REPORT_TEMPLATE.md, STATE_SCHEMA.md, TAG_REFERENCE.md) and link to them from the main skill.
Condense the main SKILL.md to focus on the core workflow (decompose → execute → verify → synthesize) with one concrete example of each step, removing redundant parallel execution pattern variants.
Add explicit validation gates between workflow stages (e.g., 'Only proceed to execution when decomposition produces 3-7 stages with assigned tiers' and 'Only generate report when verification returns [VERIFIED]').
Remove the routing decision guide table with example prompts—Claude can infer complexity tiers from the simpler model routing table above it.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~350+ lines. It over-explains concepts Claude already understands (parallel execution, regex patterns, JSON schemas, markdown report templates). Much of this content—like the full state.json schema, regex extraction patterns, report template, and figure embedding protocol—could be dramatically condensed or moved to reference files. The routing decision guide and multiple parallel execution pattern examples are redundant. | 1 / 3 |
Actionability | The skill provides concrete examples of Task invocations with model parameters and structured tag formats, which is useful. However, much of the code is pseudocode or template placeholders (e.g., `{{ITERATION}}`, `{{STATE}}`) rather than truly executable code. The Task() calls use a non-standard syntax that isn't clearly tied to an actual API, and the regex patterns are illustrative but not integrated into a runnable workflow. | 2 / 3 |
Workflow Clarity | The four-stage workflow (Decomposition → Execution → Verification → Synthesis) is clearly stated, and the verification loop with cross-validation is a good feedback mechanism. However, the actual sequencing between stages lacks explicit validation checkpoints—there's no clear 'only proceed when X passes' gate between decomposition and execution, and the AUTO mode loop control is described abstractly with template variables rather than concrete decision points. | 2 / 3 |
Progressive Disclosure | This is a monolithic wall of text with no references to external files despite having extensive content that should be split out (report templates, regex patterns, state schemas, configuration details, troubleshooting). No bundle files are provided, yet the content is far too long for a single SKILL.md. The directory structure and session management details alone could fill a separate reference file. | 1 / 3 |
Total | 6 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (512 lines); consider splitting into references/ and linking | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
679b418
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.