CtrlK
BlogDocsLog inGet started
Tessl Logo

plan-task

Refine, parallelize, and verify a draft task specification into a fully planned implementation-ready task

37

Quality

36%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Fix and improve this skill with Tessl

tessl review fix ./plugins/sdd/skills/plan-task/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

55%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill excels at actionability and workflow clarity - it provides an extremely detailed, well-sequenced multi-agent orchestration workflow with concrete prompts, validation gates, and error recovery. However, it is severely undermined by its extreme verbosity (~800+ lines) and monolithic structure. The repetitive judge patterns, inline configuration templates, and duplicated structural elements waste significant token budget and could be dramatically condensed through pattern abstraction and progressive disclosure into supporting files.

Suggestions

Extract the repeated judge pattern into a single template (e.g., JUDGE_TEMPLATE.md) with per-phase rubric definitions, rather than repeating the full launch/prompt/decision-logic structure 7 times.

Move the detailed configuration parsing, argument definitions, and resolution logic into a separate CONFIG.md reference file, keeping only a brief summary in the main SKILL.md.

Extract the completion summary template and artifact tree into a separate COMPLETION_TEMPLATE.md, referenced from the main workflow.

Consolidate the phase definitions by defining the common pattern once (agent launch → capture → judge → retry logic) and then listing only the per-phase differences (agent type, model, prompt, rubric criteria) in a compact table or list format.

DimensionReasoningScore

Conciseness

This skill is extremely verbose at ~800+ lines. It extensively repeats patterns (every phase follows the same launch-agent/capture/judge template), includes massive configuration tables, full JSON todo structures, complete ASCII diagrams, and detailed retry flow diagrams. Much of this is redundant - the judge pattern could be defined once and referenced. Claude doesn't need explanations of what PDF formats are equivalent here: it doesn't need the full todo JSON template spelled out, or the complete artifact tree repeated in the completion section.

1 / 3

Actionability

The skill provides highly concrete, executable guidance throughout. Every phase has specific agent types, model selections, exact prompt templates, capture requirements, decision logic with thresholds, and bash commands. The configuration resolution pseudocode is clear and implementable. Judge rubrics include specific weights and scoring anchors.

3 / 3

Workflow Clarity

The multi-step workflow is exceptionally well-sequenced with explicit dependency chains, synchronization points, validation checkpoints (judges after every phase), retry/feedback loops with MAX_ITERATIONS limits, and clear error handling procedures. The ASCII workflow diagram, retry flow diagrams, and decision logic (PASS/FAIL/MAX_ITERATIONS) provide unambiguous guidance for every branching scenario.

3 / 3

Progressive Disclosure

This is a monolithic wall of text with no bundle files provided. The entire workflow specification, all judge rubrics, all configuration details, retry logic, error handling, and completion templates are inlined in a single massive file. The judge rubrics alone (repeated 7 times with similar structure) could be extracted to a separate reference file. Phase definitions, configuration parsing, and completion templates could all be split into separate referenced documents.

1 / 3

Total

8

/

12

Passed

Description

17%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is too abstract and process-oriented, using internal workflow jargon rather than natural user language. It lacks a 'Use when...' clause entirely, making it difficult for Claude to know when to select this skill. The actions described are vague and could overlap with other planning or task management skills.

Suggestions

Add an explicit 'Use when...' clause with natural trigger terms, e.g., 'Use when the user asks to plan, break down, or prepare a task for implementation, or when refining a draft spec into actionable work items.'

Replace abstract terms like 'refine', 'parallelize', and 'verify' with concrete actions, e.g., 'Breaks a draft task spec into parallel subtasks, validates dependencies, and produces an implementation-ready plan with clear steps.'

Include natural keywords users would say, such as 'task planning', 'break down work', 'implementation plan', 'subtasks', or 'work breakdown'.

DimensionReasoningScore

Specificity

Names some actions ('refine', 'parallelize', 'verify') and a domain ('task specification'), but these terms are somewhat abstract. It doesn't list concrete, granular actions like specific outputs or artifacts produced.

2 / 3

Completeness

It describes a vague 'what' but completely lacks a 'when' clause. There is no explicit trigger guidance telling Claude when to select this skill. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and the weak 'what' brings it down to 1.

1 / 3

Trigger Term Quality

The terms 'refine', 'parallelize', and 'verify a draft task specification' are not natural phrases users would say. Users are more likely to say things like 'plan my task', 'break down this work', or 'prepare implementation steps'. The language is overly process-oriented jargon.

1 / 3

Distinctiveness Conflict Risk

The mention of 'draft task specification' and 'implementation-ready task' provides some specificity, but terms like 'refine' and 'verify' are generic enough to overlap with other planning, review, or specification skills.

2 / 3

Total

6

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

skill_md_line_count

SKILL.md is long (1224 lines); consider splitting into references/ and linking

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
NeoLabHQ/context-engineering-kit
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.