This skill should be used when the user says "execute plan", "run the plan", "start execution", "execute project", "execute all tasks", "run all tasks", "run plan in parallel", "execute with review gates", or wants to execute a full structured project plan after tasks have been created via arn-code-taskify. This skill runs ALL tasks from the task list — for executing a single specific task, use arn-code-execute-task instead. For Agent Teams mode, use arn-code-execute-plan-teams. Do NOT use this skill when executing a single task — use `/arn-code-execute-task` instead.
59
68%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugins/arn-code/skills/arn-code-execute-plan/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description excels at trigger term coverage and disambiguation from related skills, making it very effective for skill selection in a large skill library. Its main weakness is that it could be more specific about what concrete actions the skill performs beyond 'running all tasks' — e.g., does it orchestrate parallel execution, manage dependencies, produce reports? The explicit negative guidance ('Do NOT use this skill when...') is a strong differentiator.
Suggestions
Add 2-3 specific concrete actions the skill performs (e.g., 'orchestrates task dependencies, manages parallel execution, enforces review gates between phases, generates execution reports') to improve specificity.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description mentions executing a full structured project plan and running all tasks from a task list, but it doesn't list specific concrete actions beyond 'run all tasks'. It references parallel execution and review gates but doesn't elaborate on what the skill actually does mechanically. | 2 / 3 |
Completeness | Clearly answers both 'what' (executes all tasks from a structured project plan created via arn-code-taskify) and 'when' (explicit trigger phrases plus clear boundary conditions distinguishing it from single-task execution and teams mode). Includes explicit 'Use when' equivalent and 'Do NOT use when' guidance. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger phrases: 'execute plan', 'run the plan', 'start execution', 'execute project', 'execute all tasks', 'run all tasks', 'run plan in parallel', 'execute with review gates'. These are phrases users would naturally say. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with explicit disambiguation from related skills: 'arn-code-execute-task' for single tasks and 'arn-code-execute-plan-teams' for Agent Teams mode. The 'Do NOT use' clause further reduces conflict risk. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
47%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill has excellent workflow clarity with well-defined steps, validation checkpoints, and comprehensive error recovery paths. However, it suffers significantly from verbosity — the inline error handling branches, full user prompt text, and detailed procedural logic inflate the token cost far beyond what's necessary. The content would benefit greatly from offloading detailed error handling and prompt templates to reference files, keeping only the high-level flow inline.
Suggestions
Move the extensive error handling section (especially the test failure branches with full prompt text) to a reference file like `references/error-handling.md` and summarize inline with a one-line description per error type plus a pointer to the reference.
Remove or shorten the full AskUserQuestion prompt text inline — Claude can generate appropriate prompts from brief instructions like 'Ask user: resume vs restart execution, explaining progress state'.
Consolidate the prerequisite checks (Steps 1.3-1.8) into a concise checklist format rather than individual paragraphs, reducing repetitive 'if missing, suggest running X' patterns to a single table or compact list.
Consider offloading the sketch artifact detection logic (Step 1.8) to a reference file, as it contains implementation-level detail about path resolution that inflates the main workflow.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~200+ lines with extensive inline detail that could be offloaded to reference files. Many sections explain orchestration logic, error handling branches, and UI prompts in exhaustive detail that inflates token cost significantly. The error handling section alone contains multiple multi-paragraph branches with full prompt text that could be in a reference file. | 1 / 3 |
Actionability | The skill provides concrete steps, specific file paths, and exact prompts to present to users, which is good. However, the core dispatch loop (Step 4) delegates entirely to an external reference file with no inline summary of the algorithm, and there's no executable code — just directory structures and procedural descriptions. The guidance is specific but not copy-paste executable. | 2 / 3 |
Workflow Clarity | The workflow is clearly sequenced with numbered steps, explicit validation checkpoints (verify project directory, verify task list, confirm before execution), feedback loops (execute-review-gate cycle with retry/escalate), and comprehensive error handling with branching recovery paths. The dependency graph verification and review cycle max-retry escalation demonstrate strong validation discipline. | 3 / 3 |
Progressive Disclosure | The skill references external files for the dispatch loop and pattern refresh (dispatch-loop.md, pattern-refresh.md), which is good progressive disclosure. However, the massive error handling section, the detailed prompts, and the sketch artifact detection logic are all inline when they could be offloaded to reference files. The bundle files were not provided, so referenced paths cannot be verified, but the structure shows a mix of appropriate delegation and excessive inline content. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
b9084b6
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.