
arn-infra-wizard

This skill should be used when the user says "infra wizard", "arn infra wizard", "guided infra", "walk me through infrastructure", "infrastructure pipeline", "full infra pipeline", "infra flow", "run the infra pipeline", "guide me through infrastructure", "infra guided mode", "infrastructure wizard", "end to end infrastructure", "deploy everything", "set up my infrastructure", "infra start to finish", "arn infra guided", "run infra wizard", "set up everything", "complete infra setup", or wants to be walked through the entire Arness infrastructure pipeline in a single continuous session with guided decision gates instead of invoking each skill manually.

55

Quality

44%

Does it follow best practices?

Impact

No eval scenarios have been run

Security by Snyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/arn-infra/skills/arn-infra-wizard/SKILL.md

Quality

Discovery

54%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This description is heavily weighted toward trigger terms and 'when to use' guidance, which it does well, but it almost entirely neglects explaining what the skill actually does. The lack of concrete actions or capabilities means Claude would know when to select this skill but wouldn't understand what it provides. The description reads more like a keyword list than a functional description.

Suggestions

Add specific concrete actions the skill performs, e.g., 'Walks through provisioning VPCs, deploying services, configuring DNS, and setting up CI/CD pipelines in a guided step-by-step session.'

Reduce the trigger phrase list to the 5-8 most natural variations and use the saved space to describe the pipeline stages and decision gates.

Explain what 'Arness infrastructure pipeline' entails — what components, services, or resources are set up — so Claude can differentiate this from individual infrastructure skills.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | The description does not list any concrete actions or capabilities. It only mentions 'the entire Arness infrastructure pipeline' and 'guided decision gates' without specifying what steps, tools, or actions are involved. There are no specific operations like 'provision VPCs', 'deploy containers', or 'configure DNS'. | 1 / 3 |
| Completeness | The 'when' is thoroughly covered with explicit trigger phrases and a 'Use when' equivalent clause. However, the 'what' is extremely weak: it only vaguely references 'the entire Arness infrastructure pipeline' and 'guided decision gates' without explaining what the skill actually does or what steps it performs. | 2 / 3 |
| Trigger Term Quality | The description includes an extensive list of natural trigger phrases users might say, covering many variations like 'infra wizard', 'deploy everything', 'set up my infrastructure', 'walk me through infrastructure', and more. These are realistic phrases a user would naturally use. | 3 / 3 |
| Distinctiveness / Conflict Risk | The extensive trigger phrase list and 'Arness' branding help distinguish it, but the lack of specificity about what it actually does means it could overlap with individual infrastructure skills. Terms like 'deploy everything' and 'set up my infrastructure' are quite broad and could conflict with other deployment or infrastructure skills. | 2 / 3 |
| Total | | 8 / 12 |

Passed

Implementation

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is an ambitious orchestration skill that attempts to handle a complex multi-mode, multi-step infrastructure pipeline. Its strengths are clear step sequencing, concrete skill invocation syntax, and comprehensive error handling. Its weaknesses are significant verbosity (progress bars repeated for every step, exhaustive error enumeration), heavy reliance on unverifiable reference files for critical logic, and a monolithic structure that could benefit from further decomposition. The skill would be substantially improved by condensing repeated patterns and moving the error handling catalog to a reference file.

Suggestions

Dramatically reduce verbosity by templating the progress bar display once (e.g., 'Show progress with current step highlighted using the format from Step 2') instead of repeating the full ASCII art for every step.

Move the extensive error handling section to a reference file (e.g., references/error-handling.md) and keep only a 2-3 line summary inline, since the error catalog alone accounts for ~25% of the content.

Consolidate the mode selection, entry mode detection, and expertise-adaptive sections into a single concise decision tree or table rather than prose paragraphs with embedded logic.

Provide the bundle reference files (decision-gates.md, entry-modes.md, artifact-detection.md, full-pipeline-flow.md) so the skill can be evaluated as a complete unit — currently critical workflow logic is entirely opaque.
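
The first suggestion above (define the progress display once and reuse it for every step) can be sketched as a single rendering function. This is a minimal illustration, not the skill's actual format: the function name `render_progress`, the `[x]` / `[>]` / `[ ]` markers, and the step names are all assumptions made for the example.

```python
def render_progress(steps, current):
    """Return a one-line progress display with the step at index `current` highlighted.

    Completed steps are marked [x], the current step [>], and pending steps [ ].
    """
    if not 0 <= current < len(steps):
        raise ValueError(f"current must be in range 0..{len(steps) - 1}")
    parts = []
    for index, name in enumerate(steps):
        if index < current:
            marker = "[x]"  # already completed
        elif index == current:
            marker = "[>]"  # step in progress
        else:
            marker = "[ ]"  # not started yet
        parts.append(f"{marker} {name}")
    return " -> ".join(parts)


# Hypothetical step names; the real pipeline stages live in the skill itself.
STEPS = ["discover", "plan", "deploy", "verify"]

if __name__ == "__main__":
    print(render_progress(STEPS, 2))
```

With one template like this, each step in the skill only needs to state its index instead of repeating the full display.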

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The skill is extremely verbose at ~350+ lines with significant repetition. Progress bar displays are repeated for every step with minor variations. Error handling enumerates every possible failure scenario in exhaustive detail. The mode selection logic, entry mode detection, and expertise-adaptive sections all contain explanatory prose that could be dramatically condensed. Many sections re-explain concepts already covered (e.g., re-stating that sub-skills do the work, re-listing config fields). | 1 / 3 |
| Actionability | The skill provides concrete skill invocation names (e.g., `Skill: arn-infra:arn-infra-discover`) and specific file paths to read, which is good. However, it relies heavily on external reference files for critical logic (decision gates, entry modes, artifact detection, full pipeline flow) without providing any inline content, and no bundle files were provided. The actual decision gate questions, entry mode conditions, and artifact detection rules are all deferred to references we cannot verify exist. The AskUserQuestion prompts are concrete but many steps lack the specificity needed to be truly copy-paste executable. | 2 / 3 |
| Workflow Clarity | The workflow is clearly sequenced with numbered steps (0-9) and progress indicators, which is strong. However, critical validation/verification checkpoints are weak: there's no explicit validation between steps beyond the verify skill invocation. The error handling section is comprehensive but reads as a flat list rather than integrated into the workflow. The interaction between Quick mode, Full Pipeline mode, entry modes, and expertise levels creates significant branching complexity that isn't fully resolved in the workflow section (e.g., Full Pipeline flow is entirely deferred to a reference file). | 2 / 3 |
| Progressive Disclosure | The skill appropriately defers detailed logic to reference files (entry-modes.md, decision-gates.md, artifact-detection.md, full-pipeline-flow.md, ensure-config.md, experience-derivation.md), which is good progressive disclosure design. However, no bundle files were provided to verify these references exist or are well-structured. The main SKILL.md itself is still a monolithic wall of text that could benefit from better organization: the inline content (especially error handling and constraints) is extensive and could be split into references. The references are clearly signaled with read instructions, which is positive. | 2 / 3 |
| Total | | 7 / 12 |

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 10 / 11 passed

Validation for skill structure

| Criteria | Description | Result |
| --- | --- | --- |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 10 / 11 |

Passed
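
To reproduce a check like the `frontmatter_unknown_keys` warning above locally, the sketch below lists top-level frontmatter keys that fall outside an allowed set. It is only an approximation under stated assumptions: the `ALLOWED_KEYS` set is a guess at the spec's recognized keys, the key parsing is deliberately simplified (no YAML library), and the `version` key in the sample is hypothetical, since the review does not say which key triggered the warning.

```python
# Assumed set of recognized frontmatter keys; check the skill spec for the real list.
ALLOWED_KEYS = {"name", "description", "license", "allowed-tools", "metadata"}


def frontmatter_keys(text):
    """Return top-level keys from a leading '---' delimited frontmatter block."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no frontmatter present
    keys = []
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter reached
        # Simplification: a top-level key is an unindented, non-comment line with a colon.
        if line and not line[0].isspace() and not line.startswith("#") and ":" in line:
            keys.append(line.split(":", 1)[0].strip())
    return keys


def unknown_keys(text, allowed=ALLOWED_KEYS):
    """Return frontmatter keys that are not in the allowed set, in file order."""
    return [key for key in frontmatter_keys(text) if key not in allowed]


# Hypothetical SKILL.md content used only to exercise the check.
SAMPLE = """---
name: arn-infra-wizard
description: Placeholder description.
version: 1.0.0
---
# Body
"""

if __name__ == "__main__":
    print(unknown_keys(SAMPLE))
```

Following the warning's own advice, a flagged key can either be removed or nested under `metadata`.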

Repository: AppsVortex/arness
Reviewed
