CtrlK
BlogDocsLog inGet started
Tessl Logo

arn-infra-refresh

This skill should be used when the user says "refresh infra references", "update tool versions", "check for new MCPs", "update infra patterns", "refresh registries", "arn infra refresh", "infra refresh", "update references", "check for updates", "refresh infrastructure tools", "update MCP servers", "refresh CLI versions", "update base images", "arn-infra-refresh", or wants to update the version-sensitive infrastructure reference files (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) using online research.

74

Quality

68%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/arn-infra/skills/arn-infra-refresh/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description excels at trigger term coverage and completeness, providing an exhaustive list of phrases that would activate the skill and clearly stating both what it does and when to use it. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed beyond just 'updating' references. The description is functional and effective for skill selection but reads more like a trigger list than a balanced capability description.

Suggestions

Add 2-3 specific concrete actions describing how the skill works, e.g., 'Researches latest stable versions online, updates pinned version references in configuration files, and validates compatibility across the infrastructure stack.'

DimensionReasoningScore

Specificity

The description mentions some concrete domains (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) but doesn't list specific actions beyond 'update' and 'refresh'. It names what kinds of references are updated but lacks detail on how (e.g., 'researches latest versions online, updates reference files, validates compatibility').

2 / 3

Completeness

The description explicitly answers both 'what' (update version-sensitive infrastructure reference files including tool versions, MCP packages, CLI commands, base image tags, IaC patterns using online research) and 'when' (with a comprehensive list of trigger phrases prefaced by 'This skill should be used when'). The 'when' clause is very explicit.

3 / 3

Trigger Term Quality

The description includes an extensive list of natural trigger phrases users would say, covering many variations like 'refresh infra references', 'update tool versions', 'check for new MCPs', 'update base images', and shorthand forms like 'arn-infra-refresh'. This provides excellent coverage of how users would naturally invoke this skill.

3 / 3

Distinctiveness Conflict Risk

The skill occupies a very specific niche — refreshing infrastructure reference files with version-sensitive data using online research. The trigger terms are highly specific (e.g., 'arn infra refresh', 'update MCP servers', 'refresh CLI versions') and unlikely to conflict with general coding or document skills.

3 / 3

Total

11

/

12

Passed

Implementation

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill has excellent workflow clarity with well-defined steps, validation checkpoints, and comprehensive error handling. However, it is significantly over-verbose — the three experience-level UI variants, full message templates, and exhaustive error cases make it very long without proportional value. Actionability is moderate: paths and structures are specified but lack concrete executable examples (e.g., actual JSON schemas, shell commands for checksums).

Suggestions

Reduce verbosity by consolidating the three experience-level scope selection variants into a compact table or decision matrix rather than spelling out each full dialogue flow.

Add a concrete example of the .reference-checksums.json structure so Claude knows the exact schema it's reading and writing.

Move the detailed error handling cases and output message templates to a separate reference file (e.g., error-handling.md) to reduce the main skill's token footprint.

Replace the verbose researcher agent invocation template with a more concise specification, trusting Claude to construct the prompt appropriately.

DimensionReasoningScore

Conciseness

The skill is extremely verbose at ~300+ lines with extensive templated output messages, detailed UI flows for three experience levels, and exhaustive error handling cases. Much of this could be condensed significantly — Claude doesn't need full message templates spelled out verbatim, and the three experience-level variants of scope selection are repetitive. The error handling section alone has 10 cases with full prose descriptions.

1 / 3

Actionability

The skill provides a clear multi-step workflow with specific file paths, JSON field names, and agent invocation patterns. However, there is no executable code — no actual SHA-256 computation commands, no concrete examples of the checksums JSON structure, and the researcher agent invocation is a text template rather than executable. Key details like the actual format of .reference-checksums.json are described but never shown.

2 / 3

Workflow Clarity

The 6-step workflow is clearly sequenced with explicit validation checkpoints: checksum comparison before writes (Step 5.3), conflict detection with user prompts, reconciliation of new files in Step 1.3, and a comprehensive error handling section covering 10 failure modes with recovery actions. The feedback loop for checksum mismatches is well-defined.

3 / 3

Progressive Disclosure

The skill references external files appropriately (reference-manifest.md, research-strategies.md, experience-derivation.md) with clear paths, but the SKILL.md itself is monolithic — all three experience-level flows, all error cases, and all output templates are inline rather than being split into separate reference files. The body content that could be externalized (e.g., error handling, output templates) bloats the main file.

2 / 3

Total

8

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
AppsVortex/arness
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.