This skill should be used when the user says "refresh infra references", "update tool versions", "check for new MCPs", "update infra patterns", "refresh registries", "arn infra refresh", "infra refresh", "update references", "check for updates", "refresh infrastructure tools", "update MCP servers", "refresh CLI versions", "update base images", "arn-infra-refresh", or wants to update the version-sensitive infrastructure reference files (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) using online research.
74
68%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugins/arn-infra/skills/arn-infra-refresh/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description excels at trigger term coverage and completeness, providing an exhaustive list of phrases that would activate the skill and clearly stating both what it does and when to use it. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed beyond just 'updating' references. The description is functional and effective for skill selection but reads more like a trigger list than a balanced capability description.
Suggestions
Add 2-3 specific concrete actions describing how the skill works, e.g., 'Researches latest stable versions online, updates pinned version references in configuration files, and validates compatibility across the infrastructure stack.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description mentions some concrete domains (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) but doesn't list specific actions beyond 'update' and 'refresh'. It names what kinds of references are updated but lacks detail on how (e.g., 'researches latest versions online, updates reference files, validates compatibility'). | 2 / 3 |
Completeness | The description explicitly answers both 'what' (update version-sensitive infrastructure reference files including tool versions, MCP packages, CLI commands, base image tags, IaC patterns using online research) and 'when' (with a comprehensive list of trigger phrases prefaced by 'This skill should be used when'). The 'when' clause is very explicit. | 3 / 3 |
Trigger Term Quality | The description includes an extensive list of natural trigger phrases users would say, covering many variations like 'refresh infra references', 'update tool versions', 'check for new MCPs', 'update base images', and shorthand forms like 'arn-infra-refresh'. This provides excellent coverage of how users would naturally invoke this skill. | 3 / 3 |
Distinctiveness Conflict Risk | The skill occupies a very specific niche — refreshing infrastructure reference files with version-sensitive data using online research. The trigger terms are highly specific (e.g., 'arn infra refresh', 'update MCP servers', 'refresh CLI versions') and unlikely to conflict with general coding or document skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
47%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
The skill has excellent workflow clarity with well-defined steps, validation checkpoints, and comprehensive error handling. However, it is significantly over-verbose — the three experience-level UI variants, full message templates, and exhaustive error cases make it very long without proportional value. Actionability is moderate: paths and structures are specified but lack concrete executable examples (e.g., actual JSON schemas, shell commands for checksums).
Suggestions
Reduce verbosity by consolidating the three experience-level scope selection variants into a compact table or decision matrix rather than spelling out each full dialogue flow.
Add a concrete example of the .reference-checksums.json structure so Claude knows the exact schema it's reading and writing.
Move the detailed error handling cases and output message templates to a separate reference file (e.g., error-handling.md) to reduce the main skill's token footprint.
Replace the verbose researcher agent invocation template with a more concise specification, trusting Claude to construct the prompt appropriately.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~300+ lines with extensive templated output messages, detailed UI flows for three experience levels, and exhaustive error handling cases. Much of this could be condensed significantly — Claude doesn't need full message templates spelled out verbatim, and the three experience-level variants of scope selection are repetitive. The error handling section alone has 10 cases with full prose descriptions. | 1 / 3 |
Actionability | The skill provides a clear multi-step workflow with specific file paths, JSON field names, and agent invocation patterns. However, there is no executable code — no actual SHA-256 computation commands, no concrete examples of the checksums JSON structure, and the researcher agent invocation is a text template rather than executable. Key details like the actual format of .reference-checksums.json are described but never shown. | 2 / 3 |
Workflow Clarity | The 6-step workflow is clearly sequenced with explicit validation checkpoints: checksum comparison before writes (Step 5.3), conflict detection with user prompts, reconciliation of new files in Step 1.3, and a comprehensive error handling section covering 10 failure modes with recovery actions. The feedback loop for checksum mismatches is well-defined. | 3 / 3 |
Progressive Disclosure | The skill references external files appropriately (reference-manifest.md, research-strategies.md, experience-derivation.md) with clear paths, but the SKILL.md itself is monolithic — all three experience-level flows, all error cases, and all output templates are inline rather than being split into separate reference files. The body content that could be externalized (e.g., error handling, output templates) bloats the main file. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
1fe948f
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.