This skill should be used when the user says "refresh infra references", "update tool versions", "check for new MCPs", "update infra patterns", "refresh registries", "arn infra refresh", "infra refresh", "update references", "check for updates", "refresh infrastructure tools", "update MCP servers", "refresh CLI versions", "update base images", "arn-infra-refresh", or wants to update the version-sensitive infrastructure reference files (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) using online research.
59
68%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Risky
Do not use without reviewing
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugins/arn-infra/skills/arn-infra-refresh/SKILL.mdQuality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description excels at trigger term coverage and completeness, providing an exhaustive list of invocation phrases and clearly stating both what the skill does and when to use it. Its main weakness is that the capability description itself is somewhat general — it says 'update' reference files but doesn't enumerate the specific actions taken (e.g., querying registries, comparing versions, writing updated files). The description is also front-loaded with trigger phrases, making it read more like a keyword list than a natural description.
Suggestions
Add 2-3 specific concrete actions describing what the skill actually does, e.g., 'Queries package registries and release pages to find latest versions, updates reference YAML/JSON files with current version pins, and summarizes what changed.'
Restructure to lead with capabilities first, then trigger phrases, e.g., 'Updates version-sensitive infrastructure reference files (tool versions, MCP packages, CLI commands, base image tags, IaC patterns) by researching current releases online. Use when...'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description mentions some concrete elements like 'tool versions, MCP packages, CLI commands, base image tags, IaC patterns' and 'version-sensitive infrastructure reference files', but the actual actions are vague — it only says 'update' and 'using online research' without detailing specific steps like 'query package registries', 'compare semantic versions', or 'update YAML files'. | 2 / 3 |
Completeness | The description explicitly answers both 'what' (update version-sensitive infrastructure reference files including tool versions, MCP packages, CLI commands, base image tags, IaC patterns using online research) and 'when' (with a comprehensive list of trigger phrases and the general condition 'wants to update the version-sensitive infrastructure reference files'). The 'Use when' equivalent is clearly stated. | 3 / 3 |
Trigger Term Quality | The description includes an extensive list of natural trigger phrases users would say, covering many variations like 'refresh infra references', 'update tool versions', 'check for new MCPs', 'update base images', 'refresh CLI versions', and shorthand forms like 'arn-infra-refresh'. This provides excellent coverage of how users would naturally invoke this skill. | 3 / 3 |
Distinctiveness Conflict Risk | The skill occupies a very specific niche — refreshing infrastructure version references using online research. The trigger terms are highly distinctive (e.g., 'arn infra refresh', 'update MCP servers', 'refresh registries') and unlikely to conflict with general coding, deployment, or documentation skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
47%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill has excellent workflow clarity with well-defined steps, validation checkpoints, and comprehensive error handling. However, it is significantly over-verbose — the three experience-level UI variants, full message templates, and exhaustive error cases inflate the token cost substantially. Actionability is moderate: while paths and structures are referenced, there are no executable code examples for key operations like checksum computation or JSON manipulation.
Suggestions
Reduce verbosity by extracting the experience-level-specific UI flows into a separate reference file (e.g., scope-selection-flows.md) and keeping only the logic summary in SKILL.md
Add a concrete example of the .reference-checksums.json schema so Claude knows the exact structure to read/write
Replace the verbose error handling prose with a concise table (Error → Action → Message) to cut token usage by ~50% in that section
Remove the full verbatim output message templates — Claude can generate appropriate summary messages from a brief format specification
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~250+ lines with extensive templated output messages, detailed UI flows for three experience levels, and exhaustive error handling cases. Much of this could be condensed significantly — Claude doesn't need full message templates spelled out verbatim, and the three experience-level variants of scope selection are repetitive. The error handling section alone has 10 cases with full prose descriptions. | 1 / 3 |
Actionability | The skill provides concrete steps and references specific file paths, tools, and data structures (checksums JSON, manifest files, agent invocation). However, there is no executable code — no actual SHA-256 computation commands, no example JSON schema for .reference-checksums.json, and the agent invocation is described in prose rather than with a concrete tool call example. Key details like the exact structure of the checksums file are left implicit. | 2 / 3 |
Workflow Clarity | The 6-step workflow is clearly sequenced with explicit validation checkpoints: checksum comparison before writes (Step 5, item 3), conflict detection with user prompts, reconciliation of new files in Step 1, and a comprehensive error handling section covering 10 failure modes with specific recovery actions. The feedback loop for checksum mismatches is well-defined. | 3 / 3 |
Progressive Disclosure | The skill references external files appropriately (reference-manifest.md, research-strategies.md, experience-derivation.md, ensure-config.md) with clear paths, which is good progressive disclosure. However, the SKILL.md itself is monolithic — the three experience-level UI flows, the detailed error handling, and the output templates could be externalized. No bundle files were provided to verify the referenced paths exist. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
b9084b6
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.