CtrlK
BlogDocsLog inGet started
Tessl Logo

arn-code-report

This skill should be used when the user says "report issue", "arness code report", "code report", "something went wrong", "report a bug", "file arness code issue", "arness code broke", "report arness code problem", "diagnose issue", "arness doctor", "run doctor", "diagnose arness code", "arn-code-report", or wants to report a problem with an Arness Code workflow skill. Invokes the arn-code-doctor agent to diagnose the issue, then files a GitHub issue on the Arness plugin repository. Do NOT use this for filing issues on the user's own project — use /arn-code-create-issue for that. For Spark issues use /arn-spark-report. For Infra issues use /arn-infra-report.

72

Quality

88%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted, complex workflow skill with excellent actionability and workflow clarity. The multi-step process is clearly sequenced with appropriate validation checkpoints, consent gates, and comprehensive error handling with fallbacks. The main weaknesses are moderate verbosity (the scripted explanation in Step 1, repeated fallback filename patterns) and the inability to verify referenced bundle files, though the references themselves are well-structured.

Suggestions

Remove the scripted explanation in Step 1 — Claude can generate contextually appropriate explanations without a verbatim script, saving ~8 lines of tokens.

Define the fallback filename pattern once (e.g., at the top or in Constraints) rather than repeating `arness-code-report-<YYYY-MM-DD>.md` six times across the document.

DimensionReasoningScore

Conciseness

The skill is reasonably well-structured but includes some verbosity — the full explanation script in Step 1 is somewhat unnecessary since Claude can generate appropriate explanations contextually. The smart routing keyword lists and repeated fallback file naming patterns add bulk. However, most content is functional and necessary for the complex workflow.

2 / 3

Actionability

The skill provides concrete, executable commands (gh issue create with specific flags), specific file paths (${CLAUDE_PLUGIN_ROOT}/.claude-plugin/plugin.json), exact tool names (AskUserQuestion, Task tool), and clear decision trees. The bash command for issue creation is copy-paste ready, and fallback behaviors are explicitly defined.

3 / 3

Workflow Clarity

The workflow is clearly sequenced (Steps 0-6) with explicit validation checkpoints: user consent before submission (Step 5), label existence check with retry logic (Step 6), prerequisite verification (Step 3), and multiple fallback paths. The feedback loop for 'Edit first' is well-defined, and error handling covers all failure modes with specific recovery actions.

3 / 3

Progressive Disclosure

The skill references external files (arness-knowledge-base.md, issue-template.md, ensure-config.md) which is good progressive disclosure, but no bundle files were provided to verify these exist. The main SKILL.md itself is quite long (~150 lines of substantive content) and some sections like the detailed error handling could potentially be split out. The references are one-level deep and clearly signaled, which is positive.

2 / 3

Total

10

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that excels across all dimensions. It provides extensive trigger terms, clearly explains what the skill does and when to use it, and explicitly disambiguates from related skills to minimize conflict risk. The only minor note is that the description leads with trigger terms rather than a capability summary, but this doesn't materially harm its effectiveness.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: invoking the arn-code-doctor agent to diagnose issues, filing a GitHub issue on the Arness plugin repository. Also clearly distinguishes what it does NOT do.

3 / 3

Completeness

Clearly answers both 'what does this do' (diagnoses issues via arn-code-doctor agent, files GitHub issues on the Arness plugin repo) AND 'when should Claude use it' (explicit trigger phrases and a clear 'Use when' equivalent at the start). Also includes explicit negative boundaries for when NOT to use it.

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms users would say, including many variations like 'report issue', 'something went wrong', 'report a bug', 'arness code broke', 'diagnose issue', 'run doctor', and the command alias 'arn-code-report'.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive with explicit disambiguation from related skills (/arn-code-create-issue for user project issues, /arn-spark-report for Spark issues, /arn-infra-report for Infra issues). The scope is narrowly defined to Arness Code workflow skill problems only.

3 / 3

Total

12

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
AppsVortex/arness
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.