CtrlK
BlogDocsLog inGet started
Tessl Logo

arn-assessing

This skill should be used when the user says "assessing", "arness assessing", "assess", "assess codebase", "technical review", "codebase assessment", "find improvements", "what should I improve", "tech debt review", "pattern compliance check", "codebase health check", "improvement plan", "review my codebase", "what needs fixing", "code quality check", "audit my code", "run an assessment", "arn-assessing", or wants a comprehensive technical assessment of the codebase against stored patterns followed by prioritized improvement execution. Chains to arn-implementing if improvements are identified.

56

Quality

63%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Fix and improve this skill with Tessl

tessl review fix ./plugins/arn-code/skills/arn-assessing/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

62%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured orchestration skill with clear workflow sequencing and good error handling coverage. Its main weaknesses are moderate verbosity in explaining its own architectural role (meta-commentary about being a wrapper) and limited concrete actionability since most real work is delegated to sub-skills. The workflow clarity is its strongest dimension with explicit gates and failure paths.

Suggestions

Remove meta-commentary about being a 'medium orchestration wrapper' and the internal gate count of arn-code-assess — Claude doesn't need architectural justification, just instructions.

Provide the actual file read command or tool invocation syntax for Step 0 rather than just saying 'read and follow its instructions', making the step more directly executable.

DimensionReasoningScore

Conciseness

Generally efficient but includes some unnecessary meta-commentary (e.g., explaining it's a 'medium orchestration wrapper' and that it 'MUST NOT duplicate arn-code-assess logic'). The explanation that the assess skill has '7 internal decision gates' is internal architecture detail Claude doesn't need repeated. The ASCII progress indicator is a nice touch but adds tokens for marginal value.

2 / 3

Actionability

Steps are clearly described but rely heavily on invoking other skills ('Skill: arn-code:arn-code-assess') without concrete executable commands or code. The pre-flight check is specific about which files to verify, which is good, but the actual assessment work is entirely delegated. The file path references and AskUserQuestion usage are concrete, but overall the skill is more of a routing/orchestration description than executable guidance.

2 / 3

Workflow Clarity

The workflow is clearly sequenced (Step 0 → Step 1 → Step 2 → Step 3) with explicit validation checkpoints: Step 0 ensures config exists, Step 1 validates pattern docs before proceeding with a clear block-and-suggest path if missing, and error handling covers multiple failure modes including retry options. The 'no improvements found' edge case is handled explicitly.

3 / 3

Progressive Disclosure

References external skills and a config file path appropriately, but no bundle files are provided to verify the referenced paths exist. The skill delegates to arn-code-assess and arn-implementing but doesn't provide navigation links to their documentation. The content is reasonably structured but the inline error handling section could potentially be a reference if the skill grew.

2 / 3

Total

9

/

12

Passed

Description

64%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description excels at providing trigger terms but is weak on explaining what the skill actually does concretely. It reads more like a trigger-term list than a balanced skill description. The 'what it does' portion is vague ('comprehensive technical assessment against stored patterns') and would benefit from listing specific actions and outputs.

Suggestions

Add specific concrete actions describing what the assessment does, e.g., 'Analyzes codebase structure, checks adherence to stored architectural patterns, identifies tech debt, and generates a prioritized list of improvements with severity ratings.'

Restructure the description to lead with a clear 'what it does' statement before the trigger terms, e.g., 'Performs comprehensive codebase assessment by checking pattern compliance, identifying tech debt, and producing prioritized improvement plans. Use when...'

DimensionReasoningScore

Specificity

The description mentions 'comprehensive technical assessment of the codebase against stored patterns' and 'prioritized improvement execution', which names the domain and some actions, but doesn't list multiple specific concrete actions like what the assessment checks, what outputs it produces, or what 'stored patterns' means.

2 / 3

Completeness

The 'when' is very thoroughly covered with explicit trigger terms. However, the 'what' is weak — it only vaguely says 'comprehensive technical assessment of the codebase against stored patterns followed by prioritized improvement execution' without detailing what the skill actually does, what it outputs, or what specific checks it performs. The 'what' and 'when' are somewhat imbalanced.

2 / 3

Trigger Term Quality

The description includes an extensive list of natural trigger terms users would say: 'assess codebase', 'technical review', 'find improvements', 'what should I improve', 'tech debt review', 'code quality check', 'audit my code', 'what needs fixing', etc. These cover many natural variations of how a user would request this functionality.

3 / 3

Distinctiveness Conflict Risk

The skill has some distinct triggers like 'arness assessing' and 'arn-assessing' and mentions chaining to 'arn-implementing', which helps distinguish it. However, terms like 'code quality check', 'audit my code', 'find improvements', and 'technical review' are generic enough to overlap with linting, code review, or other code analysis skills.

2 / 3

Total

9

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
AppsVortex/arness
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.