CtrlK
BlogDocsLog inGet started
Tessl Logo

arn-spark-spike

This skill should be used when the user says "spike", "arn spike", "validate risks", "technical validation", "proof of concept", "validate architecture", "risk spike", "test this risk", "will this work", "technical spike", "validate the stack", or wants to validate critical technical risks from the architecture vision by creating minimal proof-of-concept code and testing whether the chosen technologies work as expected.

56

Quality

64%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/arn-spark/skills/arn-spark-spike/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

82%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description excels at providing explicit trigger terms and clearly stating when the skill should be used, making it strong on completeness and trigger term quality. However, the actual capabilities (the 'what') could be more specific—listing concrete actions like 'creates isolated test projects, runs integration tests, benchmarks performance' would strengthen specificity. The description is also somewhat at risk of overlapping with general prototyping or architecture skills due to broad terms like 'proof of concept'.

Suggestions

Add more specific concrete actions beyond 'creating minimal proof-of-concept code'—e.g., 'creates isolated test projects, runs integration tests against APIs, validates library compatibility, benchmarks performance characteristics'.

Narrow some of the broader trigger terms or add qualifying context to reduce overlap with general prototyping or architecture skills—e.g., clarify that this is specifically for validating identified risks from an architecture vision document.

DimensionReasoningScore

Specificity

The description mentions some concrete actions like 'creating minimal proof-of-concept code' and 'testing whether the chosen technologies work as expected,' but it doesn't list multiple distinct specific capabilities. The actions are somewhat vague—'validate critical technical risks' is more of a goal than a concrete action.

2 / 3

Completeness

The description explicitly answers both 'what' (validate critical technical risks by creating minimal proof-of-concept code and testing technologies) and 'when' (with a comprehensive list of trigger phrases prefaced by 'This skill should be used when'). Both dimensions are clearly addressed.

3 / 3

Trigger Term Quality

The description includes an extensive list of natural trigger terms users would actually say: 'spike', 'proof of concept', 'will this work', 'validate architecture', 'technical spike', 'test this risk', etc. These cover many natural variations of how a user might request this functionality.

3 / 3

Distinctiveness Conflict Risk

While terms like 'spike' and 'arn spike' are fairly distinctive, broader terms like 'proof of concept', 'will this work', and 'validate architecture' could overlap with general coding, architecture design, or prototyping skills. The niche of risk validation spikes is somewhat specific but not fully distinct from adjacent skills.

2 / 3

Total

10

/

12

Passed

Implementation

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured conversational workflow skill with excellent step sequencing and error handling, but it suffers from significant verbosity and redundancy. Configuration lookup instructions are repeated across multiple steps, and the parallel-execution warning appears three times. The skill would benefit from concrete examples of spike POC structures and validation criteria rather than abstract descriptions.

Suggestions

Extract the repeated CLAUDE.md/Arness configuration lookup logic into a single 'Configuration Resolution' section referenced by later steps, eliminating the three near-identical blocks.

Add a concrete example of a spike POC (e.g., a sample spike directory structure, a sample validation criterion like 'WebRTC connection established in < 2s', and sample agent invocation parameters) to improve actionability.

Move the Agent Invocation Guide and Error Handling tables to a separate reference file (e.g., references/spike-error-handling.md) to reduce the main skill's token footprint.

Consolidate the parallel/background agent warning into a single prominent callout rather than repeating it in Steps 3, Agent Invocation Guide, and Error Handling.

DimensionReasoningScore

Conciseness

The skill is extremely verbose at ~200+ lines with significant redundancy. Configuration lookup logic is repeated multiple times (Steps 1, 3, 4 all re-read CLAUDE.md for Arness sections). The agent invocation guide and error handling sections repeat information already covered in the workflow (e.g., parallel/background agent warnings appear three times). Much of the conversational template text could be condensed.

1 / 3

Actionability

The skill provides a clear conversational workflow with specific steps and user interaction patterns, but lacks concrete executable code or commands. The spike execution is delegated to an agent with only abstract descriptions of what to pass. There are no code examples, no sample spike POC structures, and the validation criteria are described abstractly rather than with concrete examples.

2 / 3

Workflow Clarity

The multi-step workflow is clearly sequenced (Steps 1-6) with explicit validation checkpoints: user approval before each spike, result presentation after each spike, failure resolution before proceeding, and architecture vision update confirmation. The sequential execution requirement is emphasized with clear rationale. Error recovery paths are well-defined with specific fallback actions.

3 / 3

Progressive Disclosure

The skill references external files (spike-report-template.md, ensure-config.md, agent-models/spark.md) which is good progressive disclosure, but the main body itself is monolithic with substantial inline content that could be split out. The Agent Invocation Guide and Error Handling sections are large tables/lists that could be separate reference files. No bundle files were provided to verify referenced paths exist.

2 / 3

Total

8

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
AppsVortex/arness
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.