tessl i github:jeremylongshore/claude-code-plugins-plus-skills --skill running-load-testsCreate and execute load tests for performance validation using k6, JMeter, and Artillery. Use when validating application performance under load conditions or identifying bottlenecks. Trigger with phrases like "run load test", "create stress test", or "validate performance under load".
60%
Overall
Validation
Implementation
Activation
Validation
81%| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md line count is 97 (<= 500) | Pass |
frontmatter_valid | YAML frontmatter is valid | Pass |
name_field | 'name' field is valid: 'running-load-tests' | Pass |
description_field | 'description' field is valid (286 chars) | Pass |
description_voice | 'description' uses third person voice | Pass |
description_trigger_hint | Description includes an explicit trigger hint | Pass |
compatibility_field | 'compatibility' field not present (optional) | Pass |
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
metadata_version | 'metadata' field is not a dictionary | Warning |
metadata_field | 'metadata' field not present (optional) | Pass |
license_field | 'license' field is present: MIT | Pass |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
body_present | SKILL.md body is present | Pass |
body_examples | Examples detected (code fence or 'Example' wording) | Pass |
body_output_format | Output/return/format terms detected | Pass |
body_steps | Step-by-step structure detected (ordered list) | Pass |
Total | 13 / 16 Passed |
Implementation
20%This skill content is primarily descriptive rather than instructive, explaining what load testing is and what Claude 'will do' rather than providing concrete, executable guidance. It lacks any actual code examples for k6, JMeter, or Artillery despite claiming to support all three. The content would benefit significantly from replacing explanatory prose with executable script templates and specific commands.
Suggestions
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Extremely verbose with extensive explanation of concepts Claude already knows. Sections like 'Overview', 'How It Works', and 'When to Use This Skill' explain what load testing is and describe Claude's own capabilities rather than providing actionable guidance. | 1 / 3 |
Actionability | No executable code or concrete commands provided. The 'Examples' section describes what the skill 'will do' rather than showing actual k6/JMeter/Artillery scripts. Instructions are abstract ('Generate appropriate load test scripts') rather than copy-paste ready. | 1 / 3 |
Workflow Clarity | Steps are listed in the Instructions section with a clear sequence, but there are no validation checkpoints or feedback loops. Missing explicit verification steps between script generation and execution, and no guidance on what to do if tests fail validation. | 2 / 3 |
Progressive Disclosure | Content is organized into sections but is monolithic - everything is in one file with no references to external documentation. The Resources section mentions documentation but provides no actual links. Content that could be split (tool-specific examples) is absent entirely. | 2 / 3 |
Total | 6 / 12 Passed |
Activation
100%This is a well-crafted skill description that excels across all dimensions. It specifies concrete actions with named tools, provides natural trigger phrases users would actually say, and clearly distinguishes itself from other testing or performance-related skills. The description follows best practices by using third person voice and explicitly stating both capabilities and usage triggers.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions ('Create and execute load tests') and names specific tools (k6, JMeter, Artillery) along with the purpose (performance validation, identifying bottlenecks). | 3 / 3 |
Completeness | Clearly answers both what (create/execute load tests using k6, JMeter, Artillery) and when (validating performance under load, identifying bottlenecks) with explicit trigger phrases provided. | 3 / 3 |
Trigger Term Quality | Includes natural trigger phrases users would say: 'run load test', 'create stress test', 'validate performance under load'. Also includes domain terms like 'performance validation' and 'bottlenecks'. | 3 / 3 |
Distinctiveness Conflict Risk | Clear niche focused specifically on load/stress testing with named tools. Distinct triggers like 'load test', 'stress test', and specific tool names make it unlikely to conflict with general testing or performance monitoring skills. | 3 / 3 |
Total | 12 / 12 Passed |
Reviewed
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.