Content
0%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill reads as a persona description or resume rather than actionable instructions. It is overwhelmingly verbose, listing hundreds of technologies and concepts Claude already knows, while providing zero concrete code examples, commands, or executable guidance. The workflow is abstract and lacks the validation checkpoints needed for performance engineering tasks that can impact production systems.
Suggestions
Replace the extensive capability listings with a concise summary and provide concrete, executable examples (e.g., a k6 load test script, a Prometheus query for latency percentiles, an OpenTelemetry tracing setup snippet).
Add explicit validation checkpoints to the workflow, such as 'Run baseline load test and record p50/p95/p99 latencies before making changes' and 'Compare post-optimization metrics against baseline before declaring success'.
Remove the 'Behavioral Traits', 'Knowledge Base', 'Purpose', and 'Example Interactions' sections. They either describe what Claude already knows or are persona-style content that spends tokens without adding actionable value.
If detailed tool-specific guidance is needed, create separate reference files (e.g., LOAD_TESTING.md, OBSERVABILITY.md) and link to them from a concise SKILL.md overview.
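To make the validation-checkpoint suggestion concrete, here is a minimal sketch of what "record p50/p95/p99 before making changes, then compare against baseline" could look like. It is an illustration only: the function names (`percentile`, `summarize`, `regressions`), the nearest-rank percentile method, and the 5% tolerance are assumptions made for this example and do not come from the reviewed skill or from any particular load-testing tool.

```python
def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of
    all samples at or below it. pct is an integer from 1 to 100."""
    if not samples:
        raise ValueError("no latency samples recorded")
    ordered = sorted(samples)
    # Integer ceiling of pct * n / 100, avoiding floating-point rounding.
    rank = max(1, (pct * len(ordered) + 99) // 100)
    return ordered[rank - 1]


def summarize(samples):
    """Checkpoint 1: record p50/p95/p99 for one load-test run."""
    return {f"p{p}": percentile(samples, p) for p in (50, 95, 99)}


def regressions(baseline, candidate, tolerance=0.05):
    """Checkpoint 2: list percentiles where the candidate run is slower
    than the baseline by more than the tolerance (hypothetical 5% default).
    An empty result means no regression was detected at these percentiles."""
    return {
        name: (baseline[name], candidate[name])
        for name in baseline
        if candidate[name] > baseline[name] * (1 + tolerance)
    }
```

A skill could instruct the agent to call `summarize` on latencies from the baseline run, repeat after the optimization, and only declare success when `regressions` returns an empty dictionary. The samples themselves would come from whatever load-testing tool the skill recommends.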
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is extremely verbose, listing exhaustive catalogs of tools, platforms, and concepts that Claude already knows. The 'Capabilities' section alone is a massive enumeration of technologies with no actionable depth, just bullet-point lists of things Claude is already familiar with. The 'Behavioral Traits', 'Knowledge Base', 'Example Interactions', and 'Purpose' sections largely repeat information already conveyed or that Claude inherently understands. | 1 / 3 |
| Actionability | There is no concrete, executable code, no specific commands, no example inputs/outputs, and no copy-paste-ready guidance. The entire skill is abstract descriptions and tool name-dropping. The 'Instructions' section has four vague steps like 'Collect traces, profiles, and load tests to isolate bottlenecks' with no specifics on how to do any of it. | 1 / 3 |
| Workflow Clarity | The four-step 'Instructions' workflow is extremely high-level with no validation checkpoints, no feedback loops, no error recovery steps, and no concrete sequencing. The 'Response Approach' section lists 9 steps but they are equally abstract. For a skill involving potentially destructive operations like load testing production, the lack of concrete validation steps is a significant gap. | 1 / 3 |
| Progressive Disclosure | The content is a monolithic wall of text with no references to external files, no layered structure, and no navigation aids. Hundreds of lines of capability listings are inlined that could be split into focused reference documents. There are no bundle files to support progressive disclosure. | 1 / 3 |
| Total | | 4 / 12 Passed |