detecting-performance-regressions

Automatically detect performance regressions in CI/CD pipelines by comparing metrics against baselines. Use when validating builds or analyzing performance trends. Trigger with phrases like "detect performance regression", "compare performance metrics", or "analyze performance degradation".

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is organized into recognizable sections but is descriptive rather than instructional: it restates the skill's purpose across overlapping sections and gives abstract steps with no executable commands or references to the bundled scripts that would carry them out.

Suggestions

Replace the abstract Instructions with concrete invocations of the bundled scripts (e.g., `python ${CLAUDE_SKILL_DIR}/scripts/analyze_metrics.py --baseline ...`), so Claude can act directly rather than describe.

Add explicit validation checkpoints to the workflow (e.g., verify baselines loaded before statistical analysis; on failure, follow Error Handling then re-run), turning the steps into a validate-fix-retry loop.

Collapse the overlapping Overview, How It Works, and Instructions sections into one concise process description and remove the 'When to Use' section that duplicates the frontmatter description.

Dimension	Reasoning	Score
Conciseness	Sections like Overview, How It Works, and Instructions restate the same baseline-comparison process in different words, and 'When to Use' duplicates the description; it is mostly readable but padded with redundant restatement that could be tightened.	2 / 3
Actionability	The Instructions ('Collect performance metrics', 'Apply statistical analysis') are abstract prose with no executable code or commands, and the body never references the bundled scripts (analyze_metrics.py, generate_report.py, create_github_comment.py) that would make it actionable.	1 / 3
Workflow Clarity	A six-step sequence is present in Instructions, but there are no validation checkpoints or an explicit validate-fix-retry loop; the Error Handling section lists generic checks instead of being wired into the workflow.	2 / 3
Progressive Disclosure	The body has clear section structure but does not link to the actual bundle files; the only path referenced (${CLAUDE_SKILL_DIR}/performance/baselines/) does not match the bundled scripts/ and assets/ directories, so navigation to the real reference materials is missing.	2 / 3
	Total	7 / 12 Passed

Description

90%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is well-structured: third person, concise, with an explicit 'Use when' clause and natural trigger phrases. Its only weakness is that the capability statement describes one main action rather than enumerating several concrete operations.

Dimension	Reasoning	Score
Specificity	The description names a concrete domain ('performance regressions in CI/CD pipelines') and a method ('comparing metrics against baselines'), but centers on a single core action rather than listing multiple distinct concrete operations like the score-3 anchor's 'extract, fill, merge'.	2 / 3
Completeness	It clearly states both what it does ('Automatically detect performance regressions... by comparing metrics against baselines') and when to use it ('Use when validating builds or analyzing performance trends'), with an explicit 'Use when' clause.	3 / 3
Trigger Term Quality	It provides explicit natural trigger phrases users would say ('detect performance regression', 'compare performance metrics', 'analyze performance degradation'), giving good coverage of common phrasings.	3 / 3
Distinctiveness Conflict Risk	CI/CD performance-regression detection is a clear niche with distinct triggers, making it unlikely to fire for unrelated skills.	3 / 3
	Total	11 / 12 Passed

Validation

87%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 14 / 16 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	14 / 16 Passed

Repository: jeremylongshore/claude-code-plugins-plus-skills
Commit: 3022dd3

Reviewed: 2 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.