cerebro-regression-tests

Add focused regression coverage for Cerebro review findings, bugs, and security edge cases.

1.11x

Quality

—

Does it follow best practices?

Impact

94%

1.11x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Quality

Content

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is a tight, well-organized instruction skill: concise, actionable with real commands and named subsystems, and sequenced with an explicit verification checkpoint. It is a strong example of an efficient, single-purpose skill body.

Dimension	Reasoning	Score
Conciseness	The body is a lean ~14 lines with no concept-explanation padding; every line is directive and assumes Claude's competence, so each token earns its place.	3 / 3
Actionability	It gives concrete, specific guidance: table-driven Go tests, named subsystems (parser, validator, projection, graph, source connector), enumerated security boundary categories, and real commands (`-count=1 -v`, `make verify`).	3 / 3
Workflow Clarity	The five-step instruction list is clearly sequenced and includes an explicit verification step (`-count=1 -v` then `make verify`) plus a Success Criteria that asserts the test fails against unfixed behavior, providing a feedback checkpoint.	3 / 3
Progressive Disclosure	The skill is under 50 lines, needs no external references, and is organized into clean ## Instructions and ## Success Criteria sections, satisfying the simple-skill allowance for a top progressive-disclosure score.	3 / 3
	Total	12 / 12 Passed

Description

50%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description clearly conveys a focused niche (regression coverage for Cerebro review findings) but is missing an explicit "Use when..." trigger clause and lacks multiple distinct concrete actions. Adding explicit invocation triggers and a couple more named actions would lift it across dimensions.

Suggestions

Add an explicit trigger clause, e.g. "Use when fixing Cerebro review findings, bugs, or security edge cases that warrant regression coverage."

List multiple concrete actions rather than a single verb, e.g. "Reproduce, write table-driven Go tests, and verify regression boundaries for...".

Include natural user phrasings like "regression test" or "add a test for this bug fix" to broaden trigger-term coverage.

Dimension	Reasoning	Score
Specificity	The description names the domain ("Cerebro review findings, bugs, and security edge cases") and an action ("Add focused regression coverage"), but relies on a single verb rather than listing multiple distinct concrete actions, so it stops short of the top anchor.	2 / 3
Completeness	It states what the skill does ("Add focused regression coverage...") but provides no "Use when..." clause or equivalent explicit trigger guidance, so the "when" dimension is only implied and is capped at 2 per the rubric guidelines.	2 / 3
Trigger Term Quality	Terms like "regression coverage", "bugs", and "security edge cases" are domain-relevant and would plausibly be said by a user, but common natural variations such as "test" or "regression test" are absent, leaving coverage incomplete.	2 / 3
Distinctiveness Conflict Risk	The Cerebro-specific scoping gives it a clear niche, but without explicit trigger phrases it could still overlap with general testing skills, so it does not reach the clearly-distinguishable top anchor.	2 / 3
	Total	8 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository: writer/cerebro
Commit: f4c73ab

Reviewed: 4 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.