debugging

Enforces systematic root-cause investigation before any fix attempt, using a four-phase process (investigate, analyze patterns, hypothesize, implement). Use when encountering bugs, test failures, build errors, unexpected behavior, performance problems, or integration issues — especially when under pressure, when a "quick fix" seems obvious, or when previous fix attempts have failed. DO NOT TRIGGER for new feature development or refactoring without a known defect.

Quality

73%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Fix and improve this skill with Tessl

tessl review fix ./skills/debugging/SKILL.md

Quality

Content

47%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill has a well-structured four-phase debugging workflow with good escalation logic and feedback loops, but is severely undermined by excessive verbosity. The same core principle is repeated in numerous forms (iron law, red flags, rationalizations table, human partner signals), and much of the content is motivational rather than instructional. Trimming redundant sections and making the remaining guidance more concrete would significantly improve this skill.

Suggestions

Cut content by ~50%: merge 'Red Flags,' 'Common Rationalizations,' and 'Human Partner Signals' into a single short checklist — they all say the same thing (stop guessing, return to Phase 1).

Remove the 'When to Use' don't-skip-when and excuse/reality sections entirely — Claude doesn't need motivational arguments about why process matters.

Make Phases 2-3 more actionable with concrete examples: show a specific diff comparison technique, a hypothesis template with actual filled-in example, or a minimal test change pattern rather than abstract checklists.

Move the rationalizations table and real-world impact statistics to a separate supporting file to reduce the main skill's token footprint.

Dimension	Reasoning	Score
Conciseness	Extremely verbose at ~250+ lines. Extensive sections on rationalizations, red flags, 'human partner signals,' and motivational content ('systematic is faster than thrashing') are things Claude already knows or doesn't need repeated multiple times. The same core message (don't fix without investigating) is restated in at least 6 different ways. The 'When to Use' section with 'Don't skip when' and 'Common Rationalizations' table are largely redundant with each other.	1 / 3
Actionability	The four-phase process provides a reasonable framework with some concrete examples (the bash diagnostic instrumentation example is good), but most guidance is procedural advice rather than executable commands or code. Phases 2-4 are largely abstract checklists ('Find working examples,' 'Form single hypothesis') rather than concrete, copy-paste-ready techniques. The one code example is domain-specific (macOS codesign) rather than generalizable.	2 / 3
Workflow Clarity	The four-phase workflow is clearly sequenced with explicit gates between phases ('MUST complete each phase before proceeding'). Phase 4 includes a well-designed feedback loop (if fix doesn't work, return to Phase 1; if 3+ fixes fail, escalate to architectural review). Validation checkpoints are present throughout, and the escalation path at 3+ failed fixes is a strong error-recovery mechanism.	3 / 3
Progressive Disclosure	References to supporting files (root-cause-tracing.md, defense-in-depth.md, condition-based-waiting.md) and related skills (kit:tdd, kit:verify) are clearly signaled. However, the main SKILL.md itself is monolithic with substantial content that could be split out (the rationalizations table, red flags section, and real-world impact section add bulk without being core workflow). No bundle files were provided to verify referenced paths exist.	2 / 3
	Total	8 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly communicates a specific debugging methodology, provides comprehensive trigger terms covering common debugging scenarios, and includes both positive triggers and explicit exclusions. The four-phase process naming adds specificity, and the 'DO NOT TRIGGER' clause is a strong differentiator that reduces conflict risk with other development-related skills.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions via the four-phase process (investigate, analyze patterns, hypothesize, implement) and clearly describes the methodology as 'systematic root-cause investigation before any fix attempt.'	3 / 3
Completeness	Clearly answers both 'what' (enforces systematic root-cause investigation using a four-phase process) and 'when' (explicit 'Use when...' clause with multiple trigger scenarios, plus a 'DO NOT TRIGGER' exclusion clause for additional clarity).	3 / 3
Trigger Term Quality	Excellent coverage of natural trigger terms users would encounter: 'bugs', 'test failures', 'build errors', 'unexpected behavior', 'performance problems', 'integration issues', 'quick fix', 'previous fix attempts have failed.' These are terms users naturally use when describing debugging scenarios.	3 / 3
Distinctiveness Conflict Risk	Highly distinctive — focuses specifically on a debugging methodology/process rather than a technology domain. The explicit exclusion of 'new feature development or refactoring without a known defect' further reduces conflict risk with other coding skills.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: shousper/claude-kit
Commit: 08bc600

Reviewed: 14 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.