Use when encountering any bug, test failure, or unexpected behavior, before proposing fixes
Install with Tessl CLI
npx tessl i github:joshmanders/dotfiles --skill systematic-debugging64
Quality
50%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Optimize this skill with Tessl
npx tessl skill review --optimize ./claude/skills/systematic-debugging/SKILL.mdDiscovery
14%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description fails to explain what the skill actually does, focusing only on when to use it. The trigger conditions are overly broad ('any bug, test failure, or unexpected behavior') making it likely to conflict with other debugging-related skills. Without knowing the skill's concrete actions, Claude cannot make informed decisions about when to select it.
Suggestions
Add concrete actions describing what the skill does (e.g., 'Systematically diagnoses root causes by analyzing stack traces, reproducing issues, and isolating variables').
Narrow the scope to reduce conflict risk - specify what type of debugging approach this represents (e.g., 'binary search debugging', 'log analysis', 'hypothesis-driven investigation').
Expand trigger terms with natural variations users would say: 'error', 'crash', 'not working', 'broken', 'failing tests', 'exception'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description uses vague language like 'any bug, test failure, or unexpected behavior' without describing concrete actions. It doesn't specify what the skill actually does - only when to use it. | 1 / 3 |
Completeness | The description only addresses 'when' to use the skill but completely omits 'what' the skill does. There's no indication of the actual capabilities or actions performed. | 1 / 3 |
Trigger Term Quality | Contains some natural keywords users might say ('bug', 'test failure', 'unexpected behavior'), but these are fairly generic debugging terms that could apply to many contexts. Missing specific variations like 'error', 'crash', 'failing tests', 'broken'. | 2 / 3 |
Distinctiveness Conflict Risk | Extremely generic scope covering 'any bug' or 'unexpected behavior' would conflict with virtually any debugging, testing, or troubleshooting skill. No clear niche is established. | 1 / 3 |
Total | 5 / 12 Passed |
Implementation
85%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, well-structured debugging skill with excellent workflow clarity and actionability. The four-phase process is clearly sequenced with explicit validation gates and feedback loops. The main weakness is moderate verbosity—the content could be tightened by consolidating overlapping sections (red flags, rationalizations, partner signals) without losing clarity.
Suggestions
Consolidate the 'Red Flags', 'Rationalizations', and 'Partner Signals' sections into a single 'Warning Signs' section to reduce redundancy
Remove the 'Real-World Impact' section at the end—it's motivational rather than instructional and Claude doesn't need convincing
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is comprehensive but includes some redundancy (e.g., multiple tables restating similar points, repeated 'STOP' warnings). The rationalizations table and red flags section overlap significantly. Some sections could be tightened while preserving clarity. | 2 / 3 |
Actionability | Provides concrete, executable guidance with specific bash examples for diagnostic instrumentation, clear phase-by-phase instructions, and explicit criteria for each step. The multi-layer system example is copy-paste ready and demonstrates exactly what to do. | 3 / 3 |
Workflow Clarity | Excellent multi-step workflow with explicit phases, clear success criteria for each phase, and strong validation checkpoints ('If ≥ 3: STOP', 'MUST have before fixing'). Includes feedback loops for error recovery and explicit gates preventing progression without completion. | 3 / 3 |
Progressive Disclosure | Well-structured with clear overview, phases broken into digestible sections, and appropriate references to supporting techniques in separate files (root-cause-tracing.md, defense-in-depth.md). Quick reference table provides scannable summary without duplicating detail. | 3 / 3 |
Total | 11 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 13 / 16 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
metadata_version | 'metadata' field is not a dictionary | Warning |
license_field | 'license' field is missing | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 13 / 16 Passed | |
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.