Interactive hypothesis-driven debugging with documented exploration, understanding evolution, and analysis-assisted correction.
19%

Does it follow best practices?

Impact: Pending — no eval scenarios have been run.
Passed — no known issues.

Optimize this skill with Tessl:

    npx tessl skill review --optimize ./.codex/skills/debug-with-file/SKILL.md

Quality
Discovery
0%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description is overly abstract and jargon-heavy, failing to communicate concrete actions, specific use cases, or trigger conditions. It reads more like an academic concept than a practical skill description. Claude would struggle to distinguish this from any other debugging-related skill or know when to select it.
Suggestions
Replace abstract phrases with concrete actions, e.g., 'Sets breakpoints, inspects variables, traces execution paths, and tests hypotheses to isolate bugs in code'.
Add an explicit 'Use when...' clause with natural trigger terms, e.g., 'Use when the user asks for help debugging code, finding bugs, troubleshooting errors, or diagnosing unexpected behavior'.
Specify the domain or technology scope (e.g., Python debugging, web app debugging) to reduce conflict risk with other debugging-related skills.
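Applied together, these suggestions might produce frontmatter along these lines (a hedged sketch: the skill name is taken from the review command's path, and the description wording is illustrative, not the skill's actual content):

```yaml
name: debug-with-file
description: >
  Sets breakpoints, inspects variables, traces execution paths, and tests
  hypotheses to isolate bugs, logging each debugging session to a file.
  Use when the user asks for help debugging code, finding bugs,
  troubleshooting errors, or diagnosing unexpected behavior.
```

A description in this shape answers both the "what" (concrete actions) and the "when" (explicit trigger terms) that the dimensions below score.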
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | The description uses abstract, buzzword-heavy language like 'hypothesis-driven debugging', 'documented exploration', 'understanding evolution', and 'analysis-assisted correction' without listing any concrete actions. No specific operations or tools are mentioned. | 1 / 3 |
| Completeness | The 'what' is vague and abstract, and there is no 'when' clause or explicit trigger guidance at all. There is no 'Use when...' or equivalent guidance for Claude to know when to select this skill. | 1 / 3 |
| Trigger Term Quality | The terms used are academic and jargon-heavy ('hypothesis-driven', 'understanding evolution', 'analysis-assisted correction') and unlikely to match natural user queries. 'Debugging' is the only natural keyword, but it's buried in abstract phrasing. | 1 / 3 |
| Distinctiveness / Conflict Risk | The description is so vague that it could overlap with any debugging, troubleshooting, or code analysis skill. 'Debugging' alone is extremely broad, and the modifiers don't narrow the scope to a clear niche. | 1 / 3 |
| Total | | 4 / 12 Passed |
Implementation
39%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
The skill has a well-thought-out debugging workflow with clear sequencing and good error handling, but suffers severely from verbosity and repetition. The same flow is described three times in different formats, templates are duplicated, and many code examples are pseudocode stubs rather than executable code. The entire content should be split across multiple files with SKILL.md serving as a concise overview.
Suggestions
Reduce content by 60%+ by eliminating duplicate flow descriptions (the execution process, iteration flow, and implementation details all describe the same workflow) and moving templates/format specs to separate reference files.
Replace pseudocode stubs (extractErrorKeywords, analyzeSearchResults, evaluateEvidence, generateHypotheses) with actual executable implementations or remove them entirely if Claude is expected to implement them contextually.
Split into SKILL.md (overview + quick reference) with references to separate files: TEMPLATES.md (understanding.md template, instrumentation templates), FORMAT.md (NDJSON schema, hypotheses.json schema), and CONSOLIDATION.md (consolidation rules and examples).
Remove the mixed Chinese/English comments in Step 0 and the 'When to Use' / 'Key Features' sections which explain things Claude already knows from the skill's own context.
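To illustrate the second suggestion, a pseudocode stub like `extractErrorKeywords` can be replaced with a small executable helper. This is a minimal Python sketch, not the skill's actual logic: the stopword list and regex heuristics are assumptions, and only the function's purpose (pulling search-worthy tokens from an error message) comes from the review.

```python
import re

def extract_error_keywords(message: str, max_keywords: int = 5) -> list[str]:
    """Pull search-worthy tokens from an error message.

    A sketch: keeps exception class names and quoted identifiers first,
    then longer words, dropping common filler terms. Heuristics are
    illustrative assumptions, not the skill's actual implementation.
    """
    stopwords = {"error", "the", "in", "at", "line", "file", "is", "not", "has"}
    # CamelCase exception class names, e.g. ValueError, AttributeError
    classes = re.findall(r"\b[A-Z][a-z]+(?:[A-Z][a-z]+)+\b", message)
    # Identifiers quoted in the message, e.g. 'save' or "user_id"
    quoted = re.findall(r"['\"]([A-Za-z_][A-Za-z0-9_.]*)['\"]", message)
    # Remaining words of four or more characters, minus filler
    words = [w for w in re.findall(r"[A-Za-z_][A-Za-z0-9_]{3,}", message)
             if w.lower() not in stopwords]
    seen: set[str] = set()
    keywords: list[str] = []
    for token in classes + quoted + words:
        if token not in seen:
            seen.add(token)
            keywords.append(token)
    return keywords[:max_keywords]
```

Even a simple version like this gives Claude deterministic behavior to build on, rather than a stub that returns placeholder values.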
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | Extremely verbose at ~400+ lines. Massive amounts of template code that is pseudocode or illustrative rather than executable, redundant flow diagrams repeated multiple times (execution process, iteration flow), explanations of basic concepts Claude knows (what NDJSON is, how to parse JSON lines), and a 'When to Use' section that adds little value. The same workflow is described in at least three different places. | 1 / 3 |
| Actionability | Provides code templates for instrumentation (Python/JS) that are fairly concrete, and the session setup logic is detailed. However, many critical functions are pseudocode stubs (extractErrorKeywords, analyzeSearchResults, evaluateEvidence, removeDebugRegions, generateHypotheses) that just return placeholder values. The skill relies on undefined helper functions and a '$BUG' variable without clearly explaining the invocation mechanism. | 2 / 3 |
| Workflow Clarity | The multi-step workflow is clearly sequenced with explicit mode detection (explore/analyze/continue), validation checkpoints (hypothesis evaluation, verification after fix), feedback loops (fix doesn't work → return to analyze mode, all rejected → generate new hypotheses), and an error handling table covering edge cases like >5 iterations and empty logs. | 3 / 3 |
| Progressive Disclosure | Everything is crammed into a single monolithic file with no references to external documents. The understanding.md template is shown twice (once in Step 1.2 and again as a standalone template section). The session folder structure, NDJSON format, iteration flow, and consolidation rules could all be separate reference files. The content is a wall of text with heavy repetition. | 1 / 3 |
| Total | | 7 / 12 Passed |
Validation
81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| skill_md_line_count | SKILL.md is long (622 lines); consider splitting into references/ and linking | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 9 / 11 Passed |
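The frontmatter_unknown_keys warning can typically be cleared by nesting custom keys under the metadata block the warning itself mentions. A sketch, assuming the spec accepts an arbitrary metadata map; the key names and values here are hypothetical:

```yaml
name: debug-with-file
description: Interactive hypothesis-driven debugging with documented exploration.
metadata:
  version: "1.0.0"   # hypothetical key, previously at the top level
  author: example    # hypothetical key, previously at the top level
```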