You are a technical debt expert specializing in identifying, quantifying, and prioritizing technical debt in software projects. Analyze the codebase to uncover debt, assess its impact, and create acti
34
30%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/codebase-cleanup-tech-debt/SKILL.mdQuality
Discovery
32%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description is truncated, which severely undermines its utility. Even the visible portion lacks a 'Use when...' clause, uses second-person voice ('You are'), and relies on somewhat generic action verbs rather than concrete deliverables. The technical debt focus provides some niche identity but needs sharper trigger terms and explicit usage guidance.
Suggestions
Fix the truncation and add an explicit 'Use when...' clause with trigger terms like 'technical debt', 'code quality audit', 'refactoring priorities', 'legacy code', 'code smells'.
Switch from second-person voice ('You are a technical debt expert') to third-person voice describing what the skill does (e.g., 'Identifies, quantifies, and prioritizes technical debt in software projects').
List specific concrete outputs such as 'generates debt inventory reports, estimates remediation effort, produces prioritized refactoring plans' to improve specificity.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (technical debt) and some actions (identifying, quantifying, prioritizing, analyzing, assessing impact), but these are somewhat generic and not as concrete as listing specific deliverables or techniques. Also uses second person framing ('You are') which is problematic. | 2 / 3 |
Completeness | The description appears truncated (cuts off at 'create acti'), and even before truncation it only addresses 'what' without any explicit 'when should Claude use it' clause. The missing 'Use when...' clause and the truncation severely limit completeness. | 1 / 3 |
Trigger Term Quality | Includes relevant terms like 'technical debt', 'codebase', and 'prioritizing', which users might naturally say. However, it misses common variations like 'code quality', 'refactoring', 'legacy code', 'code smell', or 'maintainability'. | 2 / 3 |
Distinctiveness Conflict Risk | The focus on 'technical debt' provides some distinctiveness, but terms like 'analyze the codebase' and 'assess impact' are broad enough to overlap with general code review or code quality analysis skills. | 2 / 3 |
Total | 7 / 12 Passed |
Implementation
27%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill is a comprehensive but overly verbose template for technical debt analysis that reads more like a consulting playbook than an actionable Claude skill. It explains many concepts Claude already knows (code smells, testing types, deployment practices) and uses hypothetical placeholder examples rather than providing executable tools or commands. The content would be far more effective at 20% of its current length, focusing on the specific decision framework and output format rather than enumerating well-known software engineering concepts.
Suggestions
Reduce content by 70-80% by removing explanations of concepts Claude already knows (code smells, testing types, dependency management) and keeping only the specific decision framework, prioritization criteria, and output format.
Add concrete, executable commands for actual codebase analysis (e.g., specific static analysis tool invocations, grep patterns for detecting duplication, scripts for measuring complexity) rather than hypothetical cost calculations.
Split the monolithic content into a concise SKILL.md overview with references to separate files for templates (metrics-dashboard-template.md, remediation-plan-template.md, stakeholder-report-template.md).
Add validation checkpoints between major steps (e.g., 'Verify inventory completeness before proceeding to impact assessment' with specific criteria for what constitutes sufficient analysis).
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Extremely verbose at ~300+ lines. Most content is generic technical debt knowledge that Claude already possesses (what cyclomatic complexity is, what code duplication means, how to calculate ROI). The examples use hypothetical placeholder data (e.g., '$36,000 annual cost') that aren't actionable for any real codebase. Nearly every section could be reduced to a few bullet points of novel guidance. | 1 / 3 |
Actionability | Contains some concrete code examples (facade pattern, YAML configs) and specific metrics thresholds, but most guidance is template-level rather than executable. The code snippets are illustrative pseudocode rather than copy-paste ready tools. The skill describes what to produce rather than providing concrete commands or scripts to actually analyze a codebase (e.g., no actual static analysis tool invocations). | 2 / 3 |
Workflow Clarity | The 8-step numbered sequence provides a clear ordering, but there are no validation checkpoints or feedback loops between steps. For a process that involves analyzing and modifying codebases (potentially destructive refactoring), there's no explicit 'verify before proceeding' step. The workflow reads more like a report template than an iterative process with error recovery. | 2 / 3 |
Progressive Disclosure | Monolithic wall of text with no references to external files and no bundle files to support it. All content is inline, including lengthy example templates, YAML configs, and code samples that could easily be split into separate reference files (e.g., metrics templates, refactoring patterns, communication templates). The skill would benefit enormously from splitting into overview + detailed reference files. | 1 / 3 |
Total | 6 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
f5dc9e3
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.