CtrlK
BlogDocsLog inGet started
Tessl Logo

improvement-loop

Use after resolving a bug, failed task, or unexpected agent behavior to improve the pipeline skills, agents, hooks, or scripts that contributed to the problem. Also proactively suggest improvements when recurring patterns or inefficiencies are observed.

81

1.25x
Quality

72%

Does it follow best practices?

Impact

93%

1.25x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/improvement-loop/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-crafted process skill with excellent workflow clarity and actionability — the gate check, five-step cycle, routing tables, and anti-pattern guards provide concrete, executable guidance. Its main weakness is verbosity: the core principle ('fix first, improve last') is restated in at least five different ways across sections, and the Graphviz diagrams consume tokens for relatively simple decision logic. The monolithic structure is acceptable given no bundle exists, but the document would benefit from trimming redundant emphasis.

Suggestions

Consolidate the repeated 'fix first, improve later' messaging — the gate check section, red flags, key insight, and preventing drift all hammer the same point. State it once prominently and reference it elsewhere.

Replace the Graphviz dot diagrams with compact bullet-point decision trees to save tokens while preserving clarity (Claude can't render Graphviz anyway).

DimensionReasoningScore

Conciseness

The skill is reasonably well-structured but verbose for what it conveys. The Graphviz diagrams add visual clarity but consume significant tokens for decision trees that could be simple bullet lists. Several sections (Red Flags, Preventing Improvement Drift, Key Insight) repeat the same core message ('fix first, improve later') multiple times. The batching section's example dialogue template is useful but the overall document could be tightened by ~30%.

2 / 3

Actionability

The skill provides highly concrete, actionable guidance: specific git commit message formats, exact verification steps per change type, a routing table mapping change types to specific tools/agents, classification tables with examples, and template dialogue for user communication. The five-step cycle is prescriptive with clear do/don't boundaries.

3 / 3

Workflow Clarity

The workflow is exceptionally clear with an explicit mandatory gate check before any improvement work, a well-sequenced five-step cycle, verification steps mapped to each change type, and explicit feedback loops (fix → verify → re-validate). The anti-pattern table for preventing drift serves as guardrails. Destructive/risky operations (pipeline edits) are gated behind user confirmation and verification.

3 / 3

Progressive Disclosure

The content is well-organized with clear sections and headers, but it's entirely monolithic — everything lives in one file with no references to supporting documents. The routing table mentions 'writing-skills' and 'writing-agents' skills but doesn't link to them. For a skill of this length (~200+ lines), some content (like the batching section or the detailed classification table) could be split into referenced files. However, no bundle files exist to reference.

2 / 3

Total

10

/

12

Passed

Description

67%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description does well at establishing when the skill should be triggered, with clear temporal and situational cues. However, the actual capabilities are described in vague terms ('improve', 'suggest improvements') rather than concrete actions, and the trigger terms could better match natural user language. The broad scope covering agents, hooks, scripts, and pipeline skills creates some conflict risk with more specialized skills.

Suggestions

Replace vague verbs like 'improve' with specific concrete actions such as 'update skill instructions', 'add error handling to scripts', 'refine agent prompts', or 'add validation hooks'.

Add natural user-facing trigger terms like 'postmortem', 'root cause', 'fix', 'error', 'retry', 'flaky' to improve matching with how users naturally describe these situations.

DimensionReasoningScore

Specificity

The description names a domain (improving pipeline skills, agents, hooks, scripts) and some actions (resolve, improve, suggest improvements), but the actions are somewhat vague—'improve' and 'suggest improvements' are not concrete operations like 'refactor', 'update configuration', or 'add error handling'.

2 / 3

Completeness

The description clearly answers both 'what' (improve pipeline skills, agents, hooks, or scripts) and 'when' (after resolving a bug, failed task, or unexpected agent behavior; also when recurring patterns or inefficiencies are observed). The 'Use after...' and 'Also proactively suggest...' clauses serve as explicit trigger guidance.

3 / 3

Trigger Term Quality

Includes some relevant terms like 'bug', 'failed task', 'agent behavior', 'pipeline', 'hooks', 'scripts', and 'recurring patterns', but misses common user phrasings like 'fix', 'error', 'debug', 'postmortem', 'retrospective', or 'root cause'. The terms lean more toward system vocabulary than natural user language.

2 / 3

Distinctiveness Conflict Risk

While the post-incident improvement focus is somewhat distinctive, terms like 'bug', 'failed task', and 'improve scripts' could overlap with debugging skills, general code improvement skills, or CI/CD pipeline skills. The scope is broad enough to potentially conflict with multiple other skills.

2 / 3

Total

9

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
aaddrick/claude-desktop-debian
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.