CtrlK
BlogDocsLog inGet started
Tessl Logo

skill-tuning

Universal skill diagnosis and optimization tool. Detect and fix skill execution issues including context explosion, long-tail forgetting, data flow disruption, and agent coordination failures. Supports Gemini CLI for deep analysis. Triggers on "skill tuning", "tune skill", "skill diagnosis", "optimize skill", "skill debug".

82

Quality

78%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/skill-tuning/SKILL.md
SKILL.md
Quality
Evals
Security

Skill Tuning

Autonomous diagnosis and optimization for skill execution issues.

Architecture

┌─────────────────────────────────────────────────────┐
│  Phase 0: Read Specs (mandatory)                    │
│  → problem-taxonomy.md, tuning-strategies.md         │
└─────────────────────────────────────────────────────┘
                        ↓
┌─────────────────────────────────────────────────────┐
│  Orchestrator (state-driven)                         │
│  Read state → Select action → Execute → Update → ✓ │
└─────────────────────────────────────────────────────┘
        ↓                           ↓
┌──────────────────────┐   ┌──────────────────┐
│  Diagnosis Phase     │   │ Gemini CLI       │
│  • Context          │   │ Deep analysis    │
│  • Memory           │   │ (on-demand)      │
│  • DataFlow         │   │                  │
│  • Agent            │   │ Complex issues   │
│  • Docs             │   │ Architecture     │
│  • Token Usage      │   │ Performance      │
└──────────────────────┘   └──────────────────┘
                ↓
        ┌───────────────────┐
        │  Fix & Verify     │
        │  Apply → Re-test  │
        └───────────────────┘

Core Issues Detected

PriorityProblemRoot CauseFix Strategy
P0Authoring ViolationIntermediate files, state bloat, file relayeliminate_intermediate, minimize_state
P1Data Flow DisruptionScattered state, inconsistent formatsstate_centralization, schema_enforcement
P2Agent CoordinationFragile chains, no error handlingerror_wrapping, result_validation
P3Context ExplosionUnbounded history, full content passingsliding_window, path_reference
P4Long-tail ForgettingEarly constraint lossconstraint_injection, checkpoint_restore
P5Token ConsumptionVerbose prompts, state bloatprompt_compression, lazy_loading

Problem Categories (Detailed Specs)

See specs/problem-taxonomy.md for:

  • Detection patterns (regex/checks)
  • Severity calculations
  • Impact assessments

Tuning Strategies (Detailed Specs)

See specs/tuning-strategies.md for:

  • 10+ strategies per category
  • Implementation patterns
  • Verification methods

Workflow

StepActionOrchestrator DecisionOutput
1action-initstatus='pending'Backup, session created
2action-analyze-requirementsAfter initRequired dimensions + coverage
3Diagnosis (6 types)Focus areasstate.diagnosis.{type}
4action-gemini-analysisCritical issues OR user requestDeep findings
5action-generate-reportAll diagnosis completestate.final_report
6action-propose-fixesIssues foundstate.proposed_fixes[]
7action-apply-fixPending fixesApplied + verified
8action-completeQuality gates passsession.status='completed'

Action Reference

CategoryActionsPurpose
Setupaction-initInitialize backup, session state
Analysisaction-analyze-requirementsDecompose user request via Gemini CLI
Diagnosisaction-diagnose-{context,memory,dataflow,agent,docs,token_consumption}Detect category-specific issues
Deep Analysisaction-gemini-analysisGemini CLI: complex/critical issues
Reportingaction-generate-reportConsolidate findings → final_report
Fixingaction-propose-fixes, action-apply-fixGenerate + apply fixes
Verifyaction-verifyRe-run diagnosis, check gates
Exitaction-complete, action-abortFinalize or rollback

Full action details: phases/actions/

State Management

Single source of truth: .workflow/.scratchpad/skill-tuning-{ts}/state.json

{
  "status": "pending|running|completed|failed",
  "target_skill": { "name": "...", "path": "..." },
  "diagnosis": {
    "context": {...},
    "memory": {...},
    "dataflow": {...},
    "agent": {...},
    "docs": {...},
    "token_consumption": {...}
  },
  "issues": [{"id":"...", "severity":"...", "category":"...", "strategy":"..."}],
  "proposed_fixes": [...],
  "applied_fixes": [...],
  "quality_gate": "pass|fail",
  "final_report": "..."
}

See phases/state-schema.md for complete schema.

Orchestrator Logic

See phases/orchestrator.md for:

  • Decision logic (termination checks → action selection)
  • State transitions
  • Error recovery

Key Principles

  1. Problem-First: Diagnosis before any fix
  2. Data-Driven: Record traces, token counts, snapshots
  3. Iterative: Multiple rounds until quality gates pass
  4. Reversible: All changes with backup checkpoints
  5. Non-Invasive: Minimal changes, maximum clarity

Usage Examples

# Basic skill diagnosis
/skill-tuning "Fix memory leaks in my skill"

# Deep analysis with Gemini
/skill-tuning "Architecture issues in async workflow"

# Focus on specific areas
/skill-tuning "Optimize token consumption and fix agent coordination"

# Custom issue
/skill-tuning "My skill produces inconsistent outputs"

Output

After completion, review:

  • .workflow/.scratchpad/skill-tuning-{ts}/state.json - Full state with final_report
  • state.final_report - Markdown summary (in state.json)
  • state.applied_fixes - List of applied fixes with verification results

Reference Documents

DocumentPurpose
specs/problem-taxonomy.mdClassification + detection patterns
specs/tuning-strategies.mdFix implementation guide
specs/dimension-mapping.mdDimension ↔ Spec mapping
specs/quality-gates.mdQuality verification criteria
phases/orchestrator.mdWorkflow orchestration
phases/state-schema.mdState structure definition
phases/actions/Individual action implementations
Repository
catlog22/Claude-Code-Workflow
Last updated
Created

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.