Operate the joelclaw Kubernetes cluster — Talos Linux on Colima (Mac Mini). Deploy services, check health, debug pods, recover from restarts, add ports, manage Helm releases, inspect logs, fix networking. Triggers on: 'kubectl', 'pods', 'deploy to k8s', 'cluster health', 'restart pod', 'helm install', 'talosctl', 'colima', 'nodeport', 'flannel', 'port mapping', 'k8s down', 'cluster not working', 'add a port', 'PVC', 'storage', any k8s/Talos/Colima infrastructure task. Also triggers on service-specific deploy: 'deploy redis', 'redeploy inngest', 'livekit helm', 'pds not responding'.
62
75%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/k8s/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly defines a specific operational domain (a named Kubernetes cluster with a particular tech stack), enumerates concrete actions comprehensively, and provides extensive explicit trigger terms covering both technical commands and natural language phrases. The description is well-structured, distinctive, and would enable accurate skill selection even among many competing skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: deploy services, check health, debug pods, recover from restarts, add ports, manage Helm releases, inspect logs, fix networking. Very comprehensive enumeration of capabilities. | 3 / 3 |
Completeness | Clearly answers both 'what' (operate the joelclaw Kubernetes cluster, deploy services, debug pods, etc.) and 'when' (explicit 'Triggers on:' clause with extensive list of trigger terms and scenarios). The trigger guidance is explicit and thorough. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms including both technical commands ('kubectl', 'talosctl', 'helm install') and natural language phrases ('cluster not working', 'k8s down', 'add a port', 'deploy redis'). Covers common variations and user phrasings. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive — scoped to a specific named cluster ('joelclaw'), specific infrastructure stack (Talos Linux on Colima on Mac Mini), and includes unique service-specific triggers ('deploy redis', 'redeploy inngest', 'livekit helm', 'pds not responding'). Very unlikely to conflict with other skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
50%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill is highly actionable with excellent concrete commands and real configuration values, making it genuinely useful for cluster operations. However, it suffers significantly from being a monolithic document that inlines extensive incident histories, 20 danger zones, and subsystem details (agent runner, NAS NFS, AIStor) that should be split into focused reference files. The sheer volume (~500+ lines) undermines token efficiency and makes it harder to find relevant information quickly.
Suggestions
Extract the 20 Danger Zones into a separate references/danger-zones.md file, keeping only the top 3-5 most critical warnings inline with links to the full list.
Move the Agent Runner section (Job lifecycle, resource defaults, security, verification) into its own references/agent-runner.md — it's a distinct subsystem that doesn't need to be in the main operations skill.
Move NAS NFS Access details into references/nas-nfs.md, keeping only a brief summary and mount example in the main file.
Add explicit validation checkpoints to the recovery workflows — e.g., after Colima crash recovery, include a numbered checklist with verification commands at each step rather than scattering symptoms and fixes across narrative paragraphs.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | This is an extremely long skill file (~500+ lines) that includes extensive incident histories, ADR references, danger zones (20 items), and operational details that could easily be split into reference files. Much of the content (e.g., detailed incident narratives like '2026-03-17 incident', lengthy explanations of why things broke) is verbose context that Claude doesn't need inline to perform operations. | 1 / 3 |
Actionability | The skill provides highly concrete, executable commands throughout — kubectl commands, bash scripts, curl health checks, YAML manifests for Redis AOF recovery, specific file paths, and exact port numbers. Nearly everything is copy-paste ready with real values. | 3 / 3 |
Workflow Clarity | Some workflows like Redis AOF recovery and deploy commands have clear step sequences, but many critical multi-step processes (e.g., full recovery after Colima crash, NAS route restoration, PDS rebuild) lack explicit validation checkpoints and are scattered across the document or deferred to references. The 20 danger zones are a flat list without clear decision trees for when each applies. | 2 / 3 |
Progressive Disclosure | The skill references operations.md for recovery procedures and port mappings, which is good, but the main file itself is monolithic — it inlines extensive incident details, 20 danger zones, full agent runner specs, NAS NFS details, and AIStor notes that should be in separate reference files. The architecture section, danger zones, and agent runner section alone could each be their own reference documents. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (537 lines); consider splitting into references/ and linking | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
03f0a59
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.