Operate the joelclaw Kubernetes cluster — Talos Linux on Colima (Mac Mini). Deploy services, check health, debug pods, recover from restarts, add ports, manage Helm releases, inspect logs, fix networking. Triggers on: 'kubectl', 'pods', 'deploy to k8s', 'cluster health', 'restart pod', 'helm install', 'talosctl', 'colima', 'nodeport', 'flannel', 'port mapping', 'k8s down', 'cluster not working', 'add a port', 'PVC', 'storage', any k8s/Talos/Colima infrastructure task. Also triggers on service-specific deploy: 'deploy redis', 'redeploy inngest', 'livekit helm', 'pds not responding'.
89
88%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly defines a specific infrastructure domain (a named Kubernetes cluster with a particular tech stack), enumerates concrete actions comprehensively, and provides an extensive explicit trigger list covering CLI commands, problem descriptions, and service-specific operations. The description is thorough without being padded, and its specificity to a named cluster and stack makes it highly distinctive.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: deploy services, check health, debug pods, recover from restarts, add ports, manage Helm releases, inspect logs, fix networking. Very comprehensive enumeration of capabilities. | 3 / 3 |
Completeness | Clearly answers both 'what' (operate the joelclaw Kubernetes cluster, deploy services, debug pods, etc.) and 'when' (explicit 'Triggers on:' clause with extensive list of trigger terms and scenarios). The trigger guidance is explicit and thorough. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms users would actually say, including CLI tools ('kubectl', 'talosctl', 'helm install'), problem descriptions ('k8s down', 'cluster not working', 'pds not responding'), actions ('deploy redis', 'restart pod', 'add a port'), and infrastructure concepts ('nodeport', 'flannel', 'PVC', 'storage'). | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive — names a specific cluster ('joelclaw'), specific infrastructure stack (Talos Linux on Colima on Mac Mini), and includes very targeted trigger terms like 'talosctl', 'colima', 'flannel', and service-specific deploys. Unlikely to conflict with generic deployment or coding skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a highly actionable and operationally rich skill document that serves as a comprehensive runbook for a specific Kubernetes cluster. Its greatest strengths are the concrete, executable commands and the hard-won incident-driven knowledge (with specific dates and ADR references) that would be impossible for Claude to infer. Its main weakness is length—at 600+ lines, it could benefit from offloading sections like the Agent Runner contract, NAS NFS details, and some danger zone items into separate reference files to keep the main skill leaner.
Suggestions
Move the Agent Runner section (Job Lifecycle, Resource Defaults, Security, Verification Commands, Current State) into a separate reference file like `references/agent-runner.md` and link to it from the main skill.
Move the NAS NFS Access section into `references/nas-nfs.md` with a brief summary and link in the main skill, keeping only the mount snippet and key rules inline.
Consider splitting the 18-item Danger Zones list into a `references/danger-zones.md` file, keeping only the top 5 most critical items inline with a link to the full list.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is very long (~600+ lines) and contains substantial operational detail that is valuable but could be more efficiently organized. Some sections like the Agent Runner contract and incident-specific details (dates, ADR numbers) add bulk. However, most content is genuinely unique operational knowledge that Claude wouldn't know, so it's not explaining basics—it's just a lot of domain-specific information that could benefit from more aggressive offloading to reference files. | 2 / 3 |
Actionability | Excellent actionability throughout—concrete bash commands, kubectl invocations, YAML manifests, curl health checks, and specific file paths are provided for virtually every operation. The Redis AOF recovery, kubeconfig fix, NFS verification, deploy commands, and troubleshooting sections are all copy-paste ready with real commands and real values. | 3 / 3 |
Workflow Clarity | Multi-step processes are clearly sequenced with explicit validation checkpoints. The Redis AOF recovery has numbered steps with verification (check logs, then clean up, then restart). The deploy commands section sequences manifest application with rollout status checks. The durable recovery rule explicitly defines what counts as successful recovery with verification criteria. Feedback loops are present for error-prone operations like Colima crash recovery and NFS connectivity verification. | 3 / 3 |
Progressive Disclosure | The skill does reference `references/operations.md` for recovery procedures, port mappings, and cluster recreation, which is good. However, the main file is extremely long and contains sections like the full Agent Runner specification, NAS NFS details, and extensive danger zones that could be split into separate reference files. The architecture is front-loaded well, but the sheer volume of inline content undermines the progressive disclosure pattern. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (535 lines); consider splitting into references/ and linking | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
ce9ca8e
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.