Start Tilt dev environment in tmux, monitor bootstrap to healthy state, fix Tiltfile bugs without hard-coding or fallbacks. Use when starting tilt, debugging Tiltfile errors, or bootstrapping a dev environment.
96
96%
Does it follow best practices?
Impact
92%
1.33xAverage score across 3 eval scenarios
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly identifies specific actions (starting Tilt in tmux, monitoring bootstrap health, fixing Tiltfile bugs), includes natural trigger terms developers would use, and provides an explicit 'Use when' clause. The description is concise, uses third-person voice, and occupies a distinct niche that minimizes conflict risk with other skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: starting Tilt dev environment in tmux, monitoring bootstrap to healthy state, and fixing Tiltfile bugs. Also specifies constraints (without hard-coding or fallbacks), which adds precision. | 3 / 3 |
Completeness | Clearly answers both 'what' (start Tilt in tmux, monitor bootstrap, fix Tiltfile bugs) and 'when' with an explicit 'Use when...' clause covering starting tilt, debugging Tiltfile errors, and bootstrapping a dev environment. | 3 / 3 |
Trigger Term Quality | Includes strong natural trigger terms users would say: 'tilt', 'Tiltfile', 'tmux', 'bootstrap', 'dev environment', 'debugging'. These cover the key terms a developer would use when needing this skill. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche around Tilt/Tiltfile and tmux-based dev environment bootstrapping. Unlikely to conflict with other skills due to the specificity of the tooling mentioned. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
92%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a high-quality skill that provides clear, actionable guidance for a complex multi-step workflow. The principles section effectively establishes constraints without over-explaining, and the workflow includes proper validation checkpoints and error recovery loops. The only notable weakness is that the content could benefit from slightly better progressive disclosure with explicit file references rather than inline mentions.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is lean and efficient throughout. Every section serves a clear purpose—principles establish constraints, workflow provides concrete steps. No unnecessary explanations of what Tilt, tmux, or Kubernetes are. The bullet points are terse and actionable. | 3 / 3 |
Actionability | Provides fully executable bash commands for every step: checking tilt status, starting in tmux, polling for convergence with jq queries, reading logs. The tmux session management code is copy-paste ready with proper conditionals. | 3 / 3 |
Workflow Clarity | Clear 5-step sequence with explicit validation checkpoints: Step 3 defines convergence criteria and polling strategy, Step 4 includes a feedback loop (fix → live-reload → re-poll), and there's an explicit escalation path after 3 failed iterations. The workflow handles both fresh starts and already-running states. | 3 / 3 |
Progressive Disclosure | The content is well-structured with clear sections and appropriate length for inline content, but references to external skills ('Follow the tmux skill patterns') and external files (silo.toml, README) could be better signaled with explicit links. The Principles section could potentially be a separate reference file given its length. | 2 / 3 |
Total | 11 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
f772de4
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.