Queries Tilt resource status, logs, and manages dev environments. Use when checking deployment health, investigating errors, reading logs, or working with Tiltfiles.
100
Quality
100%
Does it follow best practices?
Impact
100%
1.29xAverage score across 3 eval scenarios
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a well-crafted skill description that follows best practices. It uses third person voice, lists specific capabilities, includes a clear 'Use when...' clause with natural trigger terms, and is distinctly scoped to Tilt development environments. The description effectively differentiates itself from generic DevOps or logging skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'Queries Tilt resource status, logs, and manages dev environments' covers querying status, reading logs, and environment management - three distinct capabilities. | 3 / 3 |
Completeness | Clearly answers both what ('Queries Tilt resource status, logs, and manages dev environments') AND when ('Use when checking deployment health, investigating errors, reading logs, or working with Tiltfiles') with explicit trigger guidance. | 3 / 3 |
Trigger Term Quality | Includes natural keywords users would say: 'deployment health', 'investigating errors', 'reading logs', 'Tiltfiles', 'dev environments'. These are terms developers naturally use when working with Tilt. | 3 / 3 |
Distinctiveness Conflict Risk | Clear niche targeting Tilt specifically - mentions 'Tilt resource status' and 'Tiltfiles' which are unique to this tool. Unlikely to conflict with generic logging or deployment skills due to Tilt-specific terminology. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
100%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is an excellent skill file that demonstrates strong technical writing. It's concise, immediately actionable with executable commands, establishes clear workflows starting with error checking, and appropriately delegates detailed content to reference files. The critical warning about live-reload behavior prevents a common mistake.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | Every section is lean and purposeful. No explanation of what Tilt is or how it works—assumes Claude knows. Commands are direct with minimal commentary. | 3 / 3 |
Actionability | All commands are copy-paste ready with concrete jq patterns. The tmux script is fully executable, and status values are explicitly enumerated for interpretation. | 3 / 3 |
Workflow Clarity | Opens with 'First Action: Check for Errors' establishing clear priority. The tmux section has conditional logic for different states. Critical warning about not restarting provides clear guardrails. | 3 / 3 |
Progressive Disclosure | SKILL.md provides essential commands inline while clearly pointing to TILTFILE_API.md and CLI_REFERENCE.md for deeper content. References are one level deep and well-signaled. | 3 / 3 |
Total | 12 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.