The developer inner loop, end to end. Pair through implementation with TDD discipline, self-review before pushing, create a PR with context for reviewers, and process review feedback. Picks up a ready story and carries it across the finish line.
50
56%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./develop/skills/build/SKILL.md
Quality
Discovery
50%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description communicates a coherent end-to-end developer workflow covering TDD, self-review, PR creation, and feedback processing. Its main weaknesses are the lack of an explicit 'Use when...' clause, some metaphorical language that reduces precision, and missing common keyword variations that users would naturally use when requesting these capabilities.
Suggestions
Add an explicit 'Use when...' clause, e.g., 'Use when the user wants to implement a feature end-to-end, from picking up a ticket/story through coding with TDD, creating a pull request, and addressing code review feedback.'
Include common keyword variations users would naturally say: 'pull request', 'test-driven development', 'code review', 'feature implementation', 'ticket', 'task', 'branch'.
Replace metaphorical language ('carries it across the finish line', 'pair through implementation') with concrete actions like 'writes code using test-driven development, runs tests, commits changes, opens a pull request with descriptive context'.
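The first two suggestions can be checked mechanically. The sketch below is only an illustration, not Tessl's scoring logic: `audit_description` is a hypothetical helper, the trigger terms are copied from the suggestion list, and `REVISED_DESCRIPTION` is one possible rewrite offered as an assumption rather than the maintainers' wording.

```python
import re

# Trigger terms copied from the suggestion list above.
TRIGGER_TERMS = [
    "pull request",
    "test-driven development",
    "code review",
    "feature implementation",
    "ticket",
    "task",
    "branch",
]


def audit_description(description: str) -> dict:
    """Report whether a description has a 'Use when' clause and which trigger terms it lacks."""
    text = description.lower()
    missing = [
        term
        for term in TRIGGER_TERMS
        if not re.search(r"\b" + re.escape(term) + r"\b", text)
    ]
    return {"has_use_when": "use when" in text, "missing_terms": missing}


# The description as published with the skill.
CURRENT_DESCRIPTION = (
    "The developer inner loop, end to end. Pair through implementation with TDD "
    "discipline, self-review before pushing, create a PR with context for "
    "reviewers, and process review feedback. Picks up a ready story and carries "
    "it across the finish line."
)

# Hypothetical rewrite (an assumption, not the maintainers' text).
REVISED_DESCRIPTION = (
    "Implements a ticket or task end to end: creates a branch, writes code using "
    "test-driven development (TDD), self-reviews, opens a pull request (PR) with "
    "context for reviewers, and addresses code review feedback. Use when the user "
    "wants feature implementation carried from a ready story through review."
)

if __name__ == "__main__":
    print(audit_description(CURRENT_DESCRIPTION))
    print(audit_description(REVISED_DESCRIPTION))
```

A check like this only confirms that the wording is present; whether the description actually improves discovery still has to be judged by the review itself.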
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Names several actions (TDD implementation, self-review, create PR, process review feedback, pick up stories) but uses somewhat abstract/metaphorical language like 'carries it across the finish line' and 'pair through implementation'. The actions are identifiable but not as concrete as they could be. | 2 / 3 |
| Completeness | The 'what' is reasonably covered (TDD implementation, self-review, PR creation, review feedback processing). However, there is no explicit 'Use when...' clause or equivalent trigger guidance; the when is only implied by the nature of the described workflow. Per rubric guidelines, missing 'Use when' caps completeness at 2. | 2 / 3 |
| Trigger Term Quality | Contains some relevant keywords like 'TDD', 'PR', 'self-review', 'review feedback', 'story', and 'implementation' that users might naturally use. However, it misses common variations like 'pull request', 'test-driven development', 'code review', 'feature development', 'ticket', or 'task'. | 2 / 3 |
| Distinctiveness Conflict Risk | The description covers a broad workflow ('developer inner loop, end to end') which could overlap with more focused skills for git operations, PR creation, code review, or TDD individually. The combination of all these steps together provides some distinctiveness, but the broad scope increases conflict risk with narrower skills. | 2 / 3 |
| Total | 8 / 12 Passed | |
Implementation
62%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured workflow skill that clearly sequences the developer inner loop through four distinct phases with good validation checkpoints and scope management guidance. Its main weaknesses are the lack of concrete executable examples (no actual commands, templates, or code snippets) and some redundancy between the main workflow, common patterns, and failure modes sections. The progressive disclosure is reasonable with clear references to supporting files, though the main document carries substantial inline content.
Suggestions
Add concrete executable examples: a sample lint/test command sequence for self-review, a filled-in PR description template, and an example review response — these would significantly boost actionability.
Consolidate failure modes into the relevant workflow steps as brief warnings rather than repeating guidance in a separate section, reducing redundancy and improving conciseness.
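As one illustration of the first suggestion, a self-review gate could be sketched as a small runner that executes the checks in order and reports a verdict. Everything here is an assumption: `run_self_review` is a hypothetical helper, the three commands are stand-ins to be replaced with the project's real lint, type-check, and test commands, and only the ✅ and 🔴 verdicts are modelled because this review does not say what the skill's 🟡 verdict means.

```python
import subprocess
import sys


def run_self_review(checks):
    """Run each (name, argv) check in order and return (verdict, results).

    The verdict is "✅" when every command exits with status 0, otherwise "🔴".
    """
    results = []
    for name, argv in checks:
        completed = subprocess.run(argv, capture_output=True, text=True)
        results.append((name, completed.returncode == 0))
    verdict = "✅" if all(ok for _, ok in results) else "🔴"
    return verdict, results


if __name__ == "__main__":
    # Placeholder commands (assumptions): swap in the project's actual
    # lint, type-check, and test invocations.
    placeholder_checks = [
        ("lint", [sys.executable, "-c", "print('lint ok')"]),
        ("type-check", [sys.executable, "-c", "print('types ok')"]),
        ("tests", [sys.executable, "-c", "print('tests ok')"]),
    ]
    verdict, results = run_self_review(placeholder_checks)
    for name, ok in results:
        print(f"{name}: {'pass' if ok else 'fail'}")
    print("verdict:", verdict)
```

Running every check rather than stopping at the first failure keeps the full picture visible in one pass, which suits a pre-push review; a fail-fast variant would be an equally reasonable choice.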
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The content is mostly efficient and well-structured, but includes some redundancy: failure modes largely restate guidance already given in the four moves (e.g., scope drift is covered in both sections, TDD judgment is mentioned multiple times). Some sections like 'Your Stance' add flavor but limited actionable value. | 2 / 3 |
| Actionability | Provides clear procedural guidance with specific checklists (self-review checks, PR structure, feedback categorization), but lacks concrete executable examples: no actual commands for lint/type-check/test runs, no example PR description template, no example commit message. The guidance is specific in intent but abstract in execution. | 2 / 3 |
| Workflow Clarity | The four-move sequence is clearly defined with explicit ordering (implement → self-review → PR → feedback). Self-review includes a validation gate with traffic-light verdicts (✅/🟡/🔴), and the feedback loop has clear re-request-review checkpoints. Preconditions establish entry criteria, and scope drift is handled with explicit decision branches. | 3 / 3 |
| Progressive Disclosure | References to supporting files are well-signaled (references/self-review.md, references/pr.md, references/feedback.md, model.md, guidelines.md, test/SKILL.md), but no bundle files were provided to verify these exist. The main file itself is fairly long (~150 lines) with 'Common Patterns' and 'Failure Modes' sections that could potentially be extracted. The structure is reasonable but the inline content is heavy for an overview document. | 2 / 3 |
| Total | 9 / 12 Passed | |
Validation
81%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | 9 / 11 Passed | |
632c389