CtrlK
BlogDocsLog inGet started
Tessl Logo

grove-test

Create or fix tests for existing Grove code examples. Use when the user asks to "add a test", "create a test", "fix this test", "update the test", "the test doesn't match the output", "test is failing", or wants to add test coverage for an existing example or fix a broken test.

68

Quality

83%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description with excellent trigger term coverage and clear completeness, explicitly listing many natural user phrases. The main weakness is that the 'what' portion could be slightly more specific about the concrete actions involved beyond 'create or fix tests'. Overall, it would perform well in a multi-skill selection scenario.

DimensionReasoningScore

Specificity

The description names the domain ('tests for existing Grove code examples') and two main actions ('create or fix tests'), but doesn't elaborate on specific concrete sub-actions like generating assertions, comparing expected output, or updating test fixtures.

2 / 3

Completeness

Clearly answers both 'what' (create or fix tests for existing Grove code examples) and 'when' (explicit 'Use when...' clause with multiple specific trigger phrases and scenarios like adding test coverage or fixing broken tests).

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger phrases users would actually say: 'add a test', 'create a test', 'fix this test', 'update the test', 'the test doesn't match the output', 'test is failing', and 'add test coverage'. These are highly natural and varied.

3 / 3

Distinctiveness Conflict Risk

Scoped specifically to 'Grove code examples' testing, which is a clear niche. The combination of 'Grove' domain specificity and test-focused triggers makes it unlikely to conflict with general testing or general Grove skills.

3 / 3

Total

11

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured, highly actionable skill with clear multi-step workflows and good validation checkpoints. Its main weakness is verbosity — particularly the extensive Step 0 handoff validation logic and inline error message templates that inflate the token cost. The progressive disclosure is reasonable with references to external convention files, though the skill itself carries substantial inline detail that could be offloaded.

Suggestions

Consider moving the Step 0 handoff file validation logic (version checks, shape checks, trigger schemas) into a separate reference file to reduce the main skill's token footprint.

Trim the inline error message templates in Step 0 — Claude can generate appropriate error messages from brief instructions without needing exact wording prescribed.

DimensionReasoningScore

Conciseness

The skill is reasonably efficient but includes some sections that could be tightened — the handoff file validation in Step 0 is quite verbose with extensive error message templates, and some explanations (e.g., when to add vs create test files in Step 4) could be more concise. However, it generally avoids explaining concepts Claude already knows.

2 / 3

Actionability

The skill provides highly concrete, actionable guidance: specific file paths, exact bash commands for running tests, a clear validation approach table, a concrete error fix walkthrough with actual error output, and specific thresholds (8 it-blocks, max 3 retry attempts). The guidance is specific enough to execute without ambiguity.

3 / 3

Workflow Clarity

The 8-step workflow is clearly sequenced with explicit validation checkpoints (Step 7 runs the test, with a retry loop capped at 3 attempts and clear failure reporting). The fix mode includes a diagnostic sequence (run test first, compare expected vs actual, verify imports/teardown). The feedback loop for test failures is explicit with a defined exit condition.

3 / 3

Progressive Disclosure

The skill references external convention files (e.g., `conventions-{language}.md`, `/grove-run` Step 3, `/grove-create`) which is good progressive disclosure. However, no bundle files were provided to verify these references exist, and the skill itself is quite long (~200+ lines) with Step 0's handoff logic being very detailed inline when it could potentially be a separate reference. The structure is reasonable but the handoff section adds significant bulk.

2 / 3

Total

10

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
mongodb/docs
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.