Use when reviewing PRs, simplifying over-engineered code, judging architecture, renaming unclear concepts, or breaking an agent/human out of a too-clean path: code too fancy, too abstract, too clever, too many files, too many helpers, too many layers, too well-factored but painful to change. Embody Grug brain: complexity very bad, small words, no consultant speak, no hard pivot to opposite dogma.
86
83%
Does it follow best practices?
Impact
89%
1.21xAverage score across 3 eval scenarios
Passed
No known issues
Quality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description with excellent trigger term coverage and a very clear 'Use when' clause that makes it easy for Claude to know when to select this skill. Its distinctive personality ('Grug brain') and specific focus on over-engineering/complexity reduction give it a clear niche. The main weakness is that the concrete actions could be slightly more specific — some of the description blends capability with stylistic guidance.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names a domain (code review/simplification) and lists several actions (reviewing PRs, simplifying over-engineered code, renaming unclear concepts, breaking out of too-clean paths), but the actions are somewhat loosely defined and blend into stylistic guidance rather than concrete discrete operations. | 2 / 3 |
Completeness | The description opens with an explicit 'Use when...' clause that clearly answers both what (simplifying over-engineered code, reviewing PRs, renaming, judging architecture) and when (code too fancy, too abstract, too clever, too many files/helpers/layers). Both dimensions are well-covered with explicit triggers. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms users would actually say: 'reviewing PRs', 'over-engineered', 'too abstract', 'too clever', 'too many files', 'too many layers', 'too many helpers', 'renaming unclear concepts', 'complexity'. These are phrases developers naturally use when frustrated with overly complex code. | 3 / 3 |
Distinctiveness Conflict Risk | This skill occupies a very clear niche — the 'grug-brained' anti-complexity code review perspective. The distinctive voice ('Grug brain', 'small words', 'no consultant speak') and specific triggers around over-engineering make it highly unlikely to conflict with general code review or refactoring skills. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a creative and surprisingly actionable skill that uses an unconventional voice as a deliberate cognitive tool to force simplicity in code review. Its strengths are strong actionability (concrete questions, clear output format, specific examples of good/bad patterns) and clear workflow sequencing. Its main weaknesses are overall length—the philosophical/narrative sections could be trimmed—and the monolithic structure that would benefit from splitting detailed hunting criteria into separate reference files.
Suggestions
Trim the 'Grug remember' origin story section significantly—it sets tone but costs ~150 tokens that don't add actionable guidance beyond what 'Become Grug' already provides.
Consider splitting 'What Grug hunts' subsections (APIs, Tests, Tools, Expressions, etc.) into a separate HUNTING_GUIDE.md reference file to reduce the main skill's token footprint.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is lengthy (~400+ lines) and includes extensive philosophical/narrative content (the fire-side origin story, repeated 'complexity bad' mantras). While the Grug voice is intentionally terse per-sentence, the overall document is verbose with significant repetition of the same core idea (complexity = demon). Some sections like 'Grug enemy' list many demon costumes that could be condensed. However, the voice itself enforces brevity at the sentence level and most sections do add distinct guidance. | 2 / 3 |
Actionability | Despite the unconventional voice, the skill provides highly concrete, actionable guidance: specific questions to ask during review, exact output format templates with markdown sections, concrete examples of good vs bad names, clear criteria for when helpers earn their place, specific verdicts to render ('Fancy box not earn food. Smash.'). The output template is copy-paste ready and the 'bonk' planning format gives executable structure. | 3 / 3 |
Workflow Clarity | The workflow is clearly sequenced: adopt voice → ask Grug questions → inspect code for meat/demon/crystal → judge each stone → report using structured output format → end with 'next bonk' (smallest safe action + check). The 'Plan as bonks' section explicitly includes validation ('Run tests. Stop.'). The refactor section emphasizes staying near shore with working code after each step. For a review/refactor skill, this is well-structured with appropriate checkpoints. | 3 / 3 |
Progressive Disclosure | The content is a single monolithic file with no references to external files or bundle resources. For a skill this long (~400+ lines), some sections (like the detailed 'What Grug hunts' subsections on APIs, tests, tools, expressions, etc.) could be split into referenced files. However, the internal organization with clear headers and subsections is good, and the content flows logically from philosophy → method → inspection criteria → output format. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (561 lines); consider splitting into references/ and linking | Warning |
Total | 10 / 11 Passed | |
40067f1
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.