Visual UI annotation tool for AI agents. Drop the React toolbar into any app — humans click elements and leave feedback, agents receive structured CSS selectors, bounding boxes, and React component trees to find exact code. Supports MCP watch-loop, platform-specific hooks (Claude Code / Codex / Gemini CLI / OpenCode), webhook delivery, and autonomous self-driving critique with agent-browser.
75
63%
Does it follow best practices?
Impact
96%
1.52xAverage score across 3 eval scenarios
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./.agent-skills/agentation/SKILL.mdQuality
Discovery
50%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description excels at technical specificity and carves out a distinctive niche for AI-assisted UI annotation workflows. However, it critically lacks any 'Use when...' guidance, making it difficult for Claude to know when to select this skill. The language is heavily technical/jargon-laden, which may not match how users naturally request this functionality.
Suggestions
Add an explicit 'Use when...' clause with trigger scenarios, e.g., 'Use when the user needs to annotate UI elements, collect visual feedback, or help AI agents locate specific components in a web application.'
Include more natural user-facing trigger terms alongside technical ones, such as 'visual feedback', 'element selection', 'UI debugging', or 'component identification'.
Consider adding common task-based triggers like 'when reviewing UI changes', 'when debugging element positioning', or 'when agents need to interact with specific UI components'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'Drop the React toolbar into any app', 'humans click elements and leave feedback', 'agents receive structured CSS selectors, bounding boxes, and React component trees'. Also mentions specific features like MCP watch-loop, platform hooks, webhook delivery, and autonomous critique. | 3 / 3 |
Completeness | Describes WHAT the tool does in detail but completely lacks a 'Use when...' clause or any explicit trigger guidance. There's no indication of when Claude should select this skill over others. Per rubric guidelines, missing explicit trigger guidance caps completeness at 2, and this has none at all. | 1 / 3 |
Trigger Term Quality | Contains some relevant technical terms like 'UI annotation', 'CSS selectors', 'React', 'bounding boxes', but these are developer-centric jargon. Missing natural user phrases like 'annotate UI', 'mark elements', 'visual feedback'. Platform names (Claude Code, Codex, Gemini CLI) are specific but niche. | 2 / 3 |
Distinctiveness Conflict Risk | Highly distinctive niche: 'Visual UI annotation tool for AI agents' with specific technical scope (React toolbar, CSS selectors, bounding boxes, component trees). The combination of AI agent feedback, specific platform hooks, and agent-browser integration creates a clear, unique identity unlikely to conflict with other skills. | 3 / 3 |
Total | 9 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a comprehensive, highly actionable skill with excellent executable examples and clear workflow patterns. The main weakness is verbosity - multiple redundant setup paths are listed inline, and the document would benefit from splitting platform-specific configs and the setup script into separate reference files. The workflow clarity is strong with explicit lifecycle states and validation checkpoints.
Suggestions
Extract the 50-line setup-agentation-mcp.sh script to a separate file and reference it with a one-liner
Consolidate the 5+ different Claude Code setup methods into a single recommended approach with alternatives in a separate SETUP.md
Move platform-specific hook configurations (Section 4 subsections) to a PLATFORMS.md reference file, keeping only the universal npx add-mcp command inline
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is comprehensive but includes redundant setup options (multiple ways to do the same thing listed sequentially), repeated information across sections, and some explanatory text that could be trimmed. The architecture section and some hook examples are efficient, but overall the document could be 30-40% shorter. | 2 / 3 |
Actionability | Excellent executable examples throughout - complete React component code, bash commands, JSON configs, TypeScript types, and curl commands are all copy-paste ready. The MCP tool table with parameters and the full annotation schema provide concrete, actionable reference. | 3 / 3 |
Workflow Clarity | Clear workflow patterns (Section 6) with explicit sequences: Copy-Paste, MCP Watch Loop, Hook Injection, and Autonomous Self-Driving. The watch loop pattern includes acknowledge→fix→resolve lifecycle with validation. The jeo integration section shows phase guards and pre-flight checks. | 3 / 3 |
Progressive Disclosure | The document is well-structured with numbered sections and tables, but it's monolithic at ~600 lines. Advanced topics like jeo integration, webhook setup, and platform-specific hooks could be split into separate reference files. The inline bash script (setup-agentation-mcp.sh) adds significant bulk that could be externalized. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
skill_md_line_count | SKILL.md is long (778 lines); consider splitting into references/ and linking | Warning |
Total | 10 / 11 Passed | |
c033769
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.