Systematic improvement of existing agents through performance analysis, prompt engineering, and continuous iteration.
- Quality: 37% (does it follow best practices?)
- Impact: Pending (no eval scenarios have been run)
- Passed: no known issues
Optimize this skill with Tessl:

`npx tessl skill review --optimize ./.agent/skills/agent-orchestration-improve-agent/SKILL.md`

Quality
Discovery: 32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description identifies a clear domain (agent improvement) but relies on abstract terminology rather than concrete actions. The complete absence of a 'Use when...' clause significantly weakens its utility for skill selection, and the trigger terms used are more technical than what users would naturally say.
Suggestions:

- Add an explicit 'Use when...' clause with natural trigger terms like 'agent not working', 'improve my agent', 'agent performance issues', 'fix agent behavior', or 'optimize agent prompts'.
- Replace abstract actions with concrete specifics: instead of 'performance analysis', say 'analyze agent logs and error patterns, identify failure modes, measure success rates'.
- Include file type or artifact triggers if applicable, such as 'when working with agent configuration files, system prompts, or agent evaluation results'.
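One way to apply these suggestions together is a sharpened frontmatter description. The snippet below is illustrative only — it is not the skill's actual frontmatter, and the wording is an assumption built from the trigger terms suggested above:

```yaml
---
name: agent-orchestration-improve-agent
description: >
  Systematically improve an existing agent: analyze agent logs and error
  patterns, identify failure modes, measure success rates, and rewrite
  system prompts. Use when an agent is not working, underperforming, or
  needs its prompts optimized (e.g. "fix my agent", "agent performance
  issues", "improve agent prompts"), or when working with agent
  configuration files, system prompts, or agent evaluation results.
---
```

A description in this shape names the domain, lists concrete actions, and carries the natural phrases a user would actually type, addressing all three suggestions at once.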
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Names the domain (agent improvement) and some actions (performance analysis, prompt engineering, continuous iteration), but these are somewhat abstract rather than concrete specific actions like 'analyze error logs' or 'rewrite system prompts'. | 2 / 3 |
| Completeness | Describes what the skill does but completely lacks a 'Use when...' clause or any explicit trigger guidance. Per rubric guidelines, missing explicit trigger guidance caps completeness at 2, and this is weaker than that threshold. | 1 / 3 |
| Trigger Term Quality | Includes some relevant terms like 'agents', 'prompt engineering', and 'performance analysis', but missing common variations users might say like 'fix my agent', 'agent not working', 'improve prompts', 'debug agent', or 'optimize agent'. | 2 / 3 |
| Distinctiveness / Conflict Risk | The focus on 'agents' provides some specificity, but 'prompt engineering' and 'performance analysis' could overlap with general coding skills, debugging skills, or other AI-related skills. | 2 / 3 |
| Total | | 7 / 12 (Passed) |
Implementation: 42%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill functions primarily as a navigation hub to 17 sub-skills rather than a standalone actionable guide. While the progressive disclosure and organization are excellent, the main content lacks concrete, executable guidance: the instructions are abstract, and the actual 'how' is delegated entirely to sub-files. The workflow mentions safety considerations but provides no inline validation steps or examples.
Suggestions:

- Add at least one concrete, executable example in the main skill (e.g., a sample baseline metrics collection script or a specific prompt improvement pattern).
- Include inline validation checkpoints in the 4-step workflow (e.g., 'Before proceeding to step 3, verify baseline metrics show X').
- Provide a minimal working example of one optimization technique directly in this file rather than deferring everything to sub-skills.
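The first suggestion above can be made concrete. This is a minimal, hypothetical baseline-metrics collector — the JSONL log format and the `success` / `latency_ms` field names are assumptions for illustration, not part of the skill under review:

```python
import json

def baseline_metrics(log_lines):
    """Compute simple baseline metrics from agent eval logs.

    Each line is assumed to be a JSON object with a boolean
    "success" field and a numeric "latency_ms" field
    (a hypothetical schema, for illustration only).
    """
    runs = [json.loads(line) for line in log_lines if line.strip()]
    total = len(runs)
    passed = sum(1 for r in runs if r.get("success"))
    latencies = sorted(r["latency_ms"] for r in runs)
    return {
        "total_runs": total,
        "success_rate": passed / total if total else 0.0,
        "median_latency_ms": latencies[len(latencies) // 2] if latencies else None,
    }

# Three hypothetical eval runs; success_rate is 2/3 on this sample
log = [
    '{"success": true, "latency_ms": 820}',
    '{"success": false, "latency_ms": 1450}',
    '{"success": true, "latency_ms": 900}',
]
print(baseline_metrics(log))
```

With numbers like these in hand, the second suggestion also becomes actionable: a checkpoint such as 'success_rate must not drop below the recorded baseline before proceeding to step 3' is measurable rather than aspirational.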
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The content is reasonably efficient but includes some unnecessary framing (extended thinking block, emoji headers), and the 'Use this skill when' sections add moderate overhead. The core instructions are lean but could be tighter. | 2 / 3 |
| Actionability | The skill provides only abstract, high-level guidance with no concrete code, commands, or executable examples. Instructions like 'Establish baseline metrics' and 'Apply prompt and workflow improvements' are vague and non-actionable without the sub-skills. | 1 / 3 |
| Workflow Clarity | The 4-step workflow provides a clear sequence and mentions rollback/regression testing, but lacks explicit validation checkpoints between steps. The actual workflow details are deferred entirely to 17 sub-skills without inline guidance. | 2 / 3 |
| Progressive Disclosure | Excellent structure with a clear overview and well-organized one-level-deep references to 17 sub-skills. Navigation is clear with numbered, categorized links. Content is appropriately split between overview and detailed modules. | 3 / 3 |
| Total | | 8 / 12 (Passed) |
Validation: 90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 10 / 11 passed |
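The single warning above is typically resolved by nesting unrecognized top-level keys under the `metadata` block the check mentions. The key names below are hypothetical — the report does not say which keys triggered the warning:

```yaml
# Before: "author" and "tags" are unknown top-level keys (hypothetical)
name: agent-orchestration-improve-agent
author: jane
tags: [agents, optimization]

# After: the same keys moved under metadata
name: agent-orchestration-improve-agent
metadata:
  author: jane
  tags: [agents, optimization]
```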