Systematic improvement of existing agents through performance analysis, prompt engineering, and continuous iteration.
Install with Tessl CLI
npx tessl i github:sickn33/antigravity-awesome-skills --skill agent-orchestration-improve-agent59
Quality
43%
Does it follow best practices?
Impact
81%
1.24xAverage score across 3 eval scenarios
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/agent-orchestration-improve-agent/SKILL.mdPerformance baseline analysis
context-manager command
0%
100%
Task completion rate metric
83%
100%
Corrections per task metric
66%
100%
Tool call efficiency metric
0%
0%
User satisfaction score
0%
100%
Latency and token metrics
0%
0%
All six failure categories
53%
46%
All five feedback patterns
60%
100%
Hallucination incidents tracked
100%
100%
Baseline report section
37%
50%
Failure mode examples from logs
100%
100%
No deployment recommendations
83%
0%
Without context: $0.5408 · 2m 47s · 14 turns · 63 in / 9,280 out tokens
With context: $0.6222 · 2m 46s · 21 turns · 328 in / 9,345 out tokens
Prompt engineering improvements
Step-by-step phrasing
100%
100%
Verification checkpoint phrasing
100%
100%
Both positive and negative examples
25%
100%
Few-shot example structure
50%
100%
Core purpose statement
100%
100%
Constraints section
100%
100%
Success criteria defined
60%
80%
Tool proficiency defined
60%
0%
Five constitutional principles
73%
80%
Critique-and-revise loop
70%
80%
Structured output templates
100%
100%
Examples ordered simply to complex
80%
80%
prompt-engineer technique named
100%
100%
Without context: $0.3691 · 2m 14s · 12 turns · 12 in / 6,633 out tokens
With context: $0.5747 · 2m 48s · 23 turns · 19 in / 8,305 out tokens
Deployment and version control planning
Version format used
100%
100%
MINOR vs MAJOR distinction
80%
100%
Git-based storage
100%
100%
Changelog requirement
100%
100%
Six test suite categories
60%
100%
parallel-test-runner referenced
0%
0%
100-task minimum per variant
60%
100%
Statistical significance threshold
100%
40%
Cohen's d effect size
100%
100%
Five-stage rollout
80%
90%
Rollback triggers present
37%
100%
Success criteria thresholds
12%
50%
30-day review plan
42%
100%
Regression testing before deploy
100%
100%
Rollback process steps
87%
100%
Without context: $0.3873 · 2m 29s · 11 turns · 11 in / 7,709 out tokens
With context: $0.5933 · 3m 27s · 15 turns · 16 in / 10,795 out tokens
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.