Agent skill for test-long-runner - invoke with $agent-test-long-runner
Install with Tessl CLI
npx tessl i github:ruvnet/claude-flow --skill agent-test-long-runner42
Does it follow best practices?
If you maintain this skill, you can automatically optimize it using the tessl CLI to improve its score:
npx tessl skill review --optimize ./path/to/skillEvaluation — 96%
↓ 0.98xAgent success when using this skill
Validation for skill structure
Security audit report with code examples and action items
Section headers present
100%
100%
All major vulnerabilities covered
100%
100%
Reasoning per issue
100%
100%
Corrected code examples
100%
100%
Vulnerable code cited
100%
100%
References or standards cited
100%
100%
Action items or next steps
100%
100%
Severity or priority ranking
100%
100%
SQL injection addressed
100%
100%
Secret key exposure addressed
100%
100%
SSRF / unvalidated URL proxy addressed
100%
100%
Admin route missing auth addressed
100%
100%
Without context: $0.3500 · 2m 7s · 8 turns · 11 in / 7,907 out tokens
With context: $0.4632 · 2m 26s · 18 turns · 23 in / 7,888 out tokens
Architecture design document with text diagrams
Section headers used
100%
100%
Text-based diagram present
100%
100%
Multiple diagrams
100%
100%
Design decision reasoning
100%
100%
Comprehensive component coverage
100%
100%
Cache invalidation strategy
100%
100%
Failure modes addressed
100%
100%
References to technologies or patterns
100%
100%
Action items or next steps
100%
70%
Incremental adoption addressed
100%
100%
Non-trivial detail depth
100%
100%
Without context: $0.4069 · 2m 45s · 13 turns · 20 in / 8,362 out tokens
With context: $0.6407 · 3m 55s · 23 turns · 63 in / 12,230 out tokens
Research report with progress documentation and citations
Progress log produced
100%
100%
Progress log shows evolution
62%
87%
Section headers in report
100%
100%
All four databases covered
100%
100%
Multiple comparison dimensions
100%
100%
Citations or named references
100%
100%
Reasoning documented
100%
100%
Recommendation section
100%
100%
Action items or next steps
25%
25%
Report depth
100%
100%
Managed vs. self-hosted addressed
100%
100%
Indexing algorithm mentioned
100%
100%
Without context: $0.5567 · 3m 41s · 20 turns · 27 in / 10,367 out tokens
With context: $0.5373 · 3m 6s · 24 turns · 30 in / 9,340 out tokens
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.