CtrlK
BlogDocsLog inGet started
Tessl Logo

agent-challenges

Agent skill for challenges - invoke with $agent-challenges

Install with Tessl CLI

npx tessl i github:ruvnet/claude-flow --skill agent-challenges
What are skills?

40

1.59x

Does it follow best practices?

Evaluation99%

1.59x

Agent success when using this skill

Validation for skill structure

SKILL.md
Review
Evals

Evaluation results

100%

30%

Coding Challenge Content Creation

Challenge quality standards

Criteria
Without context
With context

Recognized categories

0%

100%

Recognized difficulty levels

0%

100%

Worked examples

100%

100%

Explicit constraints

100%

100%

Edge case test cases

100%

100%

Performance benchmarks

100%

100%

Detailed scoring rubric

100%

100%

Different categories

100%

100%

Without context: $0.1600 · 1m · 8 turns · 13 in / 3,557 out tokens

With context: $0.4799 · 2m 18s · 20 turns · 270 in / 7,783 out tokens

100%

81%

Flow Nexus Challenge Dashboard Integration

MCP tool integration

Criteria
Without context
With context

challenges_list tool name

0%

100%

challenges_list difficulty param

100%

100%

challenges_list status param

0%

100%

challenge_submit tool name

0%

100%

challenge_submit execution_time

30%

100%

challenge_submit language

100%

100%

achievements_list tool name

0%

100%

leaderboard_get tool name

0%

100%

leaderboard type param

0%

100%

solution_code param name

0%

100%

Without context: $0.2391 · 55s · 14 turns · 15 in / 3,203 out tokens

With context: $0.2681 · 53s · 16 turns · 176 in / 2,921 out tokens

99%

2%

Personalized Learning Platform Design

Curation workflow and gamification design

Criteria
Without context
With context

Skill assessment step

100%

100%

Challenge selection step

100%

100%

Solution guidance step

100%

87%

Performance analysis step

62%

100%

Progress tracking step

100%

100%

Community engagement step

100%

100%

Dynamic multi-factor scoring

100%

100%

Progressive achievement system

100%

100%

Multi-category/timeframe leaderboard

100%

100%

Learning streaks

100%

100%

Platform credit economy

100%

100%

Social features

100%

100%

Without context: $0.5530 · 4m 7s · 17 turns · 24 in / 11,320 out tokens

With context: $0.4839 · 3m 1s · 19 turns · 24 in / 8,403 out tokens

Evaluated
Agent
Claude Code
Model
Unknown

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.