eval-harness

A formal evaluation framework for Claude Code sessions that implements eval-driven development (EDD) principles.

Quality: 50% (Does it follow best practices?)

Security: Medium (suggest reviewing before use)

Fix and improve this skill with Tessl

tessl review fix ./.agents/skills/eval-harness/SKILL.md

The canonical home for this skill is jbvc/eval-harness

Repository: affaan-m/everything-claude-code
Path: .agents/skills/eval-harness/SKILL.md
Commit: 754b8dd
