CtrlK
BlogDocsLog inGet started
Tessl Logo

tessl-labs/eval-setup

Generate eval scenarios from repo commits, configure multi-agent runs, execute baseline + with-context evals, and compare results — the full setup pipeline before improvement begins

90

3.37x
Quality

90%

Does it follow best practices?

Impact

91%

3.37x

Average score across 2 eval scenarios

SecuritybySnyk

Advisory

Suggest reviewing before use

Overview
Quality
Evals
Security
Files

task.mdevals/scenario-1/

The user says: "I want to understand if my CLAUDE.md is actually helping my AI agent. I have a monorepo at acme/backend on GitHub with TypeScript and Node.js code. I've never run any evals before — can you walk me through it?"

Walk the user through setting up and running a Tessl codebase eval on their repository from start to finish.

evals

scenario-1

criteria.json

task.md

README.md

tile.json