CtrlK
BlogDocsLog inGet started
Tessl Logo

tdg-personal/agent-eval

Head-to-head comparison of coding agents (Claude Code, Aider, Codex, etc.) on custom tasks with pass rate, cost, time, and consistency metrics

72

Quality

72%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

No security issues found

Scanned

Audited
Security analysis
Snyk