Name: tdg-personal/agent-eval
Rating: 57.599999999999994 (1 reviews)
Author: tdg-personal

tdg-personal/agent-eval

Head-to-head comparison of coding agents (Claude Code, Aider, Codex, etc.) on custom tasks with pass rate, cost, time, and consistency metrics

Quality

72%

Does it follow best practices?

Run evals on this skill

Adds up to 20 points to the overall score

View guide

Securityby

Passed

No findings from the security scan

Scanned 3 months ago