Collect and document UX feedback when using tools as an AI agent.
80
100%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Loading evals