Content creator for tessl.io — generates publish-ready blog articles with SEO metadata, Tessl house style, and technical authority.
90
Does it follow best practices? 79%
Impact: 92% (1.26x average score across 10 eval scenarios)
Passed: no known issues
The Tessl content team wants to publish a comparison article that helps developers choose between the leading agent evaluation frameworks currently available. The audience is engineers who are actively building AI agents and have reached the point where they need a structured approach to measuring agent accuracy, but haven't yet committed to a framework.
The content lead's brief:
"This needs to be genuinely useful, not a feature matrix that tells people nothing. Pick the dimensions that actually matter for this decision and give us your honest take on when to use each. Don't just describe features — tell us who each framework is right for. Be specific. Opinionated is fine; wishy-washy is not. Publish-ready with full metadata."
Compare the following three agent evaluation frameworks (invented but realistic):
EvalKit
Orion Evals
spec-eval
Write a publish-ready comparison article for the tessl.io blog. The article should:
Use suggestive rather than definitive language when describing benefits claimed by any of the three frameworks.
Save the completed article as article.md in the current working directory.
The file must include a metadata block at the top (title, type, primary keyword, meta description, URL slug, internal links, estimated read time) followed by the full article body in markdown.
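The metadata requirement above can be checked mechanically. The sketch below is one possible check, not part of the brief: it assumes the metadata block is written as `Key: value` lines between two `---` fences at the top of article.md, a layout the brief does not specify. The helper names (`parse_metadata_block`, `missing_fields`) and every value in the sample are hypothetical placeholders; only the seven field names come from the brief.

```python
# Field names taken from the brief; the "Key: value" layout is an assumption.
REQUIRED_FIELDS = [
    "title",
    "type",
    "primary keyword",
    "meta description",
    "url slug",
    "internal links",
    "estimated read time",
]


def parse_metadata_block(markdown: str) -> dict[str, str]:
    """Collect 'Key: value' pairs found between the first two '---' fences."""
    lines = markdown.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no fenced block at the top of the file
    fields: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing fence: the article body starts after this
        key, sep, value = line.partition(":")
        if sep:
            # Keys are compared case-insensitively; only single-line values are handled.
            fields[key.strip().lower()] = value.strip()
    return fields


def missing_fields(markdown: str) -> list[str]:
    """Return the required field names that are absent or have an empty value."""
    found = parse_metadata_block(markdown)
    return [name for name in REQUIRED_FIELDS if not found.get(name)]


# Hypothetical sample; every value here is a placeholder.
SAMPLE = """---
Title: Placeholder title
Type: comparison
Primary keyword: agent evaluation frameworks
Meta description: Placeholder description.
URL slug: /blog/placeholder-slug
Internal links: /blog/placeholder-link
Estimated read time: 8 min
---

# Placeholder title
"""

if __name__ == "__main__":
    print(missing_fields(SAMPLE))
```

Because the parser reads one line per field, a multi-line list under "Internal links" would be reported as missing; a real check would need to follow whatever layout the house style actually prescribes.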