CtrlK
BlogDocsLog inGet started
Tessl Logo

pantheon-ai/tessl-publish-public

Ensure Tessl tiles meet all requirements for public registry publishing with comprehensive validation, quality gates, and evaluation scenarios. Use when preparing skills for public Tessl release, validating tile.json configuration, creating evaluation scenarios, enforcing quality thresholds, or checking agent-agnostic compliance. Keywords: tessl, tile, publishing, public-registry, validation, quality-gates, tile.json, evaluation-scenarios, skill-publishing

94

Quality

94%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

scenario-07.mdevaluation-scenarios/

Scenario 07: Tessl Optimization Impact

User Prompt

"The typescript-advanced skill scored 86% on Tessl review. Optimize it for public publishing."

Expected Behavior

  1. Agent recognizes 86% is below 90% threshold
  2. Agent runs tessl skill review skills/development/typescript-advanced --optimize
  3. Agent waits for optimization to complete
  4. Agent runs tessl skill review skills/development/typescript-advanced again to check improvements
  5. Agent compares before (86%) and after scores
  6. Agent explains what optimization changed/improved
  7. If now ≥90%, agent confirms ready for next publication steps
  8. If still <90%, agent suggests manual improvements needed

Success Criteria

  • Agent identifies 86% is below 90% threshold
  • Agent uses --optimize flag correctly
  • Agent runs review twice (before and after optimization)
  • Agent reports score improvement (e.g., 86% → 94%)
  • Agent explains optimization impact clearly
  • Agent confirms next steps based on final score
  • Agent documents the optimization results
  • Agent doesn't proceed to publish if still <90%

Failure Conditions

  • Agent skips optimization step
  • Agent publishes at 86% without optimization attempt
  • Agent runs optimization but doesn't re-review
  • Agent doesn't compare before/after scores
  • Agent proceeds to publish if still <90% after optimization
  • Agent misinterprets optimization flag usage
  • Agent suggests lowering threshold instead of optimizing

evaluation-scenarios

scenario-01.md

scenario-02.md

scenario-03.md

scenario-04.md

scenario-05.md

scenario-06.md

scenario-07.md

scenario-08.md

scenario-09.md

SKILL.md

tile.json