Curated library of 16 public Ruby AI agent skills covering TDD, refactoring, code review, security review, DDD, YARD documentation, and common design patterns.
94
96%
Does it follow best practices?
Impact
94%
1.13x
Average score across 16 eval scenarios
Advisory
Suggest reviewing before use
{
  "context": "Checks whether the final artifact follows the test-planning-process instructions from the published Ruby Core Skills plugin.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "instruction-1",
      "description": "The submitted artifact follows this skill instruction: The first failing test MUST be identified explicitly in the plan before writing it.",
      "max_score": 17
    },
    {
      "name": "instruction-2",
      "description": "The submitted artifact follows this skill instruction: All test data must be synthetic; never use real production values or API payloads in test plans.",
      "max_score": 17
    },
    {
      "name": "instruction-3",
      "description": "The submitted artifact follows this skill instruction: **Request / API Boundary:** Use when verifying HTTP statuses, headers, query parsing, or JSON payload structures.",
      "max_score": 17
    },
    {
      "name": "instruction-4",
      "description": "The submitted artifact follows this skill instruction: **Service / Business Boundary:** Use when validating domain invariants, complex calculations, or coordination between objects.",
      "max_score": 17
    },
    {
      "name": "instruction-5",
      "description": "The submitted artifact follows this skill instruction: **Unit Boundary:** Use when verifying low-level calculations, state changes on a single object, or formatting logic.",
      "max_score": 16
    },
    {
      "name": "instruction-6",
      "description": "The submitted artifact follows this skill instruction: Run the skeleton as-is — it should fail (Red). Proceed to `tdd-process` to make it pass.",
      "max_score": 16
    }
  ]
}
.tessl-plugin
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
scenario-14
scenario-15
scenario-16
skills
code-quality
respond-to-review
ddd
define-domain-language
model-domain
review-domain-boundaries
docs
write-yard-docs
orchestration
skill-router
patterns
create-service-object
implement-calculator-pattern
planning
generate-tdd-tasks
process
testing
triage-bug
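
The `weighted_checklist` rubric shown earlier assigns each instruction a `max_score`, and the six values sum to 100. The page does not document how an evaluator combines them, so the following is only a minimal sketch under one assumption: the scenario score is the points earned divided by the total of all `max_score` values. The helper name `weighted_score` and the `awarded` hash are hypothetical and not part of the plugin.

```ruby
require "json"

# Hypothetical helper (not part of the plugin). ASSUMPTION: a scenario's score
# is the sum of awarded points over the sum of max_score, as a percentage.
def weighted_score(checklist, awarded)
  max_total = checklist.sum { |item| item.fetch("max_score") }
  earned = checklist.sum do |item|
    # Items with no awarded points count as 0; awards are clamped to max_score.
    points = awarded.fetch(item.fetch("name"), 0)
    points.clamp(0, item.fetch("max_score"))
  end
  (100.0 * earned / max_total).round(1)
end

# Names and max_score values copied from the rubric; descriptions omitted.
rubric = JSON.parse(<<~RUBRIC)
  {
    "type": "weighted_checklist",
    "checklist": [
      { "name": "instruction-1", "max_score": 17 },
      { "name": "instruction-2", "max_score": 17 },
      { "name": "instruction-3", "max_score": 17 },
      { "name": "instruction-4", "max_score": 17 },
      { "name": "instruction-5", "max_score": 16 },
      { "name": "instruction-6", "max_score": 16 }
    ]
  }
RUBRIC

# Illustrative (made-up) awards for a single scenario.
awarded = {
  "instruction-1" => 17,
  "instruction-2" => 17,
  "instruction-3" => 10,
  "instruction-5" => 16
}

puts weighted_score(rubric.fetch("checklist"), awarded) # prints 60.0
```

Clamping each award to its `max_score` keeps a single over-scored item from pushing the total past 100; whether the real evaluator does this is unknown.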