CtrlK
BlogDocsLog inGet started
Tessl Logo

igmarin/rails-agent-skills

Curated library of AI agent skills for Ruby on Rails development. Covers code review, architecture, security, testing (RSpec), engines, service objects, DDD patterns, and workflow automation.

98

1.38x
Quality

99%

Does it follow best practices?

Impact

98%

1.38x

Average score across 26 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

criteria.jsonevals/scenario-19/

{
  "context": "Tests whether the agent follows the refactor-safely protocol: writing characterization tests that pass on the existing code before any structural change, explicitly stating stable behavior, proposing a minimal step-by-step sequence, verifying tests after each step with concrete evidence, and avoiding mixing behavior changes with structural work.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Stable behavior statement",
      "description": "A written statement declaring which behaviors must not change is present before or alongside the refactoring plan — not implied, but explicitly stated",
      "max_score": 10
    },
    {
      "name": "Characterization tests before refactoring",
      "description": "Spec or test file content appears in the process log or workspace before any implementation-level structural change is described — test code is NOT introduced after the new service file",
      "max_score": 15
    },
    {
      "name": "Characterization tests target current code",
      "description": "The characterization tests are written against the existing code (the fat controller or original structure), not the extracted service — they protect existing behavior, not new behavior",
      "max_score": 10
    },
    {
      "name": "Smallest safe sequence proposed",
      "description": "The refactoring plan lists discrete steps in sequence (numbered or ordered), extracting one boundary per step, not a single all-at-once rewrite",
      "max_score": 10
    },
    {
      "name": "Test run evidence per step",
      "description": "The process log contains at least two entries that report concrete test run results (e.g. pass count, zero failures, exit code) at different points in the sequence — NOT just a final result",
      "max_score": 15
    },
    {
      "name": "No behavior mixed with structure",
      "description": "The refactoring does NOT alter application logic (add discount rules, change tax calculation, add validations) in the same commit or step as the structural extraction",
      "max_score": 12
    },
    {
      "name": "Temporary compatibility noted",
      "description": "If the public interface changes (e.g. the controller action delegates to a new class), the process log or plan explicitly identifies any interim shim or compatibility wrapper and states when it will be removed",
      "max_score": 8
    },
    {
      "name": "No forbidden confidence claims",
      "description": "The process log does NOT contain any of the following unsupported claims without accompanying test evidence: 'should work', 'looks correct', 'I'm confident', or 'this is fine'",
      "max_score": 10
    },
    {
      "name": "One boundary extracted per step",
      "description": "Each numbered step in the plan/log touches a single extraction target (e.g. only the pricing logic, or only the notification dispatch) — not multiple unrelated concerns in one step",
      "max_score": 10
    }
  ]
}

README.md

tile.json