tessl-labs/intent-integrity-kit

Closing the intent-to-code chasm - specification-driven development with BDD verification chain

Quality: 92% (Does it follow best practices?)
Impact: 86%, 1.82x (average score across 14 eval scenarios)
Security by Snyk: Advisory (suggest reviewing before use)

evals/scenario-11/criteria.json

{
  "context": "Tests whether the agent generates tasks.md following IIKit's strict task format: sequential T-prefixed IDs, [P] markers only for parallelizable tasks, [USn] labels only for user story tasks, comma-separated TS-XXX references (not prose ranges), and the correct phase structure (Setup, Foundational, User Stories, Polish).",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Sequential T-prefixed IDs",
      "description": "All tasks have sequential IDs in the format T001, T002, T003... (three-digit zero-padded numbers with T prefix)",
      "max_score": 12
    },
    {
      "name": "[P] marker usage",
      "description": "Tasks that work on different files with no mutual dependencies are marked [P], and tasks with dependencies or shared file access are NOT marked [P]",
      "max_score": 10
    },
    {
      "name": "[USn] label on story tasks",
      "description": "Tasks that implement user story functionality include a [US1], [US2], or [US3] label, while Setup and Foundational tasks do NOT include a [USn] label",
      "max_score": 10
    },
    {
      "name": "Comma-separated TS references",
      "description": "When multiple test spec IDs are referenced, they are listed as a comma-separated list (e.g., [TS-001, TS-002]) NOT as prose ranges like 'TS-001 through TS-002'",
      "max_score": 15
    },
    {
      "name": "Phase 1 Setup section",
      "description": "tasks.md contains a Phase 1 (or equivalent) section for project initialization/setup tasks (e.g., project structure, pyproject.toml, database setup)",
      "max_score": 8
    },
    {
      "name": "Phase 2 Foundational section",
      "description": "tasks.md contains a Phase 2 (or equivalent) Foundational section for shared prerequisites that must complete before user story tasks (e.g., base models, database connection)",
      "max_score": 8
    },
    {
      "name": "User Story phases ordered by priority",
      "description": "US-1 (P1) and US-2 (P1) tasks appear in earlier phases than US-3 (P2) tasks, reflecting priority ordering from the spec",
      "max_score": 8
    },
    {
      "name": "File paths in descriptions",
      "description": "Task descriptions include specific file paths (e.g., 'src/models/item.py', 'tests/step_definitions/') rather than vague descriptions without paths",
      "max_score": 10
    },
    {
      "name": "Checkbox format",
      "description": "All tasks use the markdown checkbox format `- [ ]` (with space inside the brackets)",
      "max_score": 9
    },
    {
      "name": "Polish/Final phase",
      "description": "tasks.md contains a final Polish or Cross-Cutting Concerns phase for integration testing, documentation, or cleanup tasks",
      "max_score": 10
    }
  ]
}
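
Several of the checklist items above (sequential IDs, checkbox format, comma-separated TS references) are mechanical enough to check with a short script. The following is a minimal sketch, not IIKit's own validator. The line shape `- [ ] T001 [P] [US1] description [TS-001, TS-002]`, the order of the markers, and the names `check_tasks`, `TASK_RE`, and `SAMPLE` are assumptions made for illustration; the criteria file does not spell out the exact layout of a task line.

```python
import re

# Assumed task line shape (hypothetical, inferred from the checklist):
#   - [ ] T001 [P] [US1] Description with a file path [TS-001, TS-002]
TASK_RE = re.compile(
    r"^- \[ \] (?P<id>T\d{3})"   # checkbox with a space, then a T-prefixed 3-digit ID
    r"(?P<parallel> \[P\])?"     # optional [P] marker
    r"(?P<story> \[US\d+\])?"    # optional [USn] label
    r" (?P<desc>.+)$"            # free-text description
)

# Prose ranges such as "TS-001 through TS-002", which the checklist disallows.
TS_PROSE_RE = re.compile(r"TS-\d{3}\s+(?:through|thru|to)\s+TS-\d{3}", re.IGNORECASE)


def check_tasks(markdown: str) -> list[str]:
    """Return a list of human-readable problems found in a tasks.md string."""
    problems = []
    expected = 1  # next task number we expect to see
    for line in markdown.splitlines():
        if not line.lstrip().startswith("- ["):
            continue  # skip headings, blank lines, and prose
        match = TASK_RE.match(line)
        if match is None:
            # Covers "- []" without a space, indented tasks, and missing IDs.
            problems.append(f"malformed task line: {line!r}")
            continue
        task_id = match["id"]
        if task_id != f"T{expected:03d}":
            problems.append(f"expected T{expected:03d}, found {task_id}")
        expected = int(task_id[1:]) + 1  # resynchronize on the ID actually seen
        if TS_PROSE_RE.search(match["desc"]):
            problems.append(f"{task_id}: prose TS range, use a comma-separated list")
    return problems


SAMPLE = """\
## Phase 1: Setup
- [ ] T001 Create project structure and pyproject.toml
- [ ] T002 [P] Configure database connection in src/db.py
## Phase 3: User Story 1
- [ ] T003 [US1] Implement src/models/item.py [TS-001, TS-002]
"""

if __name__ == "__main__":
    print(check_tasks(SAMPLE))  # prints []
```

The sketch deliberately leaves out the judgment-based items, such as whether a `[P]` task really has no shared file access or whether phases follow the spec's priority order, since those depend on the contents of the spec rather than on line syntax.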