Name: tessl-labs/intent-integrity-kit
Rating: 92.08 (1 reviews)
Author: tessl-labs

Blog Docs Log in Get started

tessl-labs/intent-integrity-kit

Closing the intent-to-code chasm - specification-driven development with BDD verification chain

1.53x

Quality

92%

Does it follow best practices?

Impact

92%

1.53x

Average score across 14 eval scenarios

Securityby

Advisory

Suggest reviewing before use

Evaluation results

100%

56%

Technical Design for Marketplace Search Feature

Criteria

Without context

With context

ASCII architecture diagram

100%

Named components in diagram

100%

context.json exists

100%

planview.nodeClassifications key

100%

Existing data preserved

100%

Client node classified

100%

Server node classified

100%

Storage node classified

100%

External node classified

100%

56%

Bug Report: Payment Processing Failure on Retry

Criteria

Without context

With context

T-BNNN task ID format

100%

At least 1 fix task with TS ref

100%

TDD task references test spec

100%

bugs.md BUG-NNN entry

100%

bugs.md required fields

70%

100%

bugs.md date format

100%

Existing tasks unmodified

100%

Bug ID in task descriptions

100%

New .feature file created

100%

97%

Constraint Survival: Offline-First Expense Tracker

Criteria

Without context

With context

No external API in core data path

100%

PM's specific service suggestions not adopted blindly

100%

Expense submission works offline in spec

100%

Currency conversion handled offline

100%

Local storage as primary in plan

100%

Sync designed as optional enhancement

100%

Conflict resolution addressed

100%

Spec uses numbered requirements

87%

100%

Spec has acceptance scenarios

50%

100%

Spec is technology-agnostic

62%

75%

Plan references spec requirements

71%

85%

No governance restated in plan

100%

Spec does not promise network-dependent core features

100%

98%

45%

Acceptance Test Suite for a User Notifications Feature

Criteria

Without context

With context

TS-XXX tags present

100%

FR-XXX tags present

80%

100%

SC-XXX tags present

100%

SC-XXX majority coverage

100%

US-XXX tags present

62%

100%

Priority tags present

50%

100%

Test type tags present

100%

DO NOT MODIFY header

83%

Feature-level US tag

100%

All acceptance scenarios covered

100%

TS-XXX uniqueness

100%

Failed

Project Governance Document for a Healthcare Data Platform

97%

58%

Greenfield Full Pipeline: Team Standup Bot

Criteria

Without context

With context

Spec is technology-agnostic

87%

100%

Plan has no governance content

50%

83%

FR-XXX to TS-XXX coverage

100%

TS-XXX to task coverage

100%

.feature files have DO NOT MODIFY headers

100%

.feature files have required tags

87%

Tasks ordered: Setup → Foundational → Stories → Polish

66%

100%

TDD task ordering within story phases

40%

100%

No phantom requirements

100%

Privacy constraint in .feature files

100%

No FR orphans in either direction

75%

87%

TS-XXX IDs are unique across files

100%

78%

47%

Technical Design for Notification Service Feature

Criteria

Without context

With context

plan.md exists

100%

Technical Context fields

100%

No bare Option labels

100%

research.md with rationale

37%

100%

data-model.md entities

100%

State transitions in data-model

50%

100%

contracts/ directory

33%

100%

Contracts reference spec requirements

75%

37%

quickstart.md exists with scenarios

100%

No governance content in plan

100%

Spec quality assessment performed

16%

Plan decisions trace to spec FRs

100%

87%

context.json updated

99%

21%

Plan-to-Tasks Traceability: Event Ticketing Platform

Criteria

Without context

With context

File paths match plan structure

86%

100%

Every user story has tagged tasks

50%

100%

Setup/Foundational tasks have no story tags

100%

TS references are comma-separated

100%

TS references match provided .feature files

100%

Priority ordering respected

100%

Phase structure complete

75%

100%

[P] markers only on parallelizable tasks

25%

87%

No technologies beyond the plan

100%

Checkbox format used

100%

Sequential T-prefixed IDs

66%

100%

95%

Scope Creep Detection: Simple Bookmark Manager

Criteria

Without context

With context

Exactly 3 user stories in spec

100%

No mentioned-but-deferred features in spec

100%

No excluded features in plan

100%

No excluded features in tasks

100%

Spec uses numbered requirements

75%

100%

Spec has acceptance scenarios

100%

FR count proportional to scope

62%

87%

Task count proportional to scope

100%

50%

Data model matches scope

100%

Plan tech stack appropriate for scope

100%

Tasks use structured format

75%

100%

Accessibility addressed

100%

63%

-4%

Update Technical Design: File Upload Feature

Criteria

Without context

With context

NEEDS CLARIFICATION flagged

100%

FR count assessed

37%

Measurable criteria warning

Quality score reported

Semantic diff present

100%

Semantic diff format

40%

20%

Downstream impact flagged

50%

75%

Updated plan has new dependencies

100%

Updated architecture diagram

100%

No governance in plan

100%

50%

Clarification assumptions documented

100%

86%

21%

Spec-to-Plan Phase Separation: IoT Fleet Management

Criteria

Without context

With context

No technology in spec.md

66%

100%

No governance in plan.md

41%

33%

FR-XXX requirements in spec

100%

SC-XXX success criteria in spec

16%

100%

User stories in spec

100%

Given/When/Then scenarios in spec

100%

Plan references spec FRs

40%

60%

Every spec FR traceable to plan

100%

83%

No phantom requirements in plan

100%

data-model.md traces to spec entities

100%

Connection-lost requirement survives to plan

100%

97%

59%

Feature Specification: Team Document Collaboration

Criteria

Without context

With context

No technology stack in spec

13%

100%

FR-XXX numbered requirements

80%

100%

SC-XXX success criteria

100%

Given/When/Then scenarios

100%

User stories present

100%

Measurable success criteria

100%

Max 3 NEEDS CLARIFICATION

100%

No implementation details

100%

2-4 word branch name

80%

70%

Requirements.md checklist created

100%

97%

31%

Task Breakdown for an Inventory Management API

Criteria

Without context

With context

Sequential T-prefixed IDs

58%

100%

[P] marker usage

20%

80%

[USn] label on story tasks

20%

100%

Comma-separated TS references

100%

Phase 1 Setup section

100%

Phase 2 Foundational section

50%

100%

User Story phases ordered by priority

100%

File paths in descriptions

100%

90%

Checkbox format

100%

Polish/Final phase

100%

98%

24%

TDD Pipeline with Constitution Enforcement: Appointment Scheduling API

Criteria

Without context

With context

.feature files cover all FRs

41%

100%

DO NOT MODIFY headers present

100%

All required tags on every Scenario

87%

TS-XXX IDs unique across all files

83%

100%

Scenarios match spec acceptance criteria

90%

100%

Privacy/auth scenario present

100%

Every story task references TS-XXX

93%

100%

TS references are comma-separated

100%

Test tasks before production tasks per story

100%

Task file paths match plan structure

100%

87%

Phase structure and sequential IDs

60%

100%

Concurrency test scenario present

100%

Evaluated: 11 days ago
Agent: Claude Code
Model: Claude Sonnet 4.6

Table of Contents

Technical Design for Marketplace Search Feature Bug Report: Payment Processing Failure on Retry Constraint Survival: Offline-First Expense Tracker Acceptance Test Suite for a User Notifications Feature Greenfield Full Pipeline: Team Standup Bot Technical Design for Notification Service Feature Plan-to-Tasks Traceability: Event Ticketing Platform Scope Creep Detection: Simple Bookmark Manager Update Technical Design: File Upload Feature Spec-to-Plan Phase Separation: IoT Fleet Management Feature Specification: Team Document Collaboration Task Breakdown for an Inventory Management API TDD Pipeline with Constitution Enforcement: Appointment Scheduling API