Name: pantheon-ai/test-driven-development
Author: pantheon-ai

Master Test-Driven Development with deterministic red-green-refactor workflows, test-first feature delivery, bug reproduction through failing tests, behavior-focused assertions, and refactoring safety; use when implementing new functions, changing APIs, fixing regressions, or restructuring code under test.

title: Test Behavior Not Implementation
impact: CRITICAL
impactDescription: reduces test brittleness by 50-80%
tags: design, behavior, black-box, maintainability

Test Behavior Not Implementation

Tests should verify what the code does (behavior), not how it does it (implementation). Implementation-coupled tests break during refactoring even when behavior is preserved.

Incorrect (testing implementation details):

test('sortUsers calls quicksort with correct comparator', () => {
  const users = [
    { name: 'Bob', id: '1' },
    { name: 'Alice', id: '2' }
  ]
  // Spying on an internal helper couples the test to the chosen algorithm
  const quicksortSpy = jest.spyOn(sortUtils, 'quicksort')

  sortUsers(users, 'name')

  expect(quicksortSpy).toHaveBeenCalledWith(
    users,
    expect.any(Function)
  )
  // Breaks if we switch to mergesort, even though behavior is identical
})

test('caches user in internal Map', async () => {
  const service = new UserService()
  await service.getUser('123')

  // Testing private implementation detail
  expect(service['cache'].has('123')).toBe(true)
  // Breaks if we change cache structure
})

Correct (testing observable behavior):

test('sortUsers returns users ordered by name', () => {
  const users = [
    { name: 'Charlie', id: '1' },
    { name: 'Alice', id: '2' },
    { name: 'Bob', id: '3' }
  ]

  const sorted = sortUsers(users, 'name')

  expect(sorted.map(u => u.name)).toEqual(['Alice', 'Bob', 'Charlie'])
  // Passes regardless of sorting algorithm used
})

test('getUser returns same instance on repeated calls', async () => {
  const service = new UserService()

  const first = await service.getUser('123')
  const second = await service.getUser('123')

  expect(first).toBe(second)
  // Tests caching behavior without knowing how it's implemented
})

Ask yourself:

Would this test break if I refactored the internals?
Am I testing the public API or a private implementation detail?
Does this test document what users of this code care about?
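
One way to answer the first question is to run the same behavior check against two interchangeable implementations. The sketch below is illustrative only: `sortUsersBuiltin`, `sortUsersMerge`, and `namesInSortedOrder` are hypothetical names that do not come from the skill, and the check uses a plain `throw` rather than Jest so that it runs on its own in Node.

```javascript
// Hypothetical implementation A: delegates to the built-in Array sort.
function sortUsersBuiltin(users, key) {
  return [...users].sort((a, b) => a[key].localeCompare(b[key]))
}

// Hypothetical implementation B: a hand-written merge sort.
function sortUsersMerge(users, key) {
  if (users.length <= 1) {
    return [...users]
  }
  const mid = Math.floor(users.length / 2)
  const left = sortUsersMerge(users.slice(0, mid), key)
  const right = sortUsersMerge(users.slice(mid), key)
  const merged = []
  let i = 0
  let j = 0
  while (i < left.length && j < right.length) {
    // Taking from the left on ties keeps equal keys in their original order
    if (left[i][key].localeCompare(right[j][key]) <= 0) {
      merged.push(left[i++])
    } else {
      merged.push(right[j++])
    }
  }
  return merged.concat(left.slice(i), right.slice(j))
}

// The behavior check only looks at inputs and outputs,
// so it never needs to know which algorithm ran.
function namesInSortedOrder(sortUsers) {
  const users = [
    { name: 'Charlie', id: '1' },
    { name: 'Alice', id: '2' },
    { name: 'Bob', id: '3' }
  ]
  return sortUsers(users, 'name').map(u => u.name)
}

for (const impl of [sortUsersBuiltin, sortUsersMerge]) {
  const names = namesInSortedOrder(impl)
  if (names.join(',') !== 'Alice,Bob,Charlie') {
    throw new Error(`${impl.name} did not order users by name`)
  }
}
```

Because the check passes for both implementations, swapping one for the other is a pure refactor from the test's point of view. A spy on an internal helper would fail for at least one of them.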

Reference: Testing Implementation Details - Kent C. Dodds

Install with Tessl CLI

npx tessl i pantheon-ai/test-driven-development
