ainativedev/latest-aidevcon-speakers-london-2026

AI Native DevCon 2026 London — all conference sessions as interactive skills

Quality: 89% (Does it follow best practices?)

Impact: no eval scenarios have been run

Security: Risky (do not use without reviewing)

Outline - When Tests Lie: Using Observability to Keep AI Honest

Name: ainativedev/latest-aidevcon-speakers-london-2026
Rating: 71.77 (1 review)
Author: ainativedev

Speaker

Justin Cormack

Abstract

[inferred from filename and transcript] When Tests Lie: Using Observability to Keep AI Honest is an AI Native DevCon session covering tests and observability, AI-generated software validation, runtime signals, test reliability, and keeping AI honest.

Thesis

[inferred] The talk's main contribution is its framing of tests and observability, AI-generated software validation, and runtime signals for practitioners working on AI-native software development.

Transcript Status

The source is timestamped speech-to-text output. Speaker labels, punctuation, and some technical terms may be imperfect. Use timestamps when citing.

Timeline

#	Timestamp range	Section	Summary
1	00:00-03:41	Opening and framing	This feedback loop that we really need to understand if AI is truly helping us in our production code or not.
2	03:42-07:31	Main discussion 1	It's like, if we can do this in less time, we can build more of these interesting systems.
3	07:35-11:49	Main discussion 2	And you can build a test suite against a simple version first.
4	11:54-15:57	Main discussion 3	I kept finding edge cases by reading the AWS documentation thinking.
5	16:00-19:48	Main discussion 4	we just had, and it would come up with new kinds of tests.
6	19:55-23:46	Main discussion 5	I mean, it can either fail to reproduce itself or it can guess what the solution might be and get it wrong or something.
7	23:49-28:12	Main discussion 6	I mean Codex security reviews, pull requests, which is fine.
8	28:14-31:49	Closing points	I'm I want to next I'm looking at basically I mean I.

Named Concepts / Search Anchors

Tests And Observability - Topic named in the talk metadata and used as a search anchor for transcript Q&A.
AI-Generated Software Validation - Topic named in the talk metadata and used as a search anchor for transcript Q&A.
Runtime Signals - Topic named in the talk metadata and used as a search anchor for transcript Q&A.
Test Reliability - Topic named in the talk metadata and used as a search anchor for transcript Q&A.
Keeping AI Honest - Topic named in the talk metadata and used as a search anchor for transcript Q&A.

Useful Search Terms

tests and observability
AI-generated software validation
runtime signals
test reliability
keeping AI honest

Open Questions / Limits

The outline is generated from timestamped transcript text rather than speaker-provided slides.
Some transcript terms may be speech-to-text artifacts.
For precise claims, inspect transcript.md around the relevant timestamp.
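
One way to inspect the transcript around a timestamp is sketched below. It is a minimal illustration, not part of the tile: the helper names `to_seconds` and `lines_near` are hypothetical, and the assumed line shape (`MM:SS text` or `[MM:SS] text`) is a guess, since the layout of transcript.md is not shown in this outline.

```python
import re

# Assumption: each transcript line starts with "MM:SS" or "[MM:SS]" followed
# by text. The real transcript.md layout may differ; adjust the pattern.
TIMESTAMP = re.compile(r"^\[?(\d{1,2}):(\d{2})\]?\s+(.*)$")


def to_seconds(stamp: str) -> int:
    """Convert an "MM:SS" string into a number of seconds."""
    minutes, seconds = stamp.split(":")
    return int(minutes) * 60 + int(seconds)


def lines_near(transcript: str, stamp: str, window: int = 30) -> list[str]:
    """Return timestamped lines within `window` seconds of `stamp`."""
    target = to_seconds(stamp)
    hits = []
    for line in transcript.splitlines():
        match = TIMESTAMP.match(line.strip())
        if match is None:
            continue  # skip headings and lines without a leading timestamp
        at = int(match.group(1)) * 60 + int(match.group(2))
        if abs(at - target) <= window:
            hits.append(line.strip())
    return hits


# Placeholder text, not real transcript content.
sample = "11:50 first line\n11:54 second line\n13:10 third line"
print(lines_near(sample, "11:54", window=10))
# prints ['11:50 first line', '11:54 second line']
```

In practice the `transcript` argument would be the text of transcript.md, and `stamp` would come from the Timeline table above, for example the start of a timestamp range.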

Duration Marker

Last observed timestamp: 31:49

.tessl-plugin

talk-azriel-executable-specs-agentic-coding

talk-batey-building-product-teams-age-of-ai

talk-birgitta-closing-keynote

talk-cormack-tests-lie-observability-ai-honest

talk-debois-agent-enablement

talk-douglas-training-ai-on-your-own-code

talk-dubnov-merge-rate-ai-adoption

talk-farley-vibe-coding-best-we-can-do

talk-firtman-web-mcp-agentic-web

talk-foxwell-reinvention-dev-team

talk-graziano-spec-driven-development

talk-groetzinger-skills-everywhere

talk-jones-odevo-ai-native-transformation

talk-jourdan-pipelines-to-prompts

talk-katsioloudes-code-security-ai

talk-kerr-bipolar-disorder-dysregulation-ai

talk-lamis-context-engineering-dreaming

talk-lawson-agent-experience

talk-lopopolo-harness-engineering-humans-steer-agents-execute

talk-luebken-embedding-pi-coding-agent

talk-maleix-collective-intelligence

talk-marsden-agent-desktops

talk-martinelli-spec-driven-development

talk-moss-skills-team-workflow

talk-obstbaum-willoughby-evals-hard

talk-overweg-one-brain-no-filtering

talk-podjarny-skills-are-the-new-code

talk-roberts-ai-native-brownfield

talk-roberts-brownfield-ai-native

talk-scheire-artificial-intelligence

talk-selajev-docker-sandboxes-agents

talk-sloan-harness-engineering-beyond-code

talk-smith-connecting-context-future-transports

talk-stack-humans-architect-ai-writes-code

talk-stoneham-product-brain

talk-syme-agentic-repository-automation

talk-tal-skills-security

talk-thomas-ai-native-engineering

talk-trieloff-browser-agents

talk-walter-runtime-intelligence-agents

talk-wilson-cq-stack-overflow-for-agents

talk-wotherspoon-humans-vs-slop

README.md

tile.json