AI Native DevCon 2026 London — all conference sessions as interactive skills
71
89%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Risky
Do not use without reviewing
Justin Cormack
[inferred from filename and transcript] When Tests Lie: Using Observability to Keep AI Honest is an AI Native DevCon session covering tests and observability, AI-generated software validation, runtime signals, test reliability, and keeping AI honest.
[inferred] The talk's main contribution is its framing of tests and observability, AI-generated software validation, runtime signals for practitioners working with AI-native software development.
The source is timestamped speech-to-text output. Speaker labels, punctuation, and some technical terms may be imperfect. Use timestamps when citing.
| # | Timestamp range | Section | Summary |
|---|---|---|---|
| 1 | 00:00-03:41 | Opening and framing | This this feedback loop that we really need to understand if AI is is truly helping us in our production code or not. |
| 2 | 03:42-07:31 | Main discussion 1 | It's like, if we can do this in less time, we can build more of these interesting systems. |
| 3 | 07:35-11:49 | Main discussion 2 | And you can build a test suite against a simple version first. |
| 4 | 11:54-15:57 | Main discussion 3 | I kept finding edge cases by reading the AWS documentation thinking. |
| 5 | 16:00-19:48 | Main discussion 4 | we just had, and it would come up with new kinds of tests. |
| 6 | 19:55-23:46 | Main discussion 5 | I mean, it can either fail to reproduce itself or it can guess what the solution might be and get it wrong or something. |
| 7 | 23:49-28:12 | Main discussion 6 | I mean Codex security reviews, pull requests, which is fine. |
| 8 | 28:14-31:49 | Closing points | I'm I want to next I'm looking at basically I mean I. |
transcript.md around the relevant timestamp.Last observed timestamp: 31:49
.tessl-plugin
talk-azriel-executable-specs-agentic-coding
talk-batey-building-product-teams-age-of-ai
talk-birgitta-closing-keynote
talk-cormack-tests-lie-observability-ai-honest
talk-debois-agent-enablement
talk-douglas-training-ai-on-your-own-code
talk-dubnov-merge-rate-ai-adoption
talk-farley-vibe-coding-best-we-can-do
talk-firtman-web-mcp-agentic-web
talk-foxwell-reinvention-dev-team
talk-graziano-spec-driven-development
talk-groetzinger-skills-everywhere
talk-jones-odevo-ai-native-transformation
talk-jourdan-pipelines-to-prompts
talk-katsioloudes-code-security-ai
talk-kerr-bipolar-disorder-dysregulation-ai
talk-lamis-context-engineering-dreaming
talk-lawson-agent-experience
talk-lopopolo-harness-engineering-humans-steer-agents-execute
talk-luebken-embedding-pi-coding-agent
talk-maleix-collective-intelligence
talk-marsden-agent-desktops
talk-martinelli-spec-driven-development
talk-moss-skills-team-workflow
talk-obstbaum-willoughby-evals-hard
talk-overweg-one-brain-no-filtering
talk-podjarny-skills-are-the-new-code
talk-roberts-ai-native-brownfield
talk-roberts-brownfield-ai-native
talk-scheire-artificial-intelligence
talk-selajev-docker-sandboxes-agents
talk-sloan-harness-engineering-beyond-code
talk-smith-connecting-context-future-transports
talk-stack-humans-architect-ai-writes-code
talk-stoneham-product-brain
talk-syme-agentic-repository-automation
talk-tal-skills-security
talk-thomas-ai-native-engineering
talk-trieloff-browser-agents
talk-walter-runtime-intelligence-agents
talk-wilson-cq-stack-overflow-for-agents
talk-wotherspoon-humans-vs-slop