AI Native DevCon 2026 London — all conference sessions as interactive skills
70
88%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Tammuz Dubnov — Founder & CTO of Autonomy AI, which builds autonomous AI agents enabling non-technical and technical users to ship code safely into enterprise codebases through agent-driven workflows. Over a decade leading AI startups across text, vision, audio, and vector domains; holds several patents and authored multiple publications. Served as an elite academic officer in Unit 8200, leading AI-driven projects. UC Berkeley graduate at 18 with honours in Theoretical Mathematics; master's in AI from UC San Diego. International speaker, university lecturer, and award-winning performer.
In early 2026, our PM started opening pull requests to our production codebase. Not prototypes — real, merged code. This talk is about what that demanded from the engineering team, and from the PM herself.
AI didn't eliminate our bottleneck — it moved it. Code generation stopped being the constraint. Review, coordination, and architectural alignment became the new pressure points. Merge rate — the percentage of PRs that actually land in production — became the signal that told us whether the team was adapting or drowning.
Getting there required change on both sides. Engineers had to rethink how they review, what they gate on, and how much bandwidth they allocate. The PM had to learn which changes she could ship independently and which required coordination first. Neither side got it right immediately.
Once we found that alignment, the effect compounded. Features that used to take a full sprint now land in days. Entire product areas are owned end-to-end by the PM. The developers freed from feature delivery work at a higher level — focused on architecture, patterns, and system design. The team didn't just absorb the change. It got faster because of it.
"AI-native" doesn't mean giving developers bigger token budgets — it means collapsing the handover so the person who cares and has authority can also execute, with agents doing the work. The bottleneck has moved from code generation to review/coordination/architecture. Merge rate (and especially the share of non-tech-authored PRs that merge with zero dev touches) is the metric that tells you if you're actually adapting. Doing this requires (a) tools tuned for the role — not cloud code for everyone, (b) a harness that evolves as agents make mistakes, and (c) keeping engineering guards (tests, feature flags, architecture review) in place even as velocity rises.
| § | Heading | 1-line summary | Lines |
|---|---|---|---|
| 1 | Intro & framing questions | Host intro; Tammuz polls the room on CFO AI-spend conversations and on whether PMs/designers are opening PRs | ~1–25 |
| 2 | What "AI-native" actually means | Tammuz's definition: the person who cares + has authority is also the person who can do the work; AI collapses the handover | ~26–55 |
| 3 | Why handover is the real bottleneck | Sprint-cycle handovers from PM→design→dev→review compress to ~13 minutes; everyone gets to focus on what they care about | ~56–95 |
| 4 | Wrong ways to go AI-native — Uber & Microsoft cautionary tales | Uber 6× AI spend exhausted in 4 months with no measurable feature-velocity link; Microsoft pulling back Claude Code rollout | ~96–125 |
| 5 | Where the tokens actually go | Of $100 spent on AI, only ~$18 ships meaningful code; rest goes to rework/bugs | ~126–138 |
| 6 | Right way — Shopify as positive example | Empowering non-engineers with thousands of Cursor licenses; ~50% of non-eng output accepted as-is | ~139–160 |
| 7 | Wrong-vs-right patterns enumerated | (a) more tokens to same devs ≠ velocity; (b) PM prototyping tools = "hurry up and wait"; (c) cloud code to everyone = PR fatigue + desk visits | ~161–195 |
| 8 | Harness engineering | Definition of harness; principles (onboard self, product-level language, long sessions, self-check, learn across users); feasibility on complex monorepos | ~196–250 |
| 9 | Authority boundaries — the failed PR example | Designer's image-versioning PR was tech-correct but storage-architecture-wrong; got rebuilt by dev, design retained | ~251–285 |
| 10 | How to measure AI-native adoption | (a) PR count per non-tech contributor; (b) merge rate (~74% benchmark); (c) zero-dev-touch rate of merged PRs (~84% benchmark) | ~286–315 |
| 11 | Closing — democratising authorship | Need an "OS unco" [likely "OS uncomplicated" / agent-OS] absolutely coupled to your codebase | ~316–330 |
| 12 | Q&A — measurement tooling | Tammuz: Autonomy's own system tracks author (shows as "Tammuz I bought" bot) + commit stream for post-merge touches | ~331–345 |
| 13 | Q&A — do engineers move into UI/PM space? | Yes — devs make product decisions to merge fast, everything feature-flagged, PMs/QA follow up with cleanup PRs | ~346–375 |
| 14 | Q&A — rollback / safety mechanisms | Depends on org's existing CI; harness adopts your practices (feature flags if you have them, tests if you write them); not opinionated | ~376–395 |
| 15 | Q&A — proving PR-fatigue cost to leadership | Agent labels every PR with risk level + size so reviewers can prioritise; team-wide load visibility | ~396–425 |
| 16 | Wrap-up & off-mic fragments | Host wraps; trailing post-talk fragments (someone from "Grana lamps" / Granola? on a green-field project) | ~426–end |
⚠️ The transcript contains heavy speech-to-text noise. "Animated" almost always means "AI-native". "Heard" / "harness" appears garbled in places. "Autonomy" is sometimes "Autonomy AI" / "Tanya" / "antinomy". Quote what's actually in the transcript and flag the likely intended word.
.tessl-plugin
talk-azriel-executable-specs
talk-baker-sadogursky-context-engineering-skills
talk-batey-building-product-teams-age-of-ai
talk-birgitta-closing-keynote
talk-cormack-tests-lie-observability-ai
talk-debois-agent-enablement
talk-douglas-training-ai-on-your-own-code
talk-dubnov-merge-rate-ai-adoption
talk-farley-vibe-coding-best-we-can-do
talk-firtman-web-mcp-agentic-web
talk-foxwell-reinvention-dev-team
talk-groetzinger-skills-everywhere
talk-jones-odevo-ai-native-transformation
talk-jourdan-pipelines-to-prompts
talk-katsioloudes-code-security-ai
talk-kerr-bipolar-disorder-dysregulation-ai
talk-kushwaha-benchmarking-agent-era
talk-lamis-context-engineering-dreaming
talk-lawson-agent-experience
talk-lopopolo-harness-engineering
talk-lubken-embedding-pi-coding-agent
talk-maleix-collective-intelligence
talk-marsden-agent-desktops
talk-martinelli-spec-driven-development
talk-moss-skills-team-workflow
talk-obstbaum-willoughby-vibes-to-metrics
talk-overweg-one-brain-no-filtering
talk-podjarny-skills-are-the-new-code
talk-roberts-ai-native-brownfield
talk-roberts-brownfield-ai-native
talk-ruiz-agents-on-canvas-tldraw
talk-scheire-artificial-intelligence
talk-selajev-docker-sandboxes-agents
talk-sloan-harness-engineering-beyond-code
talk-smith-connecting-context-future-transports
talk-stack-humans-architect-ai-writes-code
talk-syme-agentic-repository-automation
talk-thomas-ai-native-engineering
talk-trieloff-browser-agents
talk-walter-runtime-intelligence-agents
talk-wotherspoon-humans-vs-slop