AI Native DevCon 2026 London — all conference sessions as interactive skills
66
83%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Risky
Do not use without reviewing
John Groetzinger — Principal Engineer, Cisco Customer Experience Engineering. ~14 years at Cisco, joined via the Sourcefire acquisition (intrusion prevention software, still a core technology in Cisco firewall today). Spent ~12 years in Cisco TAC (Technical Assistance Center) supporting financial institutions, banks, hospitals, and critical infrastructure under high-pressure conditions. Full-stack developer for 12–14 years; last ~year focused on production agentic systems. AWS Certified Solutions Architect; AI-native development practitioner since 2023. Recently shipped a customer-facing AI-native multi-agent platform that reached GA "a couple weeks ago" with "over a thousand users on boarded in just a few weeks".
Agent architectures and dev tools change fast — what's current today is legacy in six months. But as long as you're using general-purpose frontier models, you will always need your context. Your runbooks, your engineering standards, your institutional knowledge — that's the layer that compounds. That's where the investment should go.
Here's the problem: most teams manage that context separately for humans and agents, if they manage it at all. The wiki says one thing, the agent's context says another, and someone files a bug about "hallucinations" when the real issue is stale documentation. Worse, if your engineers can't easily read and understand what the agent knows — without digging through a massive prompt — they'll never trust the system.
In Cisco Customer Experience, we've been building context pipelines that serve both audiences from a single source of truth. The same knowledge base article a support engineer reads is the skill an autonomous agent uses to pre-triage customer cases before a human even opens the message. A complex evaluation framework reached a global dev team in days through a skill install — not onboarding meetings or training sessions. When your humans trust the context and your agents use it correctly, the whole system flows.
This talk walks through the context pipelines we've built — packaging, versioning, and evaluating engineering knowledge as installable skills — and the cultural shift to make "is this a skill?" a reflex. Invest in your context — everything else is going to change anyway.
Frontier models are already good enough for business value at the medium tier — the binding constraint is context engineering, not model intelligence. Skills are the right packaging because they are honored across harnesses (Claude Code, Codex/dev, Copilot CLI) and across models. The two patterns that make skills work at enterprise scale are (a) pipelining human-curated knowledge into agent skills with LLM-driven updates gated by change severity and evals, and (b) treating evals as unit tests for agents so that a shared skill can be safely modified like a shared library. Humans should spend their time at the extremes — defining what "good" looks like via evals, and reviewing major changes — never on diffing markdown in the middle.
| # | Section | Summary | Transcript lines |
|---|---|---|---|
| 1 | MC introduction | MC sets up the talk and welcomes John on stage. | 1–8 |
| 2 | The real bottleneck is context, not models | You don't need a smarter model; you need smarter context. Medium-tier models suffice once skills + evals are in place. | 9–32 |
| 3 | Why skills changed his perspective | Skills are honored across harnesses and models, making context portable. Evals unlock using cheap models deterministically. | 33–54 |
| 4 | Personal background | 14 years at Cisco via Sourcefire, 12 years in TAC, last year on production agentic systems. GA platform with 1000+ users. | 55–72 |
| 5 | What a skill is | Minimum is a skill.md; can include rules (more markdown), scripts, scaffolding. Evals are the key addition. Worked example: a repo-standards skill that forces uv over pip. | 73–98 |
| 6 | Story 1 — Pipelining the Cisco TAC knowledge base into skills | Strong KB culture; MCP-search-the-HTML didn't work. Two engineers built a pipeline that converts curated articles into skills with evals. Change-severity gating routes only major changes to humans. | 99–158 |
| 7 | "Skills are not for you — stop writing them by hand" | The skill is for the agent. Humans should spend their precious time on evals and KB content, not on the markdown in the middle. Let the small model build its own skill. | 159–186 |
| 8 | Story 2 — Shipping an eval framework to 8 distributed teams as a skill | Built a custom evaluation framework (JSONL datasets, environment-aware scripts, observability-driven assertions) and packaged it as an installable skill instead of running onboarding meetings. | 187–262 |
| 9 | The README-to-Confluence sync | Single source of truth in a git repo with paired skill.md (for agents) and README.md (for humans). A deterministic markdown→HTML script syncs to Confluence for managers who don't use git. | 263–294 |
| 10 | The cultural shift: "is this a skill?" | Default reflex when anyone asks for documentation. Avoid 15 engineers building 15 skills for the same thing. Evals + shared skills work like a shared library. | 295–320 |
| 11 | What to do tomorrow | Start small with one concept you keep re-explaining. Add evals. Semantically version your skills (0.0.x → 0.x → 1.0). | 321–354 |
| 12 | Takeaways | Architectures change; context is the durable investment. Maintain in one place, distribute to many systems. | 355–366 |
| 13 | Q&A | Skill.md vs README similarity; orchestrating across many skills (skill explosion); repo layout; review/PR culture; pointer to deeper eval discussion offline. | 367–end |
.tessl-plugin
talk-batey-building-product-teams-age-of-ai
talk-birgitta-closing-keynote
talk-debois-agent-enablement
talk-douglas-training-ai-on-your-own-code
talk-dubnov-merge-rate-ai-adoption
talk-farley-vibe-coding-best-we-can-do
talk-firtman-web-mcp-agentic-web
talk-foxwell-reinvention-dev-team
talk-graziano-spec-driven-development
talk-groetzinger-skills-everywhere
talk-jones-odevo-ai-native-transformation
talk-jourdan-pipelines-to-prompts
talk-katsioloudes-code-security-ai
talk-lamis-context-engineering-dreaming
talk-lawson-agent-experience
talk-luebken-embedding-pi-coding-agent
talk-maleix-collective-intelligence
talk-maple-ai-native-devcon-welcome-slick
talk-maple-ai-native-devcon-welcome-spec-reviewer
talk-maple-aind-devcon-welcome
talk-maple-context-engineering-skills
talk-maple-continuous-ai-github-workflows
talk-maple-harness-engineering
talk-maple-tldraw-ai-canvas-experiments
talk-marsden-agent-desktops
talk-martinelli-spec-driven-development
talk-moss-skills-team-workflow
talk-overweg-one-brain-no-filtering
talk-podjarny-skills-are-the-new-code
talk-roberts-ai-native-brownfield
talk-roberts-brownfield-ai-native
talk-scheire-artificial-intelligence
talk-selajev-docker-sandboxes-agents
talk-sloan-harness-engineering-beyond-code
talk-stack-humans-architect-ai-writes-code
talk-stoneham-product-brain
talk-tal-skills-security
talk-thomas-ai-native-engineering
talk-walter-runtime-intelligence-agents
talk-wilson-cq-stack-overflow-for-agents
talk-wotherspoon-humans-vs-slop