AI Native DevCon 2026 London — all conference sessions as interactive skills
71
89%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Risky
Do not use without reviewing
Groetzinger argues that frontier models are already capable enough for business value — the real bottleneck is context engineering, and skills are the durable, harness-portable, model-portable investment. He walks through two Cisco patterns: (1) a support-side pipeline that converts curated knowledge-base articles into agent skills with change-severity-gated human review, and (2) a developer-side pattern of shipping an evaluation framework to eight globally distributed teams as an installable skill instead of an onboarding meeting. The unifying discipline is evals as unit tests for agents and a cultural reflex of asking "Is this a skill?" before writing any piece of institutional knowledge.
Each line in the eval dataset is a self-contained JSON object:
{"input": "<user turn or task prompt>", "expected": "<ideal agent response or action>", "tags": ["<scenario-tag>"]}Example entries:
{"input": "How do I reset a Cisco device to factory defaults?", "expected": "Step 1: ...", "tags": ["reset", "hardware"]}
{"input": "Summarise the BGP configuration policy.", "expected": "The BGP policy requires ...", "tags": ["networking", "policy"]}The dataset file is co-versioned with the skill (see Semantic Versioning below). A failing eval on a new model version is a signal to update the skill before deploying to production.
Groetzinger's Cisco support team converts curated knowledge-base articles into agent skills using an LLM-gated diff pipeline:
A minimal skill following Groetzinger's pattern:
---
name: <kebab-case-name>
description: "Use when ... <trigger scenarios and natural keywords>"
version: 0.1.0
---
# <Skill Title>
<One-paragraph summary of what the skill does and why it exists.>
## When to Use
- <Trigger condition 1>
- <Trigger condition 2>
## Steps
1. <Step 1>
2. <Step 2>
## Evals
See `evals.jsonl` co-located with this file.| Version | Meaning |
|---|---|
| 0.0.x | Patch: minor wording fixes, no behaviour change |
| 0.x.0 | Minor: new content added, no breaking change to evals |
| 1.0.0 | Stable: passing eval suite, at least one production deployment, human-reviewed README synced to Confluence |
A skill graduates to 1.0 when it has a consistently passing eval suite, a production deployment record, and a human-reviewed README synced to the internal wiki.
Groetzinger's approach for scaling to eight globally distributed teams:
| Artifact | Starting Point |
|---|---|
| Eval dataset | Copy the JSONL schema above; write 5–10 entries covering key scenarios for the skill |
| Skill file | Use the Skill File Structure template above; start at version 0.1.0 |
| KB-to-skill pipeline | Follow the 6-step pipeline above; adjust gate thresholds to your team's review capacity |
| Confluence sync | Treat the skill README as the single source of truth; automate the push on merge |
.tessl-plugin
talk-azriel-executable-specs-agentic-coding
talk-batey-building-product-teams-age-of-ai
talk-birgitta-closing-keynote
talk-cormack-tests-lie-observability-ai-honest
talk-debois-agent-enablement
talk-douglas-training-ai-on-your-own-code
talk-dubnov-merge-rate-ai-adoption
talk-farley-vibe-coding-best-we-can-do
talk-firtman-web-mcp-agentic-web
talk-foxwell-reinvention-dev-team
talk-graziano-spec-driven-development
talk-groetzinger-skills-everywhere
talk-jones-odevo-ai-native-transformation
talk-jourdan-pipelines-to-prompts
talk-katsioloudes-code-security-ai
talk-kerr-bipolar-disorder-dysregulation-ai
talk-lamis-context-engineering-dreaming
talk-lawson-agent-experience
talk-lopopolo-harness-engineering-humans-steer-agents-execute
talk-luebken-embedding-pi-coding-agent
talk-maleix-collective-intelligence
talk-marsden-agent-desktops
talk-martinelli-spec-driven-development
talk-moss-skills-team-workflow
talk-obstbaum-willoughby-evals-hard
talk-overweg-one-brain-no-filtering
talk-podjarny-skills-are-the-new-code
talk-roberts-ai-native-brownfield
talk-roberts-brownfield-ai-native
talk-scheire-artificial-intelligence
talk-selajev-docker-sandboxes-agents
talk-sloan-harness-engineering-beyond-code
talk-smith-connecting-context-future-transports
talk-stack-humans-architect-ai-writes-code
talk-stoneham-product-brain
talk-syme-agentic-repository-automation
talk-tal-skills-security
talk-thomas-ai-native-engineering
talk-trieloff-browser-agents
talk-walter-runtime-intelligence-agents
talk-wilson-cq-stack-overflow-for-agents
talk-wotherspoon-humans-vs-slop