Skills are the new Code by Guy Podjarny
89
90%
Does it follow best practices?
Impact
87%
1.38xAverage score across 4 eval scenarios
Passed
No known issues
Guy Podjarny — Founder and CEO of Tessl, which is "reimagining software development for the AI era" and powers AI Native Dev Con. Previously founded Snyk (created the Developer Security category, now a multi-billion-dollar company with 1000+ employees). Former CTO at Akamai (following acquisition of his first startup). Active angel investor, co-host of the AI Native Dev podcast.
Software development has always revolved around the instructions we give the machine - punch cards, then assembly, then code. As agentic development takes hold, agent skills are becoming our unit of software - but we're not treating them this way.
In this keynote, I'll make the case that agent skills deserve the same rigour we've spent decades applying to code. Crafted with intention. Tested against real behaviour. Versioned and maintained in step with the project around them. Treating skills as an afterthought isn't just technical debt, it's the difference between AI that ships and AI that drifts.
The new units of agentic software are tools, context, and harnesses — composing upward into factory lines and factories. Within those, reusable skills are the asset developers edit most, and they now exhibit the same failure modes code does: security risk, collaboration friction, and rot. The remedy is to import the toolchain we built for code — static analysis, dynamic tests (evals), dependency management, security tooling, and observability — and wrap them in a Context Development Life Cycle (CDLC) that humans own while agents handle the SDLC.
| Section | Summary | Transcript lines |
|---|---|---|
| Pre-talk disclaimers | Guy notes this is a dry run, slides are still rough, asks for feedback | 1–18 |
| Opening & three-part agenda | New units of software; skills are the new code; dev tools for skills | 19–44 |
| Unit 1 — Tools | CLIs, MCPs, APIs; deterministic, save tokens, compose | 45–68 |
| Unit 2 — Context | Practices/policies, specs, workflows; rules, skills, passive context; skills compose | 69–98 |
| Unit 3 — Harnesses | Deterministic software wrapping probabilistic models; Claude Code as example; plugins and hooks; harnesses compose into factory lines | 99–138 |
| The agentic software stack | Tools → context → harnesses → factory lines → factories | 139–158 |
| Why skills are the dominant reusable context | Skills are "code-like" reusable context units; usage exploding | 159–172 |
| Problem 1 — Security & governance | Malicious skills (30%+ in OpenClaw), negligent skills, vulnerable skills (API keys) | 173–204 |
| Problem 2 — Collaboration & reuse | Unicorn platform team story; quality testing gap; dependency management gap | 205–232 |
| Problem 3 — Lifecycle & rot | Skills rot like software; opportunity for automated optimization from agent logs | 233–262 |
| Solution: treat skills as code | Five tools: static analysis, dynamic tests, dependency mgmt, security, observability | 263–278 |
| Tool 1 — Static analysis | Inspect skills without executing; Tessl Review; quality scores | 279–298 |
| Tool 2 — Dynamic tests (evals) | Evals are the new tests; scenarios at varying scope; can't scale without them | 299–334 |
| Tool 3 — Dependency management | Skills compose ⇒ skills are dependencies; versioning, manifests, platform compatibility | 335–360 |
| Tool 4 — Security tooling | Static analysis, supply chain, red teaming; Snyk integration | 361–388 |
| Tool 5 — Observability | Mine agent logs and PRs for real scenarios, gaps, new skill opportunities | 389–410 |
| The CDLC | Generate → test → optimize → distribute → observe; humans own CDLC, agents own SDLC | 411–428 |
| Wrap-up & the Tessl agent | Vertical agent for skill development; local, pipeline, control center | 429–470 |
gh pr open unless a PR skill is loaded.evals
scenario-1
scenario-2
scenario-3
scenario-4