Skill | Added | Review |
|---|---|---|
`write-a-skill`: Create new agent skills with proper structure, progressive disclosure, and bundled resources. Use when user wants to create, write, or build a new skill. | | 97. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`setup-pre-commit`: Set up Husky pre-commit hooks with lint-staged (Prettier), type checking, and tests in the current repo. Use when user wants to add pre-commit hooks, set up Husky, configure lint-staged, or add commit-time formatting/typechecking/testing. | | 100. Impact: 1.33x agent success vs baseline, 100% average score across 3 eval scenarios. Security: passed, no known issues. Reviewed version: 733d312 |
`scaffold-exercises`: Create exercise directory structures with sections, problems, solutions, and explainers that pass linting. Use when user wants to scaffold exercises, create exercise stubs, or set up a new course section. | | 97. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`grill-me`: Interview the user relentlessly about a plan or design until reaching shared understanding, resolving each branch of the decision tree. Use when user wants to stress-test a plan, get grilled on their design, or mentions "grill me". | | 85. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`edit-article`: Edit and improve articles by restructuring sections, improving clarity, and tightening prose. Use when user wants to edit, revise, or improve an article draft. | | 65. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`migrate-to-shoehorn`: Migrate test files from `as` type assertions to @total-typescript/shoehorn. Use when user mentions shoehorn, wants to replace `as` in tests, or needs partial test data. | | 87. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`tdd`: Test-driven development with red-green-refactor loop. Use when user wants to build features or fix bugs using TDD, mentions "red-green-refactor", wants integration tests, or asks for test-first development. | | 78. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`git-guardrails-claude-code`: Set up Claude Code hooks to block dangerous git commands (push, reset --hard, clean, branch -D, etc.) before they execute. Use when user wants to prevent destructive git operations, add git safety hooks, or block git push/reset in Claude Code. | | 34. Impact: 1.19x agent success vs baseline, 100% average score across 3 eval scenarios. Security: passed, no known issues. Reviewed version: 733d312 |
`diagnose`: Disciplined diagnosis loop for hard bugs and performance regressions. Reproduce → minimise → hypothesise → instrument → fix → regression-test. Use when user says "diagnose this" / "debug this", reports a bug, says something is broken/throwing/failing, or describes a performance regression. | | 97. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`improve-codebase-architecture`: Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable. | | 80. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`obsidian-vault`: Search, create, and manage notes in the Obsidian vault with wikilinks and index notes. Use when user wants to find, create, or organize notes in Obsidian. | | 95. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`request-refactor-plan`: Create a detailed refactor plan with tiny commits via user interview, then file it as a GitHub issue. Use when user wants to plan a refactor, create a refactoring RFC, or break a refactor into safe incremental steps. | | 82. Impact: 3.20x agent success vs baseline, 80% average score across 3 eval scenarios. Security: passed, no known issues. Reviewed version: 733d312 |
`to-prd`: Turn the current conversation context into a PRD and publish it to the project issue tracker. Use when user wants to create a PRD from the current context. | | 67. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`grill-with-docs`: Grilling session that challenges your plan against the existing domain model, sharpens terminology, and updates documentation (CONTEXT.md, ADRs) inline as decisions crystallise. Use when user wants to stress-test a plan against their project's language and documented decisions. | | 85. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`zoom-out`: Tell the agent to zoom out and give broader context or a higher-level perspective. Use when you're unfamiliar with a section of code or need to understand how it fits into the bigger picture. | | 48. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`caveman`: Ultra-compressed communication mode. Cuts token usage ~75% by dropping filler, articles, and pleasantries while keeping full technical accuracy. Use when user says "caveman mode", "talk like caveman", "use caveman", "less tokens", "be brief", or invokes /caveman. | | 100. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`to-issues`: Break a plan, spec, or PRD into independently-grabbable issues on the project issue tracker using tracer-bullet vertical slices. Use when user wants to convert a plan into issues, create implementation tickets, or break down work into issues. | | 97. Impact: no eval scenarios have been run. Security: advisory, suggest reviewing before use. Reviewed version: 733d312 |
`ubiquitous-language`: Extract a DDD-style ubiquitous language glossary from the current conversation, flagging ambiguities and proposing canonical terms. Saves to UBIQUITOUS_LANGUAGE.md. Use when user wants to define domain terms, build a glossary, harden terminology, create a ubiquitous language, or mentions "domain model" or "DDD". | | 93. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`design-an-interface`: Generate multiple radically different interface designs for a module using parallel sub-agents. Use when user wants to design an API, explore interface options, compare module shapes, or mentions "design it twice". | | 88. Impact: 1.36x agent success vs baseline, 98% average score across 3 eval scenarios. Security: passed, no known issues. Reviewed version: 733d312 |
`qa`: Interactive QA session where user reports bugs or issues conversationally, and the agent files GitHub issues. Explores the codebase in the background for context and domain language. Use when user wants to report bugs, do QA, file issues conversationally, or mentions "QA session". | | 90. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 733d312 |
`prototype`: Build a throwaway prototype to flush out a design before committing to it. Routes between two branches: a runnable terminal app for state/business-logic questions, or several radically different UI variations toggleable from one route. Use when the user wants to prototype, sanity-check a data model or state machine, mock up a UI, explore design options, or says "prototype this", "let me play with it", "try a few designs". | | 90. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 7014111 |
`triage`: Triage issues through a state machine driven by triage roles. Use when user wants to create an issue, triage issues, review incoming bugs or feature requests, prepare issues for an AFK agent, or manage issue workflow. | | 90. Impact: no eval scenarios have been run. Security: advisory, suggest reviewing before use. Reviewed version: 7014111 |
`setup-matt-pocock-skills`: Sets up an `## Agent skills` block in AGENTS.md/CLAUDE.md and `docs/agents/` so the engineering skills know this repo's issue tracker (GitHub or local markdown), triage label vocabulary, and domain doc layout. Run before first use of `to-issues`, `to-prd`, `triage`, `diagnose`, `tdd`, `improve-codebase-architecture`, or `zoom-out`, or if those skills appear to be missing context about the issue tracker, triage labels, or domain docs. | | 93. Impact: no eval scenarios have been run. Security: advisory, suggest reviewing before use. Reviewed version: 7014111 |
`writing-fragments`: Grilling session that mines the user for fragments, heterogeneous nuggets of writing (claims, vignettes, sharp sentences, half-thoughts), and appends them to a single document as raw material for a future article. Use when the user wants to develop ideas before imposing structure, or mentions "fragments", "ideate", or "raw material" for writing. | | 90. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 494e4b2 |
`handoff`: Compact the current conversation into a handoff document for another agent to pick up. | | 61. Impact: no eval scenarios have been run. Security: passed, no known issues. Reviewed version: 494e4b2 |