github.com/coder/agent-tty
| Skill | Description | Review | Impact | Security | Version |
|---|---|---|---|---|---|
| agent-tty | Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | 76 | No eval scenarios have been run | Advisory (suggest reviewing before use) | a05d4e5 |
| to-issues | Break a plan, spec, or PRD into independently-grabbable issues on the project issue tracker using tracer-bullet vertical slices. Use when user wants to convert a plan into issues, create implementation tickets, or break down work into issues. | 94 | 83% average score across 3 eval scenarios; 1.48x agent success vs baseline | Passed (no known issues) | a05d4e5 |
| agent-terminal | Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | 72 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| eval-guide | Guide for running statistically meaningful agent-tty evals with trials, parallelism, and A/B comparison. Covers non-determinism baseline, recommended sample sizes, and result interpretation. | 53 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| triage | Triage issues through a state machine driven by triage roles. Use when user wants to create an issue, triage issues, review incoming bugs or feature requests, prepare issues for an AFK agent, or manage issue workflow. | 90 | 94% average score across 3 eval scenarios; 1.42x agent success vs baseline | Advisory (suggest reviewing before use) | a05d4e5 |
| agent-tty | Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | 68 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| release-maintainer | Internal maintainer SOP for version bumps, release PRs, tagging, publishing, and post-publish verification in this repository. | 48 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| dogfood-tui | Structured TUI dogfooding and QA workflow using agent-tty. Use for exploratory testing, bug hunting, release-readiness validation, and UX review of terminal applications. | 64 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| tdd | Test-driven development with red-green-refactor loop. Use when user wants to build features or fix bugs using TDD, mentions "red-green-refactor", wants integration tests, or asks for test-first development. | 62 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| grill-with-docs | Grilling session that challenges your plan against the existing domain model, sharpens terminology, and updates documentation (CONTEXT.md, ADRs) inline as decisions crystallise. Use when user wants to stress-test a plan against their project's language and documented decisions. | 68 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| diagnose | Disciplined diagnosis loop for hard bugs and performance regressions. Reproduce → minimise → hypothesise → instrument → fix → regression-test. Use when user says "diagnose this" / "debug this", reports a bug, says something is broken/throwing/failing, or describes a performance regression. | 90 | 90% average score across 3 eval scenarios; 0.97x agent success vs baseline | Passed (no known issues) | a05d4e5 |
| improve-codebase-architecture | Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable. | 60 | No eval scenarios have been run | Passed (no known issues) | a05d4e5 |
| to-prd | Turn the current conversation context into a PRD and publish it to the project issue tracker. Use when user wants to create a PRD from the current context. | 73 | 98% average score across 3 eval scenarios; 1.50x agent success vs baseline | Passed (no known issues) | a05d4e5 |