github.com/coder/agent-tty
| Skill | Added | Review |
|---|---|---|
| **agent-tty**: Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | | Score: 79. Impact: no eval scenarios have been run. Security: Advisory (suggest reviewing before use). Version: fae02cb |
| **to-issues**: Break a plan, spec, or PRD into independently-grabbable issues on the project issue tracker using tracer-bullet vertical slices. | | Score: 66. Impact: 80% (0.95x agent success vs baseline; average score across 1 eval scenario). Security: Advisory (suggest reviewing before use). Version: fae02cb |
| **agent-terminal**: Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | | Score: 74. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **eval-guide**: Guide for running statistically meaningful agent-tty evals with trials, parallelism, and A/B comparison. Covers non-determinism baseline, recommended sample sizes, and result interpretation. | | Score: 62. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **agent-tty**: Terminal and TUI automation CLI for AI agents. Use when the user needs to create a terminal session, run a command in a terminal, automate an interactive CLI or TUI, wait for terminal output, capture a TUI screenshot, export a terminal recording, or test a CLI workflow with reviewable artifacts. | | Score: 79. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **triage**: Move issues and external PRs through a state machine of triage roles: categorise, verify, grill if needed, and write agent-ready briefs. | | Score: 76. Impact: 100% (1.42x agent success vs baseline; average score across 2 eval scenarios). Security: Advisory (suggest reviewing before use). Version: fae02cb |
| **release-maintainer**: Internal maintainer SOP for version bumps, release PRs, tagging, publishing, and post-publish verification in this repository. | | Score: 57. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **dogfood-tui**: Structured TUI dogfooding and QA workflow using agent-tty. Use for exploratory testing, bug hunting, release-readiness validation, and UX review of terminal applications. | | Score: 79. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **tdd**: Test-driven development. Use when the user wants to build features or fix bugs test-first, mentions "red-green-refactor", or wants integration tests. | | Score: 68. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **grill-with-docs**: A relentless interview to sharpen a plan or design, which also creates docs (ADRs and glossary) as we go. | | Score: 59. Impact: no eval scenarios have been run. Security: Passed (no known issues). Version: fae02cb |
| **diagnose**: Disciplined diagnosis loop for hard bugs and performance regressions. Reproduce → minimise → hypothesise → instrument → fix → regression-test. Use when user says "diagnose this" / "debug this", reports a bug, says something is broken/throwing/failing, or describes a performance regression. | | Score: 98. Impact: 90% (0.97x agent success vs baseline; average score across 3 eval scenarios). Security: Passed (no known issues). Version: fae02cb |
| **improve-codebase-architecture**: Scan a codebase for deepening opportunities, present them as a visual HTML report, then grill through whichever one you pick. | | Score: 57. Impact: no eval scenarios have been run. Security: Advisory (suggest reviewing before use). Version: fae02cb |
| **to-prd**: Turn the current conversation into a PRD and publish it to the project issue tracker; no interview, just synthesis of what you've already discussed. | | Score: 75. Impact: 0% (0.00x agent success vs baseline; average score across 1 eval scenario). Security: Passed (no known issues). Version: fae02cb |