CtrlK
BlogDocsLog inGet started
Tessl Logo

honeybadge/virtui

automatically control and record tui application sessions from the terminal

93

4.45x
Quality

94%

Does it follow best practices?

Impact

89%

4.45x

Average score across 3 eval scenarios

SecuritybySnyk

Risky

Do not use without reviewing

Overview
Quality
Evals
Security
Files

SKILL.mdskills/virtui/

name:
virtui
description:
Use when an AI agent needs to programmatically drive, test, or record a terminal (TUI) application. Launches programs via pseudo-TTY, captures screen output, sends keystrokes and control sequences, and records sessions to asciicast format. Trigger on: launching interactive terminal programs, automating CLI workflows, testing TUI apps, recording terminal sessions, sending keystrokes to running programs, waiting for terminal output, or any task that requires controlling a terminal application non-interactively.

virtui — TUI Automation for AI Agents

Prerequisites

Before using virtui, ensure it is installed and the daemon is running.

Check / Install

# Check if virtui is available
virtui version

If not installed, install via Homebrew:

brew install honeybadge-labs/tap/virtui

Start the Daemon

The daemon must be running before any session commands. Always check first:

virtui --json daemon status
# → {"running":true,"socket":"..."} or {"running":false,"socket":"..."}

If not running:

virtui --json daemon start
# → {"pid":1234,"socket":"/Users/you/.virtui/daemon.sock"}

The daemon manages sessions over a Unix socket at ~/.virtui/daemon.sock.

Core Workflow

The typical workflow is: start daemon → run session → interact → screenshot → kill session.

1. Launch a Session

virtui --json run <command...>

This returns a session_id you'll use for all subsequent commands. Common options:

FlagDefaultPurpose
--cols80Terminal width
--rows24Terminal height
--dircwdWorking directory
-e KEY=VALEnvironment variables (repeatable)
--recordoffRecord session as asciicast v2
--record-pathautoCustom recording path

Example:

virtui --json run --cols 120 --rows 40 bash
# → {"session_id":"a1b2c3d4","pid":1234,"recording_path":""}

2. Execute Commands

exec is the primary interaction command. It types input, presses Enter, and optionally waits for a screen condition before returning.

Claude Code note: The Bash tool strips trailing newlines from command arguments, so exec may not actually send Enter when called from Claude Code. Prefer using type + press Enter as separate steps (or use a pipeline) to guarantee Enter is sent. See the Pipeline section for a concrete example.

Wait-condition caveat: Wait conditions check the screen immediately after input is sent. If the target text already appears on screen (e.g., in the typed command itself), the wait can resolve in 0 ms — before the command's actual output appears. For reliable results, use a pipeline with separate typepress Enterwait steps, or follow exec with a standalone wait command.

virtui --json exec <session_id> "<input>" [wait flags]

Wait strategies (use exactly one, or none to return immediately with current screen state):

FlagWhen to use
--wait "text"Wait for specific text to appear on screen
--wait-stableWait for screen to stop changing (500 ms of no updates — does not guarantee the process finished)
--wait-gone "text"Wait for text to disappear (e.g., a loading spinner)
--wait-regex "pattern"Wait for a regex match on screen
--timeout <ms>Override default 30s timeout (use with any wait flag)

Examples:

# Run a command and wait for its output
virtui --json exec $SID "echo hello" --wait "hello"

# Run a build and wait for screen to settle (500ms of no changes)
virtui --json exec $SID "make build" --wait-stable --timeout 60000

# Wait for a loading indicator to disappear
virtui --json exec $SID "npm install" --wait-gone "resolving" --timeout 120000

3. Take Screenshots

Read the current screen contents at any time:

virtui --json screenshot <session_id>
virtui screenshot <session_id> --no-color

Returns screen_text, screen_hash, screen_ansi, cursor position, and terminal dimensions. Use screen_hash for cheap change detection without comparing full screen text.

  • Use screen_text for plain text assertions and screen-diff logic.
  • Use screen_ansi when color, background color, bold, underline, or reverse video matter.
  • Use virtui screenshot --no-color <session_id> when you want plain text output in non-JSON mode without ANSI escape sequences.

4. Send Keystrokes

For interactive TUI applications (menus, editors, prompts), use press and type:

# Send special keys
virtui press <session_id> Enter
virtui press <session_id> Ctrl+C
virtui press <session_id> ArrowDown --repeat 5
virtui press <session_id> Escape q      # multiple keys in sequence

# Type text without pressing Enter (for search fields, partial input)
virtui type <session_id> "search query"

See references/keys-and-errors.md for the full list of available key names and error codes.

5. Wait for Conditions

Wait independently of exec (useful after press/type or for polling):

virtui --json wait <session_id> --text "Ready"
virtui --json wait <session_id> --stable
virtui --json wait <session_id> --gone "Loading..."
virtui --json wait <session_id> --regex "v\d+\.\d+"

6. Clean Up

Always kill sessions when done:

virtui kill <session_id>

List active sessions:

virtui --json sessions show
# → {"sessions":[{"session_id":"a1b2c3d4","pid":1234,"command":["bash"],"cols":80,"rows":24,"running":true,"exit_code":-1,"created_at":"1711900000","recording_path":""}]}

7. Resize a Session

Resize the terminal dimensions of a running session. Both --cols and --rows are required (unlike run, there are no defaults):

virtui resize <session_id> --cols 120 --rows 40

Pipeline (Batch Operations)

For complex multi-step interactions, use pipeline to send a batch of steps in one call.

Recommended for Claude Code: Because exec relies on Enter being sent and Claude Code's Bash tool can swallow trailing newlines, the most reliable pattern from Claude Code is to use a pipeline with explicit type + press Enter steps instead of exec.

Example: Running a command reliably from Claude Code

echo '{"steps":[
  {"type":{"text":"echo hello world"}},
  {"press":{"keys":["Enter"]}},
  {"wait":{"condition":{"text":"hello world"},"timeout_ms":5000}},
  {"screenshot":{}}
],"stop_on_error":true}' | virtui --json pipeline $SID

This is equivalent to virtui exec $SID "echo hello world" --wait "hello world" but guarantees Enter is actually sent regardless of how the shell tool handles newlines.

Pipeline from file

Instead of piping JSON via stdin, you can pass a file containing the steps:

virtui --json pipeline <session_id> --file steps.json

General pipeline example

echo '{"steps":[
  {"exec":{"input":"ls","wait":{"text":"README"}}},
  {"sleep":{"duration_ms":500}},
  {"screenshot":{}},
  {"press":{"keys":["Ctrl+C"]}}
],"stop_on_error":true}' | virtui --json pipeline <session_id>

Step types: exec, press, type, wait, screenshot, sleep.

Pipeline output format

The pipeline returns a JSON object with a results array. Each entry contains the step outcome and a screenshot of the screen state at that point:

{
  "results": [
    {
      "step_index": 0,
      "success": true,
      "error_message": "",
      "screenshot": {
        "screen_text": "...",
        "screen_hash": "...",
        "screen_ansi": "...",
        "cursor_row": 4,
        "cursor_col": 10,
        "cols": 80,
        "rows": 24
      }
    }
  ]
}

Recording Sessions

Record sessions as asciicast v2 (playable with asciinema play):

virtui --json run --record bash
# or with a custom path:
virtui --json run --record --record-path ./demo.cast bash

Recording stops when the session is killed or the process exits.

Note: --record-path requires --record to be set. Using --record-path alone will not enable recording.

Important Patterns

  • Always pass --json (or -j) for machine-readable output with session_id, screen_text, screen_hash, screen_ansi, etc.
  • JSON output uses proto3 JSON encoding: int64 fields (elapsed_ms, created_at) are serialized as strings.
  • Use screen_hash (SHA-256) for cheap change detection without comparing full screen text.
  • Prefer screen_text for text matching and screen_ansi only when TUI state depends on styling or color.
  • The default wait timeout is 30s. For long-running ops, increase it: --timeout 120000.
  • If a wait times out, take a screenshot to see current state and decide how to proceed.
  • Errors include code, category, message, retryable, suggestion — check retryable before retrying. See references/keys-and-errors.md for error codes and key names.

JSON Output Fields Reference

CommandKey Fields
runsession_id, pid, recording_path
execscreen_text, screen_hash, cursor_row, cursor_col, elapsed_ms
screenshotscreen_text, screen_hash, screen_ansi, cursor_row, cursor_col, cols, rows
pressscreen_text, screen_hash
typescreen_text, screen_hash
waitscreen_text, screen_hash, elapsed_ms
killok
resizeok
pipelineresults[]: step_index, success, error_message, screenshot (see Pipeline output format)
sessionssessions[]: session_id, pid, command, cols, rows, running, exit_code, created_at, recording_path
daemon startpid, socket (background) or socket (foreground)
daemon stopok, optionally message
daemon statusrunning, socket

skills

tile.json