Plan and run usability tests on existing or prototype designs including test design, task scripts, moderation, observation, and findings synthesis. Use this skill whenever the user wants to test usability, run a moderated test, run an unmoderated test, validate a design, find usability issues, or improve task completion. Triggers on usability test, usability testing, moderated test, unmoderated test, task script, think aloud, prototype testing, user testing, design validation, task completion. Also triggers when the user has built something and wants to know if real users can use it before shipping.
72
88%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Plan and run tests that find usability problems before users hit them in production. Stack-agnostic. Tool-agnostic.
This skill is for testing existing designs or prototypes. For broader discovery research, use ux-research. For conversion testing in production, use cro-optimization.
ux-research)cro-optimization)journey-mapping)analytics-strategy)Don't test the whole product. Test specific tasks.
Task selection criteria:
Examples of testable tasks:
"You want to find a contractor near you who can install a fence. Show me how you'd do that on this site."
"You're a first-time visitor. You want to understand if this product fits your needs. Walk me through how you'd evaluate it."
"Your team needs a new tool to manage projects. Use this site to figure out which plan is right for a 12-person team."
Task framing rules:
Moderated (live, with researcher):
Unmoderated (recorded, asynchronous):
For most teams: moderated for early/critical decisions, unmoderated for ongoing validation.
Target audience - not just convenience.
Recruit criteria:
Sample size:
Pre-task setup:
Moderated session structure:
Moderation principles:
Anti-patterns:
Patterns across participants are signal. Single-participant complaints are weaker (but worth investigating).
Synthesis steps:
Report structure:
# Usability Test: [Design / flow]
## Summary
[2 to 3 paragraphs covering: what was tested, headline findings, top 3 priorities]
## Method
[Moderated/unmoderated, sample size, audience, dates, tasks]
## Critical findings
[Each with description, frequency, supporting evidence (quotes/clips), recommendation]
## Major findings
[Same structure]
## Minor findings
[Brief]
## Cosmetic findings
[Briefest]
## What worked well
[Calibration: capture successes too]
## Recommendations
[Prioritized list with effort estimates]
## Next steps
[Test re-run schedule, design iteration plan]Default outputs:
usability-test-plan-[topic].mdusability-tasks-[topic].mdusability-findings-[topic].mdreferences/task-script-patterns.md - Task framing patterns by common product type, with good and bad examples.8e70d03
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.