Implement Antithesis workloads by turning the property catalog into SDK assertions and test commands, then refine coverage after triage.
64
55%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./antithesis-workload/SKILL.mdImplement or improve the Antithesis workload. Success means:
antithesis-research skill are mapped to concrete assertions, using both the property catalog and per-property evidence filesantithesis/test/ and exercise the right behaviorsUse the antithesis-research skill first to build the property catalog. Use the antithesis-setup skill to scaffold the infrastructure. Use the antithesis-triage skill to review runs, then return here to improve the workload. If the user asks to submit or launch a run, use the antithesis-launch skill — do not run snouty run directly.
antithesis/scratchbook/) doesn't exist. Use the antithesis-research skill to create it.docker-compose.yaml for Antithesis present. Use the antithesis-setup skill to create it.snouty is not installed. See https://raw.githubusercontent.com/antithesishq/snouty/refs/heads/main/README.md for installation options.Each invocation of the "Implement next property" workflow below focuses on one property to keep context manageable and quality high. (Post-triage iteration follows its own scoping based on triage findings.)
If the user asks for multiple properties, recommend doing one at a time — explain that implementation quality degrades as context accumulates, and each property's effort is unpredictable. Ask which one they'd like to start with. If they insist on multiple, proceed — but warn them first.
If the user specifies which property to work on, skip the full catalog scan — but still assess that property's status against its evidence file before proceeding. If it's already fully implemented, tell the user rather than redoing work.
If the user does not specify a property, run the full detection and recommendation flow below.
The detection work below is context-heavy (reading every evidence file, scanning the codebase for assertions). If your agent supports sub-agents, delegate it to a sub-agent that returns a per-property summary (status + brief rationale). This keeps the main implementation agent's context clean.
The detection task: for each property in the catalog, search the existing test and SUT code for Antithesis SDK assertion calls and cross-reference them with the property's evidence file at antithesis/scratchbook/properties/{slug}.md. Assess whether the existing assertions cover the code paths, failure scenarios, and instrumentation points the evidence file describes. Classify each property as:
Note the catalog's provenance frontmatter (commit and updated fields) and include it when presenting status — e.g., "The property catalog is up-to-date as of <commit short hash> (<date>)." This lets the user judge whether the catalog reflects the current codebase or needs re-research.
Show the user the status of each property, then recommend one to implement next. Prefer partially-implemented properties that need completion, then unimplemented properties that cluster with recently implemented ones (see antithesis/scratchbook/property-relationships.md), then other high-priority unimplemented properties. Wait for the user to confirm or choose differently before proceeding.
For the chosen property, read both the catalog entry and its evidence file.
Ask the user only for blockers or scoping decisions you cannot infer safely, such as:
antithesis/scratchbook/property-catalog.md/opt/antithesis/test/v1/{name}/. Each timeline runs commands from one test template. Files or subdirectories prefixed with helper_ are ignored by Antithesis, so use that prefix for helper scripts kept alongside commands.parallel_driver_, singleton_driver_, serial_driver_, first_, eventually_, finally_, anytime_.Always / AlwaysOrUnreachable: Assertions for safety and correctness properties.Sometimes(cond): Assertions for liveness or non-trivial semantic states that should occur at least once.Reachable / Unreachable: Assertions about whether meaningful outcomes or forbidden paths are exercised.Use the antithesis-documentation skill to access these pages. Prefer snouty docs.
https://antithesis.com/docs/test_templates/test_composer_reference.mdhttps://antithesis.com/docs/using_antithesis/sdk.mdhttps://antithesis.com/docs/properties_assertions/assertions.mdhttps://antithesis.com/docs/environment/fault_injection.md| Reference | When to read |
|---|---|
references/component-implementation.md | Implementing workload-side components or wrappers |
references/assertions.md | Turning properties into SDK assertions |
references/test-commands.md | Writing commands and organizing test templates |
references/iteration.md | Improving coverage and assertions after triage |
references/component-implementation.mdreferences/assertions.mdreferences/test-commands.mdreferences/iteration.mdreferences/assertions.md if assertions need to changereferences/test-commands.md if command coverage needs to changeantithesis-setup has already made the system runnable in a mostly idle state; this skill owns what the workload does once the system is up.antithesis-setup has already installed the relevant SDK and added one minimal bootstrap assertion in the SUT. This skill owns the broader property catalog beyond that initial integration check.antithesis/test/antithesis/scratchbook/property-catalog.md when the implemented properties changeBefore declaring this skill complete, review your work against the criteria below. If your agent supports spawning sub-agents, create a new agent with fresh context to perform this review — give it the path to this skill file and have it read all output artifacts. A fresh-context reviewer catches blind spots that in-context review misses. If your agent does not support sub-agents, perform the review yourself: re-read the success criteria at the top of this file, then systematically check each item below against your actual output.
Review criteria:
Always/AlwaysOrUnreachable for safety, Sometimes(cond) for liveness or meaningful semantic state, Reachable/Unreachable for path and outcome checks)Sometimes(true, ...) assertions should be rewritten as Reachable(...).Reachable(...) markers are attached to distinct outcomes or branch results, not redundant early path-entry locations on the same straight-line flowantithesis/test/ and use valid prefixes (parallel_driver_, singleton_driver_, serial_driver_, first_, eventually_, finally_, anytime_)setup_complete is emitted before test commands begin/opt/antithesis/test/v1/{name}/ in the containerhelper_ so Antithesis ignores themantithesis/scratchbook/property-catalog.md is updated to reflect the implementation status of every property in scope, with provenance frontmatter (commit and updated) reflecting the current codebase statesnouty validate on antithesis/config to ensure that the compose setup can reach setup complete and any configured test-templates work. Make sure to build the latest images before running validate.a851a75
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.