antithesis-workload

Implement Antithesis workloads by turning the property catalog into SDK assertions and test commands, then refine coverage after triage.

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is highly actionable and well-structured with clear workflows, validation checkpoints, and well-signaled one-level-deep references, with only mild verbosity in the recommendation-strategy prose.

Suggestions

Tighten the "Present and recommend" and "Scoping" sections — the strategy rationale (e.g. "Simple properties build momentum...") can be condensed without losing the decision rule.

Reference references/multi-test-directories.md from the body or the Reference Files table, since it exists as a bundle file but is not currently discoverable from the skill.

Dimension	Reasoning	Score
Conciseness	Most content is novel domain guidance Claude would not know (paths, prefixes, assertion types), but the "Present and recommend" and "Scoping" strategy prose is wordy and could be tightened.	2 / 3
Actionability	Provides concrete executable guidance throughout: exact paths (antithesis/test/, /opt/antithesis/test/v1/{name}/), enumerated command prefixes, assertion types, and specific commands (snouty validate, snouty launch, snouty docs).	3 / 3
Workflow Clarity	Two clearly numbered workflows with "DO NOT PROCEED" prerequisite gates, a snouty validate checkpoint, triage-to-iteration feedback loop, and a Self-Review checklist.	3 / 3
Progressive Disclosure	A clear Reference Files table maps each reference to "When to read", references are one level deep, all referenced files exist, and the body acts as an overview pointing into them.	3 / 3
	Total	11 / 12 Passed

Description

82%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, distinct, and uses natural domain trigger terms, but it omits an explicit "Use when..." clause, so it answers "what" well while leaving "when" only implied.

Suggestions

Add an explicit "Use when..." clause, e.g. "Use when implementing or refining Antithesis workloads, writing SDK assertions from the property catalog, or acting on triage findings."

Include a few common user phrasings (e.g. "Antithesis assertions", "property-based tests") to broaden natural trigger coverage.

Dimension	Reasoning	Score
Specificity	Lists multiple concrete actions — "turning the property catalog into SDK assertions and test commands, then refine coverage after triage" — rather than vague language.	3 / 3
Completeness	It clearly states what the skill does but lacks a "Use when..." clause or equivalent explicit trigger guidance, which caps completeness at 2 per the rubric guidelines.	2 / 3
Trigger Term Quality	Natural domain terms a user would say are well covered: "Antithesis workloads", "property catalog", "SDK assertions", "test commands", and "triage".	3 / 3
Distinctiveness Conflict Risk	"Implement Antithesis workloads" is a clear niche with distinct triggers and domain-specific terminology, making conflict with other skills unlikely.	3 / 3
	Total	11 / 12 Passed

Validation

93%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 15 / 16 Passed

Validation for skill structure

Criteria	Description	Result
referenced_paths_exist	Referenced path issues: 1 missing	Warning

	Total	15 / 16 Passed

Repository: antithesishq/antithesis-skills
Commit: 9b75328

Reviewed: about 9 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.