CtrlK
BlogDocsLog inGet started
Tessl Logo

antithesis-workload

Implement Antithesis workloads by turning the property catalog into SDK assertions and test commands, then refine coverage after triage.

48

Quality

51%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./antithesis-workload/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

62%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured orchestration skill with clear workflows, good scoping logic, and thorough validation criteria. Its main weaknesses are verbosity (especially the inline self-review checklist) and lack of concrete executable examples — it delegates all implementation specifics to reference files without providing even minimal inline code snippets to anchor understanding. The workflow clarity is strong with explicit decision points and validation steps.

Suggestions

Add at least one inline code example showing a concrete SDK assertion call (e.g., an Always assertion for a safety property) so the skill is actionable without reading reference files first.

Move the self-review checklist to a reference file (e.g., references/self-review-checklist.md) and reference it from the main skill to reduce inline verbosity.

Add a minimal concrete example of a test command file structure (filename with valid prefix, basic skeleton) to make the test command guidance actionable without requiring references/test-commands.md.

DimensionReasoningScore

Conciseness

The skill is thorough but verbose in places — the self-review checklist alone is ~30 lines of detailed criteria that could be in a reference file. The scoping section explains recommendation strategy at length. However, most content is genuinely instructive rather than explaining things Claude already knows, so it's not a 1.

2 / 3

Actionability

The skill provides clear workflows and specific file paths, but lacks executable code examples — no SDK assertion code snippets, no sample test command implementations. It relies heavily on external reference files (references/assertions.md, references/test-commands.md, etc.) for the concrete details, making the skill itself more of a coordination document than a hands-on guide.

2 / 3

Workflow Clarity

The workflows are clearly sequenced with explicit steps, validation checkpoints (self-review criteria, snouty validate), and feedback loops (post-triage iteration cycle). The scoping section provides clear decision trees for different user scenarios. The self-review section serves as a comprehensive validation checklist before completion.

3 / 3

Progressive Disclosure

The skill has a clear reference table pointing to one-level-deep reference files and external documentation, which is good structure. However, the self-review checklist and detailed scoping guidance are inlined when they could be in reference files, making the main document longer than necessary. No bundle files were provided to verify the referenced paths exist.

2 / 3

Total

9

/

12

Passed

Description

40%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description targets a very specific niche (Antithesis workloads) which gives it strong distinctiveness, but it lacks a 'Use when...' clause entirely, making it incomplete for skill selection purposes. The actions described are somewhat abstract and the trigger terms are overly technical, missing natural language variations a user might employ.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when the user asks about implementing Antithesis workloads, writing property-based tests for Antithesis, or converting a property catalog into test assertions.'

Include more natural trigger terms users might say, such as 'Antithesis testing', 'deterministic simulation testing', 'fault injection workloads', or 'property-based testing'.

Make the actions more concrete by specifying which SDK (e.g., Antithesis SDK), what types of assertions (e.g., 'always/sometimes assertions'), and what 'refine coverage after triage' means in practice.

DimensionReasoningScore

Specificity

The description names a specific domain (Antithesis workloads) and mentions some actions ('turning the property catalog into SDK assertions and test commands', 'refine coverage after triage'), but these are somewhat abstract and not fully concrete—e.g., what SDK, what kind of assertions, what does 'refine coverage' entail specifically?

2 / 3

Completeness

The description addresses 'what' (implement Antithesis workloads by converting property catalog to SDK assertions) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and the 'when' is not even implied clearly, so it scores 1.

1 / 3

Trigger Term Quality

Includes domain-specific terms like 'Antithesis', 'workloads', 'property catalog', 'SDK assertions', and 'triage', which are relevant but fairly technical. Missing more natural user-facing trigger terms a user might say, such as 'testing', 'fault injection', 'deterministic testing', or 'Antithesis setup'.

2 / 3

Distinctiveness Conflict Risk

The description is highly specific to the Antithesis testing platform and its property catalog workflow, making it very unlikely to conflict with other skills. The niche is clearly defined.

3 / 3

Total

8

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
antithesishq/antithesis-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.