Automatically generates executable test cases from model checking counterexample traces. Translates abstract counterexample states and transitions into concrete test inputs, execution steps, and assertions that reproduce property violations. Use when you are working with model checker output (SPIN, CBMC, NuSMV, TLA+, Java PathFinder, etc.) and need to create regression tests, validate bug fixes, or reproduce verification failures in an executable test suite.
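As a rough illustration of the translation step, the sketch below parses a NuSMV-style trace into concrete per-state values that a test could replay. The trace text, the variable names, and the helpers `parse_trace` and `to_python` are invented for this example and are not the skill's actual implementation. The only behavior it relies on is that NuSMV prints just the variables that changed in each state, so earlier values have to be carried forward.

```python
import re

# Hypothetical NuSMV-style trace; the model and its variables are invented.
TRACE = """\
-> State: 1.1 <-
  request = TRUE
  state = idle
-> State: 1.2 <-
  state = busy
-> State: 1.3 <-
  request = FALSE
"""

STATE_HEADER = re.compile(r"^-> State: (\S+) <-$")
ASSIGNMENT = re.compile(r"^(\S+) = (.+)$")


def to_python(value):
    """Map trace literals to Python values; symbolic constants stay strings."""
    if value == "TRUE":
        return True
    if value == "FALSE":
        return False
    if re.fullmatch(r"-?\d+", value):
        return int(value)
    return value


def parse_trace(text):
    """Return a list of (state_label, full_state_dict) pairs.

    Each state only lists changed variables, so the previous values are
    carried forward to rebuild the complete state.
    """
    states = []
    current = {}
    label = None
    for raw in text.splitlines():
        line = raw.strip()
        header = STATE_HEADER.match(line)
        if header:
            if label is not None:
                states.append((label, dict(current)))
            label = header.group(1)
            continue
        assignment = ASSIGNMENT.match(line)
        if assignment and label is not None:
            current[assignment.group(1)] = to_python(assignment.group(2))
    if label is not None:
        states.append((label, dict(current)))
    return states


for state_label, values in parse_trace(TRACE):
    print(state_label, values)
```

Keeping the state label next to each reconstructed state is what later allows a generated test to carry traceability comments back to the original trace.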
90
Does it follow best practices? 85%
Impact: 99% average score across 3 eval scenarios (1.16x)
Passed: no known issues
NuSMV counterexample to Python test
pytest framework used: 25%, 100%
Setup phase present: 87%, 100%
Execution phase present: 87%, 100%
Assertion phase present: 87%, 100%
Traceability comments: 100%, 100%
Concrete value mapping: 100%, 100%
Property violation comment: 100%, 100%
Mapping document produced: 100%, 100%
Execution instructions provided: 100%, 100%
Expected behavior documented: 100%, 100%
Model checker format identified: 100%, 100%
Non-default values documented: 100%, 100%
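To make the criteria in this scenario concrete, here is a minimal sketch of what a test with separate setup, execution, and assertion phases, traceability comments, and a property comment might look like. The `Counter` class, the property `G (count <= 2)`, and the state labels are all hypothetical stand-ins; they are not taken from the eval scenario or from the skill's real output.

```python
class Counter:
    """Hypothetical system under test: a counter meant to saturate at 2."""

    def __init__(self, count=0, saturate=True):
        self.count = count
        self.saturate = saturate  # False models the unfixed implementation

    def step(self, inc):
        if inc and not (self.saturate and self.count >= 2):
            self.count += 1


def replay_counterexample(counter):
    # Property under test (hypothetical): G (count <= 2)
    # Setup: trace state 1.1 (count = 0) is supplied by the caller.
    # Execution: trace states 1.2 to 1.4, with inc = TRUE on every transition.
    for label in ("1.2", "1.3", "1.4"):
        counter.step(inc=True)
        # Assertion: the invariant must hold in every replayed state.
        assert counter.count <= 2, f"count = {counter.count} in state {label}"


def test_counterexample_count_bound():
    # Expected behavior: passes with saturation in place; an implementation
    # without it violates the property at trace state 1.4.
    replay_counterexample(Counter(count=0, saturate=True))
```

Because the function name starts with `test_`, pytest would collect it under its default discovery rules when the file name matches `test_*.py`. The `saturate` flag exists only so the same replay can be shown against both a fixed and an unfixed variant.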
CBMC counterexample to C++ Google Test
Google Test framework: 0%, 100%
Setup phase initializes CE state: 87%, 100%
Execution phase replays steps: 87%, 100%
Assertion phase targets violation: 75%, 100%
Traceability comments: 80%, 100%
Concrete values used: 100%, 100%
Minimal complexity: 62%, 62%
Property context comment: 87%, 100%
Mapping document produced: 88%, 100%
Execution instructions provided: 50%, 100%
Expected failure described: 87%, 100%
Model checker identified: 100%, 100%
JPF non-determinism and traceability
JUnit framework used: 0%, 100%
Non-determinism documented: 83%, 100%
Setup phase initializes CE state: 100%, 100%
Execution phase replays transitions: 100%, 100%
Assertion phase targets violation: 100%, 100%
JPF traceability comments: 90%, 100%
Concrete values used: 100%, 100%
Property context comment: 100%, 100%
Mapping document produced: 100%, 100%
Execution instructions provided: 100%, 100%
Expected failure described: 100%, 100%
Model checker identified: 100%, 100%
0f00a4f