Debug proof failures using counterexamples from Nitpick (Isabelle) or QuickChick (Coq) to identify specification errors, missing preconditions, and proof strategy issues. Use when: (1) A proof attempt fails and you need to understand why, (2) Counterexamples are generated by Nitpick or QuickChick, (3) Specifications may be incorrect or incomplete, (4) Theorems need validation before proving, (5) Missing preconditions or lemmas need identification, or (6) Proof failures need explanation and correction suggestions. Supports both Isabelle/HOL and Coq equally.
93
92%
Does it follow best practices?
Impact
93%
1.10xAverage score across 3 eval scenarios
Passed
No known issues
Missing precondition identification
Manual verification
100%
100%
Empty list pattern
75%
100%
Undefined hd behavior
30%
80%
Root cause: missing precondition
100%
100%
Corrected theorem syntax
100%
100%
Impact assessment
100%
100%
Step-by-step explanation
100%
100%
Retest recommendation
100%
100%
No counterexample caveat
75%
75%
Completeness check
100%
100%
Root cause not symptom
100%
100%
Duplicate element pattern and comparison operator fix
Manual computation
100%
100%
Duplicate element pattern
100%
100%
Strict vs non-strict inequality
100%
100%
Correct root cause category
100%
100%
Corrected definition
100%
100%
Before/after code
100%
100%
Shrinking interpretation
0%
0%
Retest recommendation
100%
100%
Success does not mean proof
37%
50%
Completeness: permutation missing
0%
100%
Step-by-step violation
100%
100%
Quantifier order and incomplete specification
Quantifier root cause (A)
100%
100%
∃∀ vs ∀∃ logical difference (A)
80%
100%
Corrected quantifier theorem (A)
100%
100%
Witness for corrected theorem (A)
100%
100%
Theorem A is fundamentally wrong (A)
100%
100%
Manual trace for [1, 2, 0] (B)
100%
100%
sorted [1, 0, 2] is False (B)
100%
100%
Implementation bug root cause (B)
100%
100%
Incomplete spec note (B)
75%
100%
Strengthened spec (B)
62%
100%
Retest recommendation
62%
100%
No counterexample caveat
50%
62%
0f00a4f
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.