Writes rigorous mathematical proofs for ML/AI theory. Use when asked to prove a theorem, lemma, proposition, or corollary, fill in missing proof steps, formalize a proof sketch, 补全证明, 写证明, 证明某个命题, or determine whether a claimed proof can actually be completed under the stated assumptions.
71
88%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly defines its scope (mathematical proofs for ML/AI theory), lists multiple concrete actions, and provides comprehensive trigger terms including multilingual variants. The explicit 'Use when...' clause with diverse trigger scenarios makes it highly actionable for skill selection. The description is concise yet thorough, with minimal risk of conflicting with other skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'prove a theorem, lemma, proposition, or corollary', 'fill in missing proof steps', 'formalize a proof sketch', and 'determine whether a claimed proof can actually be completed under the stated assumptions'. These are distinct, well-defined tasks. | 3 / 3 |
Completeness | Clearly answers both 'what' (writes rigorous mathematical proofs for ML/AI theory) and 'when' with an explicit 'Use when...' clause listing multiple specific trigger scenarios. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms including 'prove a theorem', 'lemma', 'proposition', 'corollary', 'proof steps', 'proof sketch', and even Chinese-language equivalents ('补全证明', '写证明', '证明某个命题'). These are terms users would naturally use when requesting proof assistance. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive — scoped specifically to mathematical proofs in the ML/AI theory domain, with very specific trigger terms like 'prove a theorem', 'formalize a proof sketch', and 'fill in missing proof steps'. Unlikely to conflict with general math, coding, or writing skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a strong, well-designed skill for mathematical proof writing that excels in actionability and workflow clarity. The multi-step process with feasibility triage before proof writing and final verification afterward creates robust guardrails against fabricated proofs. The main weakness is moderate verbosity—some sections could be tightened, and the content could benefit from splitting detailed templates into separate reference files.
Suggestions
Tighten Steps 1 and 2 by merging overlapping extraction/normalization checklists to reduce redundancy.
Consider extracting the 'Required File Structure' template into a separate PROOF_TEMPLATE.md file referenced from the main skill.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably well-structured but contains some redundancy (e.g., the extraction checklist in Step 1 and Step 2 overlap significantly, and some instructions like 'never write math in plain text' are repeated implicitly). Several points explain things Claude already knows about proof methodology, though the domain-specific constraints (e.g., never use 'clearly' or 'obviously') add genuine value. | 2 / 3 |
Actionability | The skill provides highly concrete, specific guidance: exact file structure template, explicit status categories, detailed checklists for verification, specific banned phrases, and clear output format. While there's no executable code (appropriate for a proof-writing skill), every instruction is actionable and unambiguous. | 3 / 3 |
Workflow Clarity | The 6-step workflow is clearly sequenced with explicit validation checkpoints (Step 3 feasibility triage before writing, Step 6 final verification with specific checks). The feedback loop is present: if a step can't be justified, downgrade status and write a blockage report instead. The three output modes handle all possible outcomes clearly. | 3 / 3 |
Progressive Disclosure | The content is well-organized with clear sections and headers, but it's a long monolithic document with no references to supporting files. The detailed file structure template, output modes, and key rules could potentially be split into separate reference files. However, given no bundle files exist, the inline approach is somewhat justified. | 2 / 3 |
Total | 10 / 12 Passed |
Validation
81%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
allowed_tools_field | 'allowed-tools' contains unusual tool name(s) | Warning |
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 9 / 11 Passed | |
a425a71
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.