Methodology for categorizing changes, assessing risks, and creating summaries from any changeset. Triggers: diff analysis, changeset review, risk assessment, change categorization, semantic analysis, release preparation, change summary, git diff Use when: analyzing specific changesets, assessing risk of changes, preparing release notes, categorizing changes by type and impact DO NOT use when: quick context catchup - use catchup instead. DO NOT use when: full PR review - use review-core with pensive skills. Use this skill for systematic change analysis with risk scoring.
93
Quality
87%
Does it follow best practices?
Impact
98%
1.05xAverage score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
89%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a well-structured skill description with strong completeness and distinctiveness. The explicit 'Use when' and 'DO NOT use when' clauses effectively guide skill selection. The main weakness is that the core capabilities could be more concrete - describing specific outputs or methodologies rather than abstract actions like 'assessing risks'.
Suggestions
Add more concrete action specifics, e.g., 'generates risk scores 1-5, categorizes changes as breaking/feature/fix/refactor, produces structured summaries'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Names the domain (changeset analysis) and lists some actions like 'categorizing changes, assessing risks, creating summaries' but these are somewhat abstract. Missing concrete specifics like what risk scoring entails or what output formats are produced. | 2 / 3 |
Completeness | Clearly answers both what (categorizing changes, assessing risks, creating summaries) and when (explicit 'Use when:' clause with triggers, plus helpful 'DO NOT use when:' clauses that distinguish from related skills like catchup and review-core). | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'diff analysis', 'changeset review', 'risk assessment', 'release preparation', 'change summary', 'git diff'. These are terms users would naturally use when needing this functionality. | 3 / 3 |
Distinctiveness Conflict Risk | Explicitly distinguishes itself from related skills (catchup, review-core with pensive skills) with clear DO NOT use clauses. The focus on 'systematic change analysis with risk scoring' creates a distinct niche separate from quick context or full PR review. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
85%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured methodology skill with excellent workflow clarity and progressive disclosure. The 4-step process with TodoWrite checkpoints provides clear sequencing. The main weakness is limited actionability - while the structure is clear, concrete examples of categorized changes, risk assessments, or sample outputs would make the skill more immediately executable.
Suggestions
Add a concrete example showing sample categorized changes output (e.g., 'Feature additions: 3, Bug fixes: 2, Refactoring: 5')
Include a brief example of what a completed risk assessment looks like with actual risk scores
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The content is lean and efficient, avoiding unnecessary explanations. Each section serves a clear purpose without padding or explaining concepts Claude already knows. | 3 / 3 |
Actionability | The methodology provides clear steps and references to modules, but lacks concrete examples of actual diff analysis output, specific commands, or executable code snippets. The guidance is structural rather than demonstrative. | 2 / 3 |
Workflow Clarity | The 4-step methodology is clearly sequenced with explicit TodoWrite checkpoints for each step. The progressive loading section provides clear conditional logic, and exit criteria define completion state. | 3 / 3 |
Progressive Disclosure | Excellent structure with clear overview, conditional module loading based on context, and well-signaled one-level-deep references to specific modules (semantic-categorization.md, risk-assessment-framework.md, git-diff-patterns.md). | 3 / 3 |
Total | 11 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
6213d1a
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.