Matter budgeting and ongoing WIP/variance monitoring. Build phase-based fee estimates at matter setup, run bottom-up budgets by jurisdiction or workstream, calculate contingency, and structure AFA arrangements (fixed fee, capped fee, phased fixed fees). Ongoing monitoring: WIP tracking against budget, proportionality assessment (spend vs progress), variance commentary with root cause analysis, forecast-to-complete, realisation monitoring, write-off analysis. Trigger on: 'build a budget', 'fee estimate', 'what will this cost', 'WIP review', 'budget vs actual', 'how are we tracking against budget', 'we're over budget', 'realisation is poor', 'what's our ETC', 'budget for the German workstream', 'model the financial impact of this scope change', 'draft a fee adjustment', 'write-off analysis', 'how much contingency', 'AFA structure', 'fixed fee estimate', 'budget update', 'forecast to complete'.
90
88%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that thoroughly covers specific capabilities in legal matter budgeting and WIP monitoring, provides extensive natural trigger terms, and clearly delineates both what the skill does and when it should be used. The domain-specific terminology ensures strong distinctiveness. The description uses appropriate third-person voice throughout.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description lists numerous specific concrete actions: build phase-based fee estimates, run bottom-up budgets by jurisdiction/workstream, calculate contingency, structure AFA arrangements, WIP tracking against budget, proportionality assessment, variance commentary with root cause analysis, forecast-to-complete, realisation monitoring, and write-off analysis. | 3 / 3 |
Completeness | Clearly answers both 'what does this do' (matter budgeting, WIP/variance monitoring, fee estimates, AFA structuring, etc.) and 'when should Claude use it' with an explicit 'Trigger on:' clause containing extensive trigger phrases. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural trigger terms users would actually say, including conversational phrases like 'what will this cost', 'we're over budget', 'how are we tracking against budget', 'realisation is poor', alongside more formal terms like 'fee estimate', 'WIP review', 'budget vs actual', 'forecast to complete'. Covers both casual and professional language. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche in legal matter budgeting and financial monitoring. Domain-specific terms like 'matter', 'AFA arrangements', 'realisation monitoring', 'WIP tracking', and 'workstream' make it very unlikely to conflict with general budgeting or financial skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
77%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a highly actionable and well-structured skill with excellent workflow clarity, explicit validation checkpoints, and concrete output templates for every operating mode. Its main weakness is length — the domain knowledge sections are thorough but verbose, explaining some concepts that could be assumed or moved to reference files. The progressive disclosure could be improved by extracting detailed domain knowledge into linked reference documents while keeping the SKILL.md as a leaner operational guide.
Suggestions
Extract the Domain Knowledge sections (Budget Build and WIP Monitoring) into separate reference files (e.g., BUDGET_BUILD_REFERENCE.md, WIP_MONITORING_REFERENCE.md) and link to them from the main skill, keeping only the most critical rules inline.
Tighten explanatory prose — for example, the proportionality test explanation ('60% of budget consumed with 60% of work complete: on track') could be reduced to a compact table or formula rather than narrative examples that Claude can infer.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is comprehensive and most content earns its place, but there is notable verbosity in areas like explaining what proportionality means, what realisation means, and the extended self-reported completion scepticism section. Some domain knowledge sections explain concepts an LPM-focused Claude would already understand. The document is ~400+ lines where ~250-300 could convey the same information. | 2 / 3 |
Actionability | Highly actionable throughout: exact table column headers are specified, query draft templates with fill-in fields are provided, variance commentary blocks have concrete formats, status values are enumerated precisely, and calculation methods (burn rate, remaining work) are explicitly defined with formulas. The skill provides copy-paste-ready output structures for every mode. | 3 / 3 |
Workflow Clarity | Each of the five operating modes has clear sequencing with explicit validation checkpoints. The four-question variance analysis framework, the underspend verification steps, the completion scepticism challenge (with required language), the financial disclosure sequencing gate (reconcile → process write-offs → confirm with partner → then communicate), and the query/chase loop all demonstrate robust feedback loops and error recovery patterns. | 3 / 3 |
Progressive Disclosure | The skill references cross-skill handoffs well and has clear section organization, but the document itself is monolithic — the domain knowledge sections for Budget Build and WIP Monitoring are extensive inline content that could be split into reference files. There are no references to supplementary files for detailed guidance (e.g., AFA structures, variance analysis frameworks, or rate benchmarks could be separate documents). | 2 / 3 |
Total | 10 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
8f9093f
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.