When the user needs to review an existing contract, assess risk in proposed terms, or evaluate a contract before signing.
65
57%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/contract-review/SKILL.mdQuality
Discovery
50%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description effectively communicates when to use the skill (contract review scenarios) but is weak on specifying what concrete actions or outputs the skill provides. It has decent trigger terms for contract-related queries but misses common synonyms and variations. The description reads more like a trigger condition than a complete skill description.
Suggestions
Add specific capabilities/actions the skill performs, e.g., 'Identifies problematic clauses, flags liability risks, summarizes key terms and obligations, and highlights missing protections in contracts.'
Expand trigger terms to include common variations like 'agreement', 'NDA', 'legal terms', 'clause', 'liability', '.docx contract files'
Restructure to lead with 'what it does' followed by 'Use when...' for clarity, e.g., 'Analyzes contracts to identify risky clauses, summarize obligations, and flag missing protections. Use when the user needs to review a contract, assess risk in proposed terms, or evaluate an agreement before signing.'
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description names the domain (contracts) and some actions ('review', 'assess risk', 'evaluate'), but doesn't list specific concrete outputs or capabilities like identifying problematic clauses, summarizing terms, comparing against standards, or flagging liability issues. | 2 / 3 |
Completeness | The description answers 'when' clearly ('When the user needs to review... assess risk... evaluate before signing') but the 'what' is weak — it doesn't explain what the skill actually does or produces. It describes triggers but not concrete capabilities or outputs. | 2 / 3 |
Trigger Term Quality | Includes relevant terms like 'contract', 'risk', 'proposed terms', and 'signing', which users might naturally say. However, it misses common variations like 'agreement', 'NDA', 'legal review', 'clause analysis', 'red flags', or 'due diligence'. | 2 / 3 |
Distinctiveness Conflict Risk | The focus on contract review and risk assessment is somewhat specific, but could overlap with general legal analysis skills, document review skills, or compliance-related skills. The term 'evaluate' is broad enough to cause potential conflicts. | 2 / 3 |
Total | 8 / 12 Passed |
Implementation
64%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-crafted, highly actionable contract review skill with excellent domain-specific frameworks and realistic examples that demonstrate expected output quality. Its main weaknesses are verbosity (the document is quite long with extensive inline reference material that could be split out) and the lack of explicit validation checkpoints in the workflow for what is a high-stakes analysis task. The red/yellow/green framework and negotiation principles add genuine value beyond Claude's baseline knowledge.
Suggestions
Split the Red/Yellow/Green flag definitions and detailed examples into separate referenced files (e.g., FLAGS.md, EXAMPLES.md) to reduce the main skill's token footprint while preserving the reference material.
Add explicit validation checkpoints to the workflow, such as confirming all contract sections have been reviewed, verifying assumptions about the user's context before analysis, and cross-checking flagged items against the missing protections checklist.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is thorough but verbose in places. The 'When to Use' section over-explains activation triggers, the red/yellow/green flag lists are extensive but useful reference material, and the examples are lengthy. Some trimming is possible (e.g., the 'When to Use' trigger phrases), but most content earns its place as domain-specific knowledge Claude wouldn't inherently know about startup contract review priorities. | 2 / 3 |
Actionability | The skill provides highly concrete, actionable guidance: specific clause language to look for, exact alternative language to suggest, a structured output format with a table template, detailed examples showing real contract clauses and recommended revisions. The red/yellow/green framework gives precise criteria for classification rather than vague direction. | 3 / 3 |
Workflow Clarity | The 6-step workflow is clearly sequenced and logical, but it lacks explicit validation checkpoints or feedback loops. For contract review—which involves risk assessment of binding legal documents—there should be verification steps (e.g., confirming all sections were reviewed, cross-checking flagged items against the missing protections list, or asking the user to confirm context assumptions before proceeding with analysis). | 2 / 3 |
Progressive Disclosure | The content is well-structured with clear headers and sections, and it references related skills. However, the extensive red/yellow/green flag lists and two detailed examples make this a long monolithic document. The flag framework and examples could be split into referenced files (e.g., FLAGS.md, EXAMPLES.md) to keep the main skill leaner while preserving discoverability. | 2 / 3 |
Total | 9 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
Total | 10 / 11 Passed | |
4ad31b4
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.