Comprehensive toolkit for protein language models including ESM3 (generative multimodal protein design across sequence, structure, and function) and ESM C (efficient protein embeddings and representations). Use this skill when working with protein sequences, structures, or function prediction; designing novel proteins; generating protein embeddings; performing inverse folding; or conducting protein engineering tasks. Supports both local model usage and cloud-based Forge API for scalable inference.
86
82%
Does it follow best practices?
Impact
93%
3.44xAverage score across 3 eval scenarios
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly defines a specialized domain (protein language models), lists specific capabilities and model names, and includes an explicit 'Use this skill when...' clause with comprehensive trigger terms. The description is well-structured, uses third person voice appropriately, and would be highly distinguishable from other skills in a large skill library.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'generative multimodal protein design across sequence, structure, and function', 'efficient protein embeddings and representations', 'inverse folding', 'protein engineering tasks', 'function prediction'. Names specific models (ESM3, ESM C) and deployment modes (local, Forge API). | 3 / 3 |
Completeness | Clearly answers both 'what' (comprehensive toolkit for protein language models with specific capabilities listed) and 'when' (explicit 'Use this skill when...' clause covering multiple trigger scenarios like working with protein sequences, designing novel proteins, generating embeddings, inverse folding, and protein engineering). | 3 / 3 |
Trigger Term Quality | Includes strong natural keywords users would say: 'protein sequences', 'protein structures', 'function prediction', 'designing novel proteins', 'protein embeddings', 'inverse folding', 'protein engineering', 'ESM3', 'ESM C', 'protein language models'. These cover a wide range of terms a bioinformatics user would naturally use. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive niche focused on protein language models (ESM3, ESM C) with domain-specific terminology like 'inverse folding', 'protein embeddings', and 'protein engineering'. Very unlikely to conflict with other skills given the specialized bioinformatics domain. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
64%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a solid, actionable skill with excellent code examples covering the full range of ESM capabilities. Its main weaknesses are the lack of validation/error-handling steps in workflows, some verbosity in explanatory sections that Claude doesn't need, and the absence of the referenced bundle files that would support progressive disclosure. The model selection guide and installation instructions are practical and well-organized.
Suggestions
Add validation checkpoints to workflows—e.g., verify generated sequence length, check for valid amino acids, validate structure output before proceeding to next steps, and include error handling for API failures.
Remove or significantly trim the 'When to use' bullet lists, the 'Resources and Documentation' section, and the 'Responsible Use' section to improve conciseness—these are either inferrable by Claude or not actionable instructions.
Provide the referenced bundle files (references/esm3-api.md, references/esm-c-api.md, references/forge-api.md, references/workflows.md) and move detailed content like the model selection guide and best practices into them to improve progressive disclosure.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably well-structured but includes some unnecessary content: the 'Resources and Documentation' section with blog posts and community links, the 'Responsible Use' section, and explanatory 'When to use' lists that Claude can infer. The model selection guide is useful but slightly verbose. The overview paragraph restates what the section headers already convey. | 2 / 3 |
Actionability | The skill provides fully executable, copy-paste ready Python code examples for all major use cases: sequence generation, structure prediction, inverse folding, embeddings, function conditioning, chain-of-thought generation, and batch processing. Installation commands are concrete and specific. | 3 / 3 |
Workflow Clarity | The chain-of-thought generation section shows a clear multi-step workflow, and the model selection guide provides good decision criteria. However, there are no validation checkpoints or error recovery steps for any of the workflows—no verification that generated proteins are valid, no error handling patterns, and no feedback loops for failed generations. | 2 / 3 |
Progressive Disclosure | The skill references four separate reference files (esm3-api.md, esm-c-api.md, forge-api.md, workflows.md) with clear descriptions, which is good structure. However, no bundle files are actually provided, so these references are broken. Additionally, the main file is quite long (~200+ lines) with substantial inline content that could be better distributed to the referenced files. | 2 / 3 |
Total | 9 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
metadata_version | 'metadata.version' is missing | Warning |
Total | 10 / 11 Passed | |
cbcae7b
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.