esm

Comprehensive toolkit for protein language models including ESM3 (generative multimodal protein design across sequence, structure, and function) and ESM C (efficient protein embeddings and representations). Use this skill when working with protein sequences, structures, or function prediction; designing novel proteins; generating protein embeddings; performing inverse folding; or conducting protein engineering tasks. Supports both local model usage and cloud-based Forge API for scalable inference.

3.44x

Quality

82%

Does it follow best practices?

Impact

93%

3.44x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly defines a specialized domain (protein language models), lists specific capabilities and model names, and includes an explicit 'Use this skill when...' clause with comprehensive trigger terms. The description is well-structured, uses third person voice appropriately, and would be highly distinguishable from other skills in a large skill library.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'generative multimodal protein design across sequence, structure, and function', 'efficient protein embeddings and representations', 'inverse folding', 'protein engineering tasks', 'function prediction'. Names specific models (ESM3, ESM C) and deployment modes (local, Forge API).	3 / 3
Completeness	Clearly answers both 'what' (comprehensive toolkit for protein language models with specific capabilities listed) and 'when' (explicit 'Use this skill when...' clause covering multiple trigger scenarios like working with protein sequences, designing novel proteins, generating embeddings, inverse folding, and protein engineering).	3 / 3
Trigger Term Quality	Includes strong natural keywords users would say: 'protein sequences', 'protein structures', 'function prediction', 'designing novel proteins', 'protein embeddings', 'inverse folding', 'protein engineering', 'ESM3', 'ESM C', 'protein language models'. These cover a wide range of terms a bioinformatics user would naturally use.	3 / 3
Distinctiveness Conflict Risk	Highly distinctive niche focused on protein language models (ESM3, ESM C) with domain-specific terminology like 'inverse folding', 'protein embeddings', and 'protein engineering'. Very unlikely to conflict with other skills given the specialized bioinformatics domain.	3 / 3
	Total	12 / 12 Passed

Implementation

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid, actionable skill with excellent code examples covering the full range of ESM capabilities. Its main weaknesses are the lack of validation/error-handling steps in workflows, some verbosity in explanatory sections that Claude doesn't need, and the absence of the referenced bundle files that would support progressive disclosure. The model selection guide and installation instructions are practical and well-organized.

Suggestions

Add validation checkpoints to workflows—e.g., verify generated sequence length, check for valid amino acids, validate structure output before proceeding to next steps, and include error handling for API failures.

Remove or significantly trim the 'When to use' bullet lists, the 'Resources and Documentation' section, and the 'Responsible Use' section to improve conciseness—these are either inferrable by Claude or not actionable instructions.

Provide the referenced bundle files (references/esm3-api.md, references/esm-c-api.md, references/forge-api.md, references/workflows.md) and move detailed content like the model selection guide and best practices into them to improve progressive disclosure.

Dimension	Reasoning	Score
Conciseness	The skill is reasonably well-structured but includes some unnecessary content: the 'Resources and Documentation' section with blog posts and community links, the 'Responsible Use' section, and explanatory 'When to use' lists that Claude can infer. The model selection guide is useful but slightly verbose. The overview paragraph restates what the section headers already convey.	2 / 3
Actionability	The skill provides fully executable, copy-paste ready Python code examples for all major use cases: sequence generation, structure prediction, inverse folding, embeddings, function conditioning, chain-of-thought generation, and batch processing. Installation commands are concrete and specific.	3 / 3
Workflow Clarity	The chain-of-thought generation section shows a clear multi-step workflow, and the model selection guide provides good decision criteria. However, there are no validation checkpoints or error recovery steps for any of the workflows—no verification that generated proteins are valid, no error handling patterns, and no feedback loops for failed generations.	2 / 3
Progressive Disclosure	The skill references four separate reference files (esm3-api.md, esm-c-api.md, forge-api.md, workflows.md) with clear descriptions, which is good structure. However, no bundle files are actually provided, so these references are broken. Additionally, the main file is quite long (~200+ lines) with substantial inline content that could be better distributed to the referenced files.	2 / 3
	Total	9 / 12 Passed

Validation

90%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 10 / 11 Passed

Validation for skill structure

Criteria	Description	Result
metadata_version	'metadata.version' is missing	Warning

	Total	10 / 11 Passed

Repository: K-Dense-AI/claude-scientific-skills
Commit: cbcae7b

Reviewed: about 13 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.