molfeat

Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML.

1.41x

Quality

66%

Does it follow best practices?

Impact

78%

1.41x

Average score across 3 eval scenarios

Securityby

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./scientific-skills/molfeat/SKILL.md

Quality

Discovery

82%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong, domain-specific description that clearly communicates concrete capabilities using precise cheminformatics terminology. Its main weakness is the lack of an explicit 'Use when...' clause, which would help Claude know exactly when to select this skill. The trigger terms are excellent for the target audience.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when the user needs to convert molecular structures to numerical features, mentions SMILES strings, fingerprints, molecular descriptors, or QSAR modeling.'

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions and tools: ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, QSAR, and molecular ML. These are concrete, domain-specific capabilities.	3 / 3
Completeness	The 'what' is well-covered (molecular featurization with specific methods), and the 'when' is implied by listing use cases (QSAR, molecular ML), but there is no explicit 'Use when...' clause with trigger guidance, which caps this at 2 per the rubric.	2 / 3
Trigger Term Quality	Excellent coverage of natural terms a cheminformatics user would use: 'ECFP', 'MACCS', 'SMILES', 'featurization', 'QSAR', 'molecular ML', 'ChemBERTa', 'descriptors'. These are precisely the keywords someone in this domain would mention.	3 / 3
Distinctiveness Conflict Risk	Highly distinctive niche — molecular featurization for ML is a very specific domain. Terms like ECFP, MACCS, SMILES, ChemBERTa, and QSAR are unlikely to conflict with any other skill.	3 / 3
	Total	11 / 12 Passed

Implementation

50%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill demonstrates strong actionability with executable, well-structured code examples covering a wide range of molecular featurization use cases. However, it is significantly over-verbose, repeating featurizer examples across multiple sections and including explanatory content Claude doesn't need. The content that should live in the referenced files (detailed featurizer comparisons, multiple workflow examples) is duplicated inline, undermining the progressive disclosure structure.

Suggestions

Reduce the main SKILL.md to ~100-120 lines by moving the detailed featurizer selection guide, common workflows, and reference table into the referenced files (available_featurizers.md, examples.md), keeping only a quick-start example and brief pointers.

Remove the 'When to Use This Skill' and 'Core Concepts' explanatory sections — Claude understands what QSAR, virtual screening, and scikit-learn pipelines are. Replace with a terse 3-line summary.

Integrate validation checkpoints into workflows: after featurization, check for None values from failed molecules and validate feature matrix shape before passing to ML models.

Consolidate the repeated ECFP/MACCS examples — these appear in at least 4 separate sections with near-identical code.

Dimension	Reasoning	Score
Conciseness	The skill is extremely verbose at ~350+ lines. It explains concepts Claude already knows (what QSAR is, what virtual screening is, what similarity searching is), includes a lengthy 'When to Use This Skill' section that's unnecessary, repeats similar code patterns multiple times (e.g., ECFP appears in at least 6 different code blocks), and the 'Choosing the Right Featurizer' section overlaps heavily with 'Common Workflows' and the reference table. Much of this could be condensed to 1/3 the length.	1 / 3
Actionability	The skill provides fully executable, copy-paste ready code examples throughout. Every featurizer recommendation comes with concrete import statements and constructor calls. The QSAR, virtual screening, similarity search, and pipeline integration workflows are all complete and runnable.	3 / 3
Workflow Clarity	Workflows are presented as code blocks rather than clearly sequenced steps with validation checkpoints. The QSAR and virtual screening workflows show the happy path but lack validation steps (e.g., checking for None values from failed featurizations before training, validating feature matrix dimensions). The error handling section exists but is separate from the workflows rather than integrated as checkpoints.	2 / 3
Progressive Disclosure	The skill references three supporting files (api_reference.md, available_featurizers.md, examples.md) with clear 'When to load' guidance and grep tips, which is good. However, the main SKILL.md itself contains far too much inline content that duplicates what should be in those reference files — the extensive featurizer selection guide, multiple workflow examples, and the reference table all belong in the referenced files rather than the overview.	2 / 3
	Total	8 / 12 Passed

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 9 / 11 Passed

Validation for skill structure

Criteria	Description	Result
skill_md_line_count	SKILL.md is long (510 lines); consider splitting into references/ and linking	Warning
metadata_version	'metadata.version' is missing	Warning

	Total	9 / 11 Passed

Repository: K-Dense-AI/claude-scientific-skills
Commit: cbcae7b

Reviewed: 1 day ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.