CtrlK
BlogDocsLog inGet started
Tessl Logo

ml-model-training

Train ML models with scikit-learn, PyTorch, TensorFlow. Use for classification/regression, neural networks, hyperparameter tuning, or encountering overfitting, underfitting, convergence issues.

68

Quality

82%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid, actionable skill with excellent executable code examples covering multiple ML frameworks and common pitfalls. Its main weaknesses are verbosity in the Known Issues section (explaining concepts Claude already understands) and a lack of explicit validation checkpoints in the workflow. The progressive disclosure structure is reasonable but the main file could be leaner by moving detailed troubleshooting to a reference file.

Suggestions

Trim the 'Known Issues Prevention' section by removing the 'Problem' explanations and keeping only the solution code with brief labels (e.g., '### Data Leakage: fit scaler on train only').

Add explicit validation checkpoints to the workflow, such as 'Check for data quality issues after preparation' and 'Compare train vs. val metrics to detect overfitting before final test evaluation'.

Move the detailed Known Issues content to a reference file (e.g., references/common-issues.md) and keep only a brief summary with links in the main SKILL.md.

DimensionReasoningScore

Conciseness

The skill is mostly efficient with executable code examples, but the 'Known Issues Prevention' section is quite lengthy and explains concepts Claude likely already knows (e.g., what data leakage is, what class imbalance means). The problem descriptions could be trimmed significantly, keeping just the solutions.

2 / 3

Actionability

The skill provides fully executable, copy-paste ready code for data preparation, scikit-learn training, PyTorch training, hyperparameter tuning, and common issue fixes. Code examples are concrete with specific imports, parameters, and patterns.

3 / 3

Workflow Clarity

The high-level workflow (Data Preparation → Feature Engineering → Model Selection → Training → Evaluation) is listed but lacks explicit validation checkpoints between steps. There's no feedback loop for checking data quality after preparation, no validation step after model training before proceeding to evaluation, and no guidance on what to do if metrics are poor.

2 / 3

Progressive Disclosure

References to pytorch-training.md and tensorflow-keras.md are well-signaled with clear descriptions of their contents, but no bundle files were provided to verify they exist. The main file itself is quite long (~180 lines) with the Known Issues section containing substantial inline content that could be split into a reference file.

2 / 3

Total

9

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong skill description that concisely covers specific frameworks, concrete tasks, and troubleshooting scenarios. It uses third person voice correctly and includes an explicit 'Use for' clause with natural trigger terms. The description is well-structured, covering both the 'what' and 'when' effectively without unnecessary verbosity.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions and frameworks: 'Train ML models with scikit-learn, PyTorch, TensorFlow' and specific tasks like 'classification/regression, neural networks, hyperparameter tuning' plus troubleshooting scenarios like 'overfitting, underfitting, convergence issues'.

3 / 3

Completeness

Clearly answers both what ('Train ML models with scikit-learn, PyTorch, TensorFlow') and when ('Use for classification/regression, neural networks, hyperparameter tuning, or encountering overfitting, underfitting, convergence issues'). The 'Use for' clause serves as an explicit trigger guidance.

3 / 3

Trigger Term Quality

Includes strong natural keywords users would say: 'scikit-learn', 'PyTorch', 'TensorFlow', 'classification', 'regression', 'neural networks', 'hyperparameter tuning', 'overfitting', 'underfitting', 'convergence issues'. These cover both framework names and common ML problem terms.

3 / 3

Distinctiveness Conflict Risk

Clearly scoped to ML model training with specific frameworks and problem types. The combination of named frameworks (scikit-learn, PyTorch, TensorFlow) and specific ML concepts (hyperparameter tuning, overfitting, convergence) creates a distinct niche unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

metadata_version

'metadata.version' is missing

Warning

Total

10

/

11

Passed

Repository
secondsky/claude-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.