This skill trains machine learning models using automated workflows. It analyzes datasets, selects appropriate model types (classification, regression, etc.), configures training parameters, trains the model with cross-validation, generates performance metrics, and saves the trained model artifact. Use this skill when the user requests to "train" a model, needs to evaluate a dataset for machine learning purposes, or wants to optimize model performance. The skill supports common frameworks like scikit-learn.
90
49%
Does it follow best practices?
Impact
98%
1.07xAverage score across 9 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./backups/skills-migration-20251108-070147/plugins/ai-ml/ml-model-trainer/skills/ml-model-trainer/SKILL.mdClassification model training and evaluation
scikit-learn used
100%
100%
Classification algorithm
100%
100%
Cross-validation applied
0%
100%
Accuracy metric reported
100%
100%
Precision metric reported
100%
100%
Recall metric reported
100%
100%
Model artifact saved
100%
100%
Target variable identified
100%
100%
Model type determined
100%
100%
Data cleaned before training
100%
100%
Regression model training and evaluation
scikit-learn used
100%
100%
Regression algorithm
100%
100%
Cross-validation applied
0%
100%
MSE metric reported
100%
100%
R-squared metric reported
100%
100%
Model artifact saved
100%
100%
Target variable identified
100%
100%
Model type determined
100%
100%
Performance output file
100%
100%
Data formatted before training
100%
100%
Data preprocessing, feature engineering, hyperparameter tuning
scikit-learn used
100%
100%
Missing values handled
100%
100%
Categorical encoding
100%
100%
Feature engineering applied
100%
100%
Hyperparameter tuning
100%
100%
Cross-validation applied
100%
100%
Best params documented
100%
100%
Model artifact saved
100%
100%
Performance metrics reported
100%
100%
Target variable identified
100%
100%
Multi-class classification and metric reporting
scikit-learn used
100%
100%
Classification algorithm
100%
100%
Multi-class handling
100%
100%
Cross-validation applied
100%
100%
Accuracy metric reported
100%
100%
Precision metric reported
100%
100%
Recall metric reported
100%
100%
Target variable identified
100%
100%
Model type determined
100%
100%
Data prepared before training
100%
100%
Model artifact saved
100%
100%
Model persistence and inference workflow
scikit-learn used
100%
100%
Correct model type
100%
100%
Cross-validation applied
0%
100%
MSE metric reported
100%
50%
R-squared metric reported
100%
100%
Model artifact saved
100%
100%
Inference script loads model
100%
100%
Predictions produced
100%
100%
Target variable identified
100%
100%
Data prepared before training
100%
100%
Performance output file
100%
100%
Algorithm selection and cross-validated comparison
scikit-learn used
100%
100%
Regression algorithms chosen
100%
100%
Multiple models compared
100%
100%
Cross-validation applied
100%
100%
MSE reported per model
50%
50%
R-squared reported per model
100%
100%
Best model identified
100%
100%
Model artifact saved
100%
100%
Target variable identified
100%
100%
Data prepared before training
100%
100%
Results written to file
100%
100%
Text feature extraction and classification
scikit-learn used
100%
100%
Text features extracted
100%
100%
Classification algorithm
100%
100%
Model type determined
100%
100%
Cross-validation applied
100%
100%
Accuracy metric reported
100%
100%
Precision metric reported
100%
100%
Recall metric reported
100%
100%
Target variable identified
100%
100%
Model artifact saved
100%
100%
Data prepared before training
100%
100%
Regression with feature engineering
scikit-learn used
100%
100%
Regression algorithm
100%
100%
Model type determined
100%
100%
Feature engineering applied
100%
100%
Cross-validation applied
0%
100%
MSE metric reported
30%
50%
R-squared metric reported
100%
100%
Target variable identified
100%
100%
Data prepared before training
100%
87%
Model artifact saved
100%
100%
Performance output file
100%
100%
Imbalanced dataset classification
scikit-learn used
100%
100%
Classification algorithm
100%
100%
Model type determined
100%
100%
Cross-validation applied
100%
100%
Stratified cross-validation
100%
100%
Accuracy metric reported
100%
100%
Precision metric reported
100%
100%
Recall metric reported
100%
100%
Target variable identified
100%
100%
Data prepared before training
100%
100%
Model artifact saved
100%
100%
Performance output file
100%
100%
13d35b8
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.