CtrlK
BlogDocsLog inGet started
Tessl Logo

cross-validation-setup

Cross Validation Setup - Auto-activating skill for ML Training. Triggers on: cross validation setup, cross validation setup Part of the ML Training skill category.

34

0.92x

Quality

3%

Does it follow best practices?

Impact

89%

0.92x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./planned-skills/generated/07-ml-training/cross-validation-setup/SKILL.md
SKILL.md
Quality
Evals
Security

Evaluation results

83%

-17%

Classifier Benchmarking Pipeline

K-fold cross-validation setup with sklearn

Criteria
Without context
With context

Uses sklearn CV

100%

100%

sklearn or compatible framework

100%

100%

Mean and std reported

100%

53%

Multiple classifiers

100%

100%

Reusable functions

100%

0%

No data leakage

100%

100%

results.txt produced

100%

100%

Pip for dependencies

100%

100%

Stratified or appropriate split

100%

100%

Without context: $0.3279 · 1m 34s · 20 turns · 21 in / 4,363 out tokens

With context: $0.5090 · 2m 1s · 31 turns · 62 in / 6,275 out tokens

85%

-5%

Automated Hyperparameter Optimization for a Production Model

Hyperparameter tuning with cross-validated search

Criteria
Without context
With context

CV-based search utility

100%

0%

sklearn framework

100%

100%

Mean CV score reported

100%

100%

search_results.csv produced

100%

100%

run_log.txt produced

100%

100%

Reusable structure

0%

100%

No data leakage

100%

100%

Meaningful param space

100%

100%

Pip for install

100%

100%

CV folds specified

100%

100%

Without context: $0.2426 · 2m 55s · 17 turns · 17 in / 3,295 out tokens

With context: $0.5928 · 4m 27s · 31 turns · 63 in / 6,521 out tokens

100%

End-to-End ML Training Pipeline with Experiment Tracking

Full ML training workflow with CV and experiment tracking

Criteria
Without context
With context

K-fold loop implemented

100%

100%

sklearn/pytorch/tensorflow used

100%

100%

Per-fold data preparation

100%

100%

Per-fold metric recording

100%

100%

Mean and std in summary

100%

100%

fold_metrics.csv produced

100%

100%

experiment_summary.json produced

100%

100%

Reusable code structure

100%

100%

Pip for dependencies

100%

100%

Full workflow coverage

100%

100%

Without context: $0.2386 · 1m 16s · 17 turns · 18 in / 4,034 out tokens

With context: $0.5024 · 1m 56s · 29 turns · 62 in / 6,271 out tokens

Repository
jeremylongshore/claude-code-plugins-plus-skills
Evaluated
Agent
Claude Code
Model
Claude Sonnet 4.6

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.