ml-pipeline-workflow

Build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment. Use when creating ML pipelines, implementing MLOps practices, or automating model training and deployment workflows.

0.98x

Quality

56%

Does it follow best practices?

Impact

73%

0.98x

Average score across 3 eval scenarios

Securityby

Passed

No known issues

Fix and improve this skill with Tessl

tessl review fix ./tests/ext_conformance/artifacts/agents-wshobson/machine-learning-ops/skills/ml-pipeline-workflow/SKILL.md

Evaluation results

64%

Customer Churn Prediction: Data Preparation Pipeline

Data preparation pipeline with validation and lineage

Criteria

Baseline

With context

Data validation library

Dataset versioning tool

Feature engineering documentation

100%

Data lineage tracking

70%

80%

Stage-level metric logging

100%

Pipeline stage ordering

100%

Stage modularity

100%

Idempotency design

Train/val/test split

50%

Input/output validation at boundaries

100%

Failure handling

62%

100%

Fraud Detection Model: Safe Production Rollout

Production deployment strategy with canary and rollback

Criteria

Baseline

With context

Shadow deployment stage

100%

Canary release stage

100%

A/B testing infrastructure

100%

Automated rollback trigger

100%

Rollback mechanism

100%

Latency monitoring

100%

Throughput monitoring

37%

100%

Model performance drift monitoring

100%

Separated training/serving infra

100%

No direct hard cutover

100%

Production traffic validation

100%

Serving platform reference

50%

100%

56%

-17%

House Price Model: Reproducible Training Pipeline

Model training pipeline with experiment tracking and registry

Criteria

Baseline

With context

Experiment tracking tool

100%

Model registry usage

100%

16%

Named pipeline stages

100%

Per-stage metric logging

20%

30%

Data version tracking

100%

Code version tracking

25%

87%

Model version tracking

100%

Hyperparameter logging

100%

Validation stage present

100%

Failure handling

62%

Stage idempotency

Repository: Dicklesworthstone/pi_agent_rust
Path: tests/ext_conformance/artifacts/agents-wshobson/machine-learning-ops/skills/ml-pipeline-workflow/SKILL.md
Commit: b3dd482

Evaluated: 5 months ago
Agent: Claude Code
Model: Claude Sonnet 4.6

Table of Contents

Customer Churn Prediction: Data Preparation Pipeline House Price Model: Reproducible Training Pipeline Fraud Detection Model: Safe Production Rollout

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.