CtrlK
BlogDocsLog inGet started
Tessl Logo

ml-pipeline-workflow

Build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment. Use when creating ML pipelines, implementing MLOps practices, or automating model training and deployment workflows.

39

Quality

37%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/machine-learning-ops/skills/ml-pipeline-workflow/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

67%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description has a solid structure with an explicit 'Use when' clause and covers the MLOps domain adequately. However, it stays at a relatively high level of abstraction, listing pipeline stages rather than specific concrete actions, and could benefit from more natural trigger terms and variations that users would actually type. The distinctiveness could be improved by specifying what makes this different from general ML or DevOps skills.

Suggestions

Add more specific concrete actions beyond pipeline stages, e.g., 'configure experiment tracking, set up model registries, implement feature stores, automate retraining schedules, manage model versioning'.

Expand trigger terms with natural variations users would say: 'machine learning pipeline', 'model serving', 'CI/CD for ML', 'experiment tracking', 'model monitoring', 'MLflow', 'Kubeflow'.

DimensionReasoningScore

Specificity

Names the domain (MLOps) and lists some actions ('data preparation', 'model training', 'validation', 'production deployment'), but these are high-level pipeline stages rather than multiple specific concrete actions like 'create feature stores, configure hyperparameter tuning, set up model registries, implement A/B testing'.

2 / 3

Completeness

Clearly answers both 'what' (build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment) and 'when' (explicit 'Use when' clause covering creating ML pipelines, implementing MLOps practices, or automating model training and deployment workflows).

3 / 3

Trigger Term Quality

Includes relevant keywords like 'ML pipelines', 'MLOps', 'model training', 'deployment workflows', but misses common natural variations users might say such as 'machine learning', 'CI/CD for models', 'model serving', 'experiment tracking', 'feature engineering', 'model registry', or specific tools like 'Kubeflow', 'MLflow'.

2 / 3

Distinctiveness Conflict Risk

The MLOps focus provides some distinctiveness, but terms like 'model training' and 'data preparation' could overlap with general data science or machine learning skills. The description doesn't clearly delineate boundaries from adjacent skills like a general ML skill or a deployment/DevOps skill.

2 / 3

Total

9

/

12

Passed

Implementation

7%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill reads as a high-level overview or course outline rather than actionable guidance. It is extremely verbose, listing concepts and tool names that Claude already knows, while providing almost no executable code, concrete commands, or specific implementation details. The referenced supporting files (references/, assets/) don't exist in the bundle, leaving the skill as an empty shell of descriptions without substance.

Suggestions

Replace the lists of tool names and abstract best practices with concrete, executable code examples — e.g., a complete Airflow DAG definition, a working data validation script with Great Expectations, or a real MLflow experiment tracking integration.

Add explicit validation checkpoints with concrete commands at each pipeline stage (e.g., 'Run `great_expectations checkpoint run my_checkpoint` and verify all expectations pass before proceeding to training').

Remove the 'When to Use This Skill', 'What This Skill Provides', and 'Integration Points' sections — these describe rather than instruct and contain information Claude already knows.

Either provide the referenced bundle files (data-preparation.md, model-training.md, pipeline-dag.yaml.template, etc.) with real content, or inline the critical actionable content directly in the SKILL.md.

DimensionReasoningScore

Conciseness

Extremely verbose and padded with information Claude already knows. Lists of orchestration tools, deployment platforms, best practices like 'modularity' and 'idempotency' are general knowledge. The 'When to Use This Skill' and 'What This Skill Provides' sections are meta-descriptions that waste tokens without teaching anything actionable. The entire document reads like a table of contents or course syllabus rather than a skill.

1 / 3

Actionability

Almost no executable code or concrete commands. The Python code examples are either trivial (a list of strings) or empty comments pointing to other files ('# See references/data-preparation.md'). The YAML example is a skeleton with no real content. There is nothing copy-paste ready that would help Claude actually build an ML pipeline.

1 / 3

Workflow Clarity

The 'Production Workflow' section lists phases at a very high level with no concrete commands, validation checkpoints, or error recovery loops. Steps like 'Run data quality checks' and 'Approve for deployment' are vague directives with no specifics on how to execute or validate them. For a multi-step pipeline involving destructive/batch operations, the complete absence of validation feedback loops is a critical gap.

1 / 3

Progressive Disclosure

The skill references external files in references/ and assets/ directories with clear signaling, which is good structure. However, no bundle files are provided, so all those references are dead links. The main document itself is a monolithic wall of high-level descriptions that should have been split, with the actual actionable content in the referenced files that don't exist.

2 / 3

Total

5

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
wshobson/agents
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.