Pyspark Transformer - Auto-activating skill for Data Pipelines. Triggers on: pyspark transformer, pyspark transformer Part of the Data Pipelines skill category.
36
Quality
3%
Does it follow best practices?
Impact
100%
1.04xAverage score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./planned-skills/generated/11-data-pipelines/pyspark-transformer/SKILL.mdProduction-ready PySpark Transformer implementation
Transformer subclass
100%
100%
transform() method
100%
100%
Configurable parameter
100%
100%
ML persistence
100%
100%
Pipeline integration
100%
100%
Synthetic data creation
100%
100%
Missing value handling
70%
100%
Output schema validation
33%
100%
Three derived features
100%
100%
Production code quality
100%
100%
Validation report
100%
100%
Without context: $0.8660 · 5m 39s · 35 turns · 35 in / 12,154 out tokens
With context: $1.0830 · 5m 56s · 43 turns · 74 in / 15,120 out tokens
ETL pipeline with output validation
PySpark DataFrame API
100%
100%
Three transformations
100%
100%
Null handling
100%
100%
Parquet output
100%
100%
Partitioned by region
100%
100%
Data quality validation
100%
100%
Validation raises error
100%
100%
Step-by-step structure
100%
100%
Row count logging
100%
100%
Production code patterns
100%
100%
Schema awareness
100%
100%
Without context: $0.4215 · 2m 38s · 20 turns · 20 in / 6,656 out tokens
With context: $0.7623 · 3m 54s · 37 turns · 38 in / 8,762 out tokens
Streaming data pipeline with step-by-step workflow
Structured Streaming API
100%
100%
Schema definition
100%
100%
Null machine_id filter
100%
100%
Alert flag derivation
100%
100%
Tumbling window
100%
100%
Windowed aggregates
100%
100%
WriteStream output
100%
100%
Step-by-step workflow plan
100%
100%
Data quality handling documented
100%
100%
Production appName
100%
100%
No collect() on stream
100%
100%
Without context: $0.6340 · 3m 7s · 30 turns · 29 in / 9,792 out tokens
With context: $0.6736 · 2m 50s · 37 turns · 35 in / 8,731 out tokens
0c08951
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.