Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that do not fit in memory.
84
86%
Does it follow best practices?
Impact
72%
1.22x Average score across 3 eval scenarios
Advisory
Suggest reviewing before use
Security
1 medium severity finding. This skill can be installed but you should review these findings before use.
The skill exposes the agent to untrusted, user-generated content from public third-party sources, creating a risk of indirect prompt injection. This includes browsing arbitrary URLs, reading social media posts or forum comments, and analyzing content from unknown websites.
Third-party content exposure detected (high risk: 0.70). The skill's I/O documentation explicitly shows loading data and state from third-party locations (e.g., vaex.open('s3://bucket-name/data.parquet'), vaex.open('gs://bucket-name/data.parquet'), vaex.open('ws://hostname:9000/data') and df.state_load('state.json') in references/io_operations.md), which means untrusted external files/state can be ingested and can restore transformations/models that materially influence subsequent operations.
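One way to act on this finding is to check where a path points before it is handed to `vaex.open` or `df.state_load`. The sketch below uses only the Python standard library and is not part of the skill. The helper name `is_trusted_source`, the allowlist layout, and the bucket name `my-trusted-bucket` are hypothetical and chosen for illustration.

```python
from urllib.parse import urlparse

# Hypothetical allowlist (an assumption for this sketch, not from the skill):
# local paths are accepted, remote sources only from named buckets.
LOCAL_SCHEMES = {"", "file"}
TRUSTED_BUCKETS = {
    "s3": {"my-trusted-bucket"},
    "gs": {"my-trusted-bucket"},
}


def is_trusted_source(path: str) -> bool:
    """Return True if the path is local or points at an allowlisted bucket."""
    parsed = urlparse(path)
    scheme = parsed.scheme.lower()
    if scheme in LOCAL_SCHEMES:
        return True
    # Any scheme not listed in TRUSTED_BUCKETS (for example ws://) is rejected.
    return parsed.netloc in TRUSTED_BUCKETS.get(scheme, set())


# Intended use: call the check first, then open the data or state file
# only when it passes.
for candidate in (
    "data/local.parquet",
    "s3://my-trusted-bucket/data.parquet",
    "s3://bucket-name/data.parquet",
    "ws://hostname:9000/data",
):
    print(candidate, is_trusted_source(candidate))
```

A check like this only narrows where files may come from. It does not validate what a data file or `state.json` contains, so state files from sources you do not control still deserve a manual review.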
1420470