Discover and install skills, docs, and rules to enhance your AI agent's capabilities.
| Name | Contains | Score |
|---|---|---|
MQTT receiver for Apache Spark Streaming that enables real-time processing of messages from MQTT brokers | Docs | — |
Apache Spark Streaming integration with ZeroMQ message queue system for real-time data processing | Docs | — |
Twitter feed receiver for Apache Spark Streaming that enables real-time consumption of Twitter data streams | Docs | — |
Apache Spark Streaming integration with Amazon Kinesis for real-time processing of streaming data | Docs | — |
Spark Streaming integration with Amazon Kinesis for real-time data processing using the Kinesis Client Library | Docs | — |
Apache Spark Streaming integration library for consuming data from Amazon Kinesis streams with fault-tolerant checkpointing and automatic shard management | Docs | — |
A shaded JAR assembly containing all dependencies needed for Kafka integration with Spark Streaming | Docs | — |
Assembly JAR providing Apache Spark integration with Apache Kafka 0.10 for reliable distributed streaming data processing | Docs | — |
Interactive Scala shell for Apache Spark enabling distributed data processing and analysis | Docs | — |
Apache Spark interactive Scala shell (REPL) for Scala 2.10 providing read-eval-print loop interface for interactive data analysis and cluster computing. | Docs | — |
Apache Spark connector for Protocol Buffer (protobuf) data format support, providing SQL functions to convert between binary protobuf data and Catalyst data structures for processing structured data in distributed big data analytics pipelines | Docs | — |
Apache Spark is a unified analytics engine for large-scale data processing with high-level APIs in Scala, Java, Python, and R. | Docs | — |
Apache Spark - Unified analytics engine for large-scale data processing | Docs | — |
tessl/pypi-pyspark v2.4.1 PySpark Streaming module that enables scalable, high-throughput, fault-tolerant stream processing of live data streams in Python | Docs | — |
YARN auxiliary service for Apache Spark shuffle operations that provides external shuffle service functionality in YARN environments | Docs | — |
Apache Spark network common library providing networking abstractions and utilities for distributed computing | Docs | — |
Core networking library for Apache Spark providing transport layer abstractions and utilities | Docs | — |
Core networking infrastructure for Apache Spark cluster computing with transport layer, RPC, chunk fetching, and SASL authentication | Docs | — |
Spark Project ML Local Library provides local (non-distributed) linear algebra utilities and basic machine learning components. | Docs | — |
Apache Spark's scalable machine learning library providing comprehensive ML algorithms and utilities for large-scale data processing | Docs | — |
Can't find what you're looking for? Evaluate a missing skill.