Discover documentation to enhance your AI agent's capabilities.
| Name | Contains | Score |
|---|---|---|
| Apache Spark - Unified analytics engine for large-scale data processing | Docs | — |
| tessl/pypi-pyspark v2.4.1 - PySpark Streaming module that enables scalable, high-throughput, fault-tolerant stream processing of live data streams in Python | Docs | — |
| YARN auxiliary service for Apache Spark shuffle operations that provides external shuffle service functionality in YARN environments | Docs | — |
| Apache Spark network common library providing networking abstractions and utilities for distributed computing | Docs | — |
| Core networking library for Apache Spark providing transport layer abstractions and utilities | Docs | — |
| Core networking infrastructure for Apache Spark cluster computing with transport layer, RPC, chunk fetching, and SASL authentication | Docs | — |
| Spark Project ML Local Library provides local (non-distributed) linear algebra utilities and basic machine learning components | Docs | — |
| Apache Spark's scalable machine learning library providing comprehensive ML algorithms and utilities for large-scale data processing | Docs | — |
| Apache Mesos cluster manager integration for Apache Spark enabling distributed computation on Mesos clusters | Docs | — |
| Spark ML Local Library providing linear algebra and statistical utilities for local machine learning operations without requiring a distributed Spark cluster | Docs | — |
| Library for launching Spark applications programmatically | Docs | — |
| GraphX is Apache Spark's API for graphs and graph-parallel computation | Docs | — |
| Ganglia metrics reporting integration for Apache Spark monitoring systems | Docs | — |
| Apache Spark Core provides distributed computing capabilities with RDDs, task scheduling, and cluster management for big data processing | Docs | — |
| Core utility classes and functions for Apache Spark including exception handling, logging, storage configuration, and Java API integration | Docs | — |
| Catalyst query optimization framework and expression evaluation engine for Apache Spark SQL | Docs | — |
| Catalyst is Spark's library for manipulating relational query plans and expressions | Docs | — |
| Apache Spark Avro connector provides seamless integration between Apache Spark SQL and Apache Avro data format with automatic schema evolution support and built-in compression capabilities | Docs | — |
| Core server component of Apache SkyWalking APM system providing distributed tracing, monitoring, and observability capabilities for microservices and cloud-native architectures | Docs | — |
| Web support module for Apache Shiro providing servlet filters, session management, and web-specific authentication and authorization features | Docs | — |