Discover docs

Discover documentation to enhance your AI agent's capabilities.

tessl/maven-org-apache-spark--spark-sql-2-12
v2.4.0

Apache Spark's distributed SQL engine and structured data processing framework for manipulating structured data using SQL queries and DataFrame/Dataset APIs

Docs

Pending

Spark SQL module for Apache Spark providing structured data processing with SQL and DataFrame APIs

Docs

Pending

Kafka 0.10+ Source for Structured Streaming

Docs

Pending

Apache Spark Structured Streaming integration with Apache Kafka providing comprehensive data source and sink capabilities for both batch and streaming workloads.

Docs

Pending

Spark SQL API module providing core SQL data types, rows, and foundational APIs for Spark SQL operations

Docs

Pending

Interactive Scala shell (REPL) component for Apache Spark providing real-time data processing capabilities and exploratory data analysis

Docs

Pending

Apache Spark connector for Protocol Buffers data source enabling seamless protobuf serialization and deserialization in Spark SQL.

Docs

Pending

Apache Spark YARN Shuffle Service, providing shuffle service functionality for YARN-managed clusters

Docs

Pending

External shuffle service client for Apache Spark that enables reading shuffle blocks from external servers instead of executors

Docs

Pending

Apache Spark MLlib is a scalable machine learning library that provides high-level APIs for common machine learning algorithms and utilities

Docs

Pending

Apache Spark Mesos resource manager that enables Spark applications to run on Apache Mesos clusters with both coarse-grained and fine-grained scheduling modes

Docs

Pending

Library for launching Spark applications programmatically with monitoring and control capabilities.

Docs

Pending

A key-value store abstraction for storing application data locally with automatic serialization, indexing, and support for multiple storage backends.

Docs

Pending

Hive integration module for Apache Spark providing HiveQL support and Hive metastore access

Docs

Pending

Hadoop cloud integration capabilities for Apache Spark, enabling seamless interaction with cloud storage systems

Docs

Pending

Ganglia integration module for Apache Spark metrics system

Docs

Pending

Ganglia metrics sink integration for Apache Spark enabling metrics reporting to Ganglia monitoring systems

Docs

Pending

Docker-based integration testing framework for Apache Spark JDBC connectivity with multiple database systems

Docs

Pending

Docker integration tests for Apache Spark providing automated JDBC database testing with containerized environments.

Docs

Pending

A decoupled client-server architecture component for Apache Spark that enables remote connectivity to Spark clusters using the DataFrame API and gRPC protocol.

Docs

Pending
