CtrlK
BlogDocsLog inGet started
Tessl Logo

tessl/maven-org-apache-spark--spark-streaming_2-11

Apache Spark Streaming library for processing live data streams with fault-tolerance and high throughput.

Pending
Overview
Eval results
Files

window-operations.mddocs/

Window Operations

Window operations allow you to apply transformations over a sliding window of data, enabling analysis across multiple batches of streaming data.

Core Window Operations

def window(windowDuration: Duration, slideDuration: Duration): DStream[T]
def reduceByWindow(
  reduceFunc: (T, T) => T,
  windowDuration: Duration, 
  slideDuration: Duration
): DStream[T]
def reduceByWindow(
  reduceFunc: (T, T) => T,
  invReduceFunc: (T, T) => T,
  windowDuration: Duration,
  slideDuration: Duration
): DStream[T]
def countByWindow(windowDuration: Duration, slideDuration: Duration): DStream[Long]

Keyed Window Operations

def groupByKeyAndWindow(windowDuration: Duration, slideDuration: Duration): DStream[(K, Iterable[V])]
def reduceByKeyAndWindow(
  func: (V, V) => V,
  windowDuration: Duration,
  slideDuration: Duration
): DStream[(K, V)]
def countByKeyAndWindow(windowDuration: Duration, slideDuration: Duration): DStream[(K, Long)]

Example:

val windowedWordCounts = wordPairs
  .reduceByKeyAndWindow((a: Int, b: Int) => a + b, Seconds(30), Seconds(10))

Install with Tessl CLI

npx tessl i tessl/maven-org-apache-spark--spark-streaming_2-11

docs

dstream-operations.md

index.md

input-sources.md

java-api.md

monitoring-listeners.md

paired-dstream-operations.md

receiver-framework.md

streaming-context.md

utility-classes.md

window-operations.md

tile.json