Apache Flink Machine Learning Library for Scala 2.10 - provides distributed machine learning algorithms for scalable data processing on Flink's streaming and batch processing engine
npx @tessl/cli install tessl/maven-org-apache-flink--flink-ml_2-10@1.3.0Apache Flink Machine Learning Library for Scala 2.10. This library provides distributed machine learning capabilities built on top of Apache Flink's stream and batch processing engine. Note: This version (1.3.3) contains a minimal implementation with stub functionality.
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-ml_2.10</artifactId>
<version>1.3.3</version>
</dependency>import org.apache.flink.ml.MLPackage
import org.apache.flink.ml.regression.MultipleLinearRegressionFor Flink execution context:
import org.apache.flink.api.scala._import org.apache.flink.api.scala._
import org.apache.flink.ml.MLPackage
import org.apache.flink.ml.regression.MultipleLinearRegression
// Access package information
val version = MLPackage.version
val scalaVersion = MLPackage.scalaVersion
// Create regression model (stub implementation)
val regression = new MultipleLinearRegression()
// Note: The fit method is a stub and returns dummy data
// val model = regression.fit(trainingData)Access version and compatibility information for the ML library.
object MLPackage {
val version: String
val scalaVersion: String
}Basic linear regression implementation for distributed machine learning. Note: This is a stub implementation in version 1.3.3.
class MultipleLinearRegression extends Serializable {
/**
* Fit the linear regression model
*
* Parameters:
* - trainingData: DataSet[LabeledVector] - Training dataset with labeled vectors
*
* Returns:
* DataSet[Array[Double]] - Model coefficients (stub implementation returns Array(0.0))
*/
def fit(trainingData: DataSet[LabeledVector]): DataSet[Array[Double]]
}The following types are used in the API:
// DataSet is from Flink's core API (imported via org.apache.flink.api.scala._)
// Represents a distributed dataset in Flink
type DataSet[T] // Flink distributed dataset
// Note: The following types are imported in MultipleLinearRegression.scala
// but are NOT defined in this stub implementation:
// - org.apache.flink.ml.common.LabeledVector
// - org.apache.flink.ml.common.LinearAlgebra
//
// These imports exist in the source code but reference non-existent classes,
// indicating this is an incomplete stub implementation.Important: This version (1.3.3) appears to be a minimal stub implementation. The actual Flink ML library documentation describes comprehensive machine learning capabilities including:
However, these features are not present in the actual source code for this version. Only the basic package information and a stub MultipleLinearRegression class are available.
This library depends on:
flink-scala_2.10: Core Flink Scala APIflink-streaming-scala_2.10: Flink streaming Scala APIscala-library: Scala 2.10.6 standard library