The result of DataSet.aggregate.
A specific DataSet that results from a coGroup operation.
A specific DataSet that results from a cross operation.
The DataSet, the basic abstraction of Flink.
The ExecutionEnvironment is the context in which a program is executed.
A DataSet to which a grouping key was added.
A specific DataSet that results from a join operation.
The result of DataSet.sortPartition.
A SelectByMaxFunction that works with Scala tuples.
A SelectByMinFunction that works with Scala tuples.
An unfinished coGroup operation that results from DataSet.coGroup. The keys for the left and right side must be specified using first where and then equalTo.
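The where-then-equalTo sequence for a coGroup could look roughly like the sketch below. It assumes tuple inputs keyed by field position 0; the object name, the sample data, and the choice to count the elements on each side are illustrative assumptions, not part of the API description.

```scala
import org.apache.flink.api.scala._

object CoGroupSketch {
  def main(args: Array[String]): Unit = {
    val env = ExecutionEnvironment.getExecutionEnvironment

    // Hypothetical sample data: (userId, name) and (userId, item).
    val users  = env.fromElements((1, "alice"), (2, "bob"))
    val orders = env.fromElements((1, "book"), (1, "pen"), (3, "lamp"))

    // coGroup is unfinished until both keys are given:
    // where(...) selects the left key, equalTo(...) the right key.
    val sizes = users
      .coGroup(orders)
      .where(0)
      .equalTo(0)
      .apply { (us: Iterator[(Int, String)], os: Iterator[(Int, String)]) =>
        // Emit how many elements each side contributed for one key.
        (us.size, os.size)
      }

    // print() triggers execution of the program.
    sizes.print()
  }
}
```

Unlike a join, a coGroup also emits groups for keys that appear on only one side, so one of the two iterators may be empty.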
An unfinished inner join operation that results from calling DataSet.join().
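As a minimal sketch of completing an inner join, assuming the same where/equalTo key pattern with tuple field positions: the object name and sample data are hypothetical, and the function passed to apply is just one way to shape the result.

```scala
import org.apache.flink.api.scala._

object InnerJoinSketch {
  def main(args: Array[String]): Unit = {
    val env = ExecutionEnvironment.getExecutionEnvironment

    // Hypothetical sample data: (userId, name) and (userId, item).
    val users  = env.fromElements((1, "alice"), (2, "bob"))
    val orders = env.fromElements((1, "book"), (1, "pen"), (3, "lamp"))

    // The join stays unfinished until both key fields are specified.
    val joined = users
      .join(orders)
      .where(0)   // key of the left input
      .equalTo(0) // key of the right input
      .apply { (u: (Int, String), o: (Int, String)) =>
        // Combine each matching pair into (name, item).
        (u._2, o._2)
      }

    // print() triggers execution of the program.
    joined.print()
  }
}
```

Only keys present on both sides produce output here, so the sample rows for user 2 and user 3 would not appear in the result.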
An unfinished outer join operation that results from calling, e.
acceptPartialFunctions extends the original DataSet with methods with unique names that delegate to core higher-order functions (e.
The Flink Scala API. org.apache.flink.api.scala.ExecutionEnvironment is the starting-point of any Flink program. It can be used to read from local files, HDFS, or other sources. org.apache.flink.api.scala.DataSet is the main abstraction of data in Flink. It provides operations that create new DataSets via transformations. org.apache.flink.api.scala.GroupedDataSet provides operations on grouped data that results from org.apache.flink.api.scala.DataSet.groupBy().
Use org.apache.flink.api.scala.ExecutionEnvironment.getExecutionEnvironment to obtain an execution environment. This will either create a local environment or a remote environment, depending on the context where your program is executing.
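A short sketch of how these pieces typically fit together: obtain the environment, create a DataSet, group it, and aggregate. The object name and input strings are made up for illustration, and the program assumes the Flink Scala API is on the classpath.

```scala
import org.apache.flink.api.scala._

object WordCountSketch {
  def main(args: Array[String]): Unit = {
    // Local or remote environment, depending on where the program runs.
    val env = ExecutionEnvironment.getExecutionEnvironment

    // Hypothetical in-memory input; files or HDFS could be read instead.
    val text = env.fromElements("to be or not to be", "that is the question")

    val counts = text
      .flatMap { _.toLowerCase.split("\\s+") } // DataSet[String]
      .map { (_, 1) }                          // DataSet[(String, Int)]
      .groupBy(0)                              // GroupedDataSet keyed by the word
      .sum(1)                                  // aggregate the counts per word

    // print() triggers execution of the program.
    counts.print()
  }
}
```

The wildcard import of org.apache.flink.api.scala._ matters in this sketch because it brings in the implicit type information that the transformations require.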