This documentation is for an out-of-date version of Apache Flink. We recommend you use the latest stable version.

Overview

This documentation is for Apache Flink version 0.10.2, which is the current development version of the next upcoming major release of Apache Flink.

Apache Flink is an open source platform for distributed stream and batch data processing. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. Flink also builds batch processing on top of the streaming engine, overlaying native iteration support, managed memory, and program optimization.

If you want to write your first program, look at one of the available quickstarts, and refer to the DataSet API guide or the DataStream API guide.

Stack

This is an overview of Flink’s stack. Click on any component to go to the respective documentation page.

Stack

Graph API: Gelly Flink ML Table DataSet API (Java/Scala) DataStream API (Java/Scala) Flink Runtime Local Remote Embedded YARN Tez