Apache Flink v1.13-SNAPSHOT

  • Home

  • Try Flink
    • Local Installation
    • Fraud Detection with the DataStream API
    • Real Time Reporting with the Table API
    • Flink Operations Playground
  • Learn Flink
    • Overview
    • Intro to the DataStream API
    • Data Pipelines & ETL
    • Streaming Analytics
    • Event-driven Applications
    • Fault Tolerance
  • Concepts
    • Overview
    • Stateful Stream Processing
    • Timely Stream Processing
    • Flink Architecture
    • Glossary

  • Application Development
    • DataStream API
      • Overview
      • Execution Mode (Batch/Streaming)
      • Event Time
        • Overview
        • Generating Watermarks
        • Builtin Watermark Generators
      • State & Fault Tolerance
        • Overview
        • Working with State
        • The Broadcast State Pattern
        • Checkpointing
        • Queryable State
        • State Backends
        • State Schema Evolution
        • Custom State Serialization
      • User-Defined Functions
      • Operators
        • Overview
        • Windows
        • Joining
        • Process Function
        • Async I/O
      • Data Sources
      • Side Outputs
      • Handling Application Parameters
      • Testing
      • Experimental Features
      • Scala API Extensions
      • Java Lambda Expressions
      • Project Configuration
    • DataSet API
      • Overview
      • Transformations
      • Iterations
      • Zipping Elements
      • Hadoop Compatibility
      • Local Execution
      • Cluster Execution
      • Batch Examples
    • Table API & SQL
      • Overview
      • Concepts & Common API
      • Streaming Concepts
        • Overview
        • Dynamic Tables
        • Time Attributes
        • Versioned Tables
        • Joins in Continuous Queries
        • Detecting Patterns
        • Query Configuration
        • Legacy Features
      • Data Types
      • Table API
      • SQL
        • Overview
        • Getting Started
        • Queries
        • CREATE Statements
        • DROP Statements
        • ALTER Statements
        • INSERT Statement
        • SQL Hints
        • DESCRIBE Statements
        • EXPLAIN Statements
        • USE Statements
        • SHOW Statements
      • Functions
        • Overview
        • System (Built-in) Functions
        • User-defined Functions
      • Modules
      • Catalogs
      • SQL Client
      • Configuration
      • Performance Tuning
        • Streaming Aggregation
      • User-defined Sources & Sinks
    • Python API
      • Overview
      • Installation
      • Table API Tutorial
      • DataStream API Tutorial
      • Table API User's Guide
        • Intro to the Python Table API
        • TableEnvironment
        • Operations
        • Data Types
        • System (Built-in) Functions
        • User Defined Functions
          • General User-defined Functions
          • Vectorized User-defined Functions
        • Conversions between PyFlink Table and Pandas DataFrame
        • Dependency Management
        • SQL
        • Catalogs
        • Metrics
        • Connectors
      • DataStream API User's Guide
        • Data Types
        • Operators
        • Dependency Management
      • Configuration
      • Environment Variables
      • FAQ
    • Data Types & Serialization
      • Overview
      • Custom Serializers
    • Managing Execution
      • Execution Configuration
      • Program Packaging
      • Parallel Execution
      • Execution Plans
      • Task Failure Recovery
    • API Migration Guides
  • Libraries
    • Event Processing (CEP)
    • State Processor API
    • Graphs: Gelly
      • Overview
      • Graph API
      • Iterative Graph Processing
      • Library Methods
      • Graph Algorithms
      • Graph Generators
      • Bipartite Graph
  • Connectors
    • DataStream Connectors
      • Overview
      • Fault Tolerance Guarantees
      • Kafka
      • Cassandra
      • Kinesis
      • Elasticsearch
      • File Sink
      • Streaming File Sink
      • RabbitMQ
      • NiFi
      • Google Cloud PubSub
      • Twitter
      • JDBC
    • Table & SQL Connectors
      • Overview
      • Formats
        • Overview
        • CSV
        • JSON
        • Confluent Avro
        • Avro
        • Debezium
        • Canal
        • Maxwell
        • Parquet
        • Orc
        • Raw
      • Kafka
      • Upsert Kafka
      • Kinesis
      • JDBC
      • Elasticsearch
      • FileSystem
      • HBase
      • DataGen
      • Print
      • BlackHole
      • Hive
        • Overview
        • Hive Catalog
        • Hive Dialect
        • Hive Read & Write
        • Hive Functions
      • Download
    • DataSet Connectors
  • Deployment
    • Overview
    • Resource Providers
      • Standalone
        • Overview
        • Docker
        • Kubernetes
      • Native Kubernetes
      • YARN
      • Mesos
    • Configuration
    • Memory Configuration
      • Set up Flink's Process Memory
      • Set up TaskManager Memory
      • Set up JobManager Memory
      • Memory tuning guide
      • Troubleshooting
      • Migration Guide
    • Command-Line Interface
    • File Systems
      • Overview
      • Common Configurations
      • Amazon S3
      • Aliyun OSS
      • Azure Blob Storage
      • Plugins
    • High Availability (HA)
      • Overview
      • ZooKeeper HA Services
      • Kubernetes HA Services
    • Metric Reporters
    • Security
      • SSL Setup
      • Kerberos
    • REPLs
      • Python REPL
      • Scala REPL
    • Advanced
      • External Resources
      • History Server
      • Logging
  • Operations
    • State & Fault Tolerance
      • Checkpoints
      • Savepoints
      • State Backends
      • Tuning Checkpoints and Large State
    • Metrics
    • REST API
    • Debugging
      • Debugging Windows & Event Time
      • Debugging Classloading
      • Application Profiling & Debugging
    • Monitoring
      • Monitoring Checkpointing
      • Monitoring Back Pressure
    • Upgrading Applications and Flink Versions
    • Production Readiness Checklist

  • Flink Development
    • Importing Flink into an IDE
    • Building Flink from Source
  • Internals
    • Jobs and Scheduling
    • Task Lifecycle
    • File Systems
  • Javadocs
  • Scaladocs
  • Pythondocs
  • Project Page
  • v1.12
  • v1.11
  • v1.10
  • v1.9
  • v1.8
  • v1.7
  • v1.6
  • v1.5
  • v1.4
  • v1.3
  • v1.2
  • v1.1
  • v1.0
  1. Connectors
  2. Table & SQL Connectors
  3. Formats

Formats

Flink provides a set of table formats that can be used with table connectors. A table format is a storage format defines how to map binary data onto table columns.

Flink supports the following formats:

Formats Supported Connectors
CSV Apache Kafka, Upsert Kafka, Amazon Kinesis Data Streams, Filesystem
JSON Apache Kafka, Upsert Kafka, Amazon Kinesis Data Streams, Filesystem, Elasticsearch
Apache Avro Apache Kafka, Upsert Kafka, Amazon Kinesis Data Streams, Filesystem
Confluent Avro Apache Kafka, Upsert Kafka
Debezium CDC Apache Kafka, Filesystem
Canal CDC Apache Kafka, Filesystem
Maxwell CDC Apache Kafka, Filesystem
Apache Parquet Filesystem
Apache ORC Filesystem
Raw Apache Kafka, Upsert Kafka, Amazon Kinesis Data Streams, Filesystem
Want to contribute translation?