ConfigurationPDF version

RocksDB state backend configuration

You can use RocksDB as a state backend when your Flink streaming application requires a larger state. With ClouderaConfigurableOptionsFactory, you are able to configure RocksDB for YARN applications to have a limit on memory usage supported by the Write Buffer Manager.

The default RocksDB state backend configuration available in Flink is not suitable for YARN applications with large state, as it does not effectively limit the native memory usage of the embedded RocksDB database. To effectively enforce a limit on memory usage, support for Write Buffer Manager is introduced in the ClouderaConfigurableOptionsFactory.

ClouderaConfigurableOptionsFactory can be added to the Flink configuration file to get a good initial configuration of the state backend:
state.backend: rocksdb
state.backend.rocksdb.options-factory: org.apache.flink.contrib.streaming.state.ClouderaConfigurableOptionsFactory

For the default RocksDB state backend configuration, see the Apache Flink documentation. See more information about the Write Buffer Manager here.

We want your opinion

How can we improve this page?

What kind of feedback do you have?