HBase sink with Flink
Cloudera Streaming Analytics offers HBase connector as a sink. Like this you can store the output of a real-time processing application in HBase. You must develop your application defining HBase as sink and add HBase dependency to your project.
- Automatic configuration on CDP
- High throughput buffered operations
- Customizable data-driven update/delete logic
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-connector-hbase-1.4</artifactId>
<version>3.0-csa1.13.0.0</version>
</dependency>
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-connector-hbase-2.4</artifactId>
<version>3.0-csa1.13.0.0</version>
</dependency>
The general purpose HBase sink connector is implemented in the
org.apache.flink.addons.hbase.HBaseSinkFunction
class.
This is an abstract class that must be extended to define the interaction logic (mutations)
with HBase. By using the BufferedMutator
instance, you can implement
arbitrary data driven interactions with HBase. While it is possible to run all mutations
supported by the BufferedMutator
interface, Cloudera strongly recommends that
users should only use idempotent mutations: Put and Delete.