Operational database cluster

You can create an HBase cluster, along with the services it requires such as HDFS and ZooKeeper, using the Operational Database Data Hub cluster template in CDP. The operational database cluster uses Amazon S3 as the storage layer for HBase: HFiles are written to S3, while write-ahead logs (WALs) are written to HDFS.
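
The split between S3 and HDFS can be pictured in terms of the two HBase properties that control data and WAL locations, hbase.rootdir and hbase.wal.dir. The following is a minimal sketch only: the bucket name, nameservice, and directory paths are hypothetical placeholders, and the values that CDP actually sets in a Data Hub cluster may differ.

```python
from urllib.parse import urlparse


def storage_properties(bucket: str, nameservice: str) -> dict:
    """Build illustrative HBase properties: HFiles on S3, WALs on HDFS.

    The paths below are assumptions for illustration, not values taken from CDP.
    """
    return {
        # Root directory for HBase data (HFiles); here an S3 location via the s3a connector.
        "hbase.rootdir": f"s3a://{bucket}/hbase",
        # Separate directory for write-ahead logs; here kept on HDFS.
        "hbase.wal.dir": f"hdfs://{nameservice}/hbase-wals",
    }


def storage_schemes(props: dict) -> dict:
    """Map each property to the URI scheme (file system type) of its value."""
    return {key: urlparse(value).scheme for key, value in props.items()}


# Hypothetical bucket and nameservice names.
props = storage_properties("example-bucket", "example-ns")
for key, value in props.items():
    print(f"{key} = {value}")
```

Calling storage_schemes(props) on the result shows at a glance which file system backs each location.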

CDP makes use of reusable cluster templates. The cluster template for Operational Database consists of Apache HBase and the underlying HDFS and Apache ZooKeeper services that it needs to operate. Apache Knox is used to proxy the different user interfaces.
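
As a rough illustration of what "consists of" means in practice, the sketch below checks that a template lists the three services named above. The dictionary shape (a "services" list with a "serviceType" field) is a simplified, hypothetical stand-in and should not be read as the actual CDP cluster template schema.

```python
# Services the Operational Database template is described as containing.
REQUIRED_SERVICES = {"HBASE", "HDFS", "ZOOKEEPER"}


def missing_services(template: dict) -> set:
    """Return the required service types that the template does not list."""
    present = {svc["serviceType"] for svc in template.get("services", [])}
    return REQUIRED_SERVICES - present


# Hypothetical, heavily simplified template used only for this example.
example_template = {
    "displayName": "example-opdb",
    "services": [
        {"refName": "hbase", "serviceType": "HBASE"},
        {"refName": "hdfs", "serviceType": "HDFS"},
        {"refName": "zookeeper", "serviceType": "ZOOKEEPER"},
        {"refName": "knox", "serviceType": "KNOX"},
    ],
}

missing = missing_services(example_template)
print("missing services:", sorted(missing))
```

A check like this can be handy before registering a custom template, since an HBase service without HDFS or ZooKeeper would not be able to operate.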

Each cluster definition must reference a specific cluster template. You can also create and upload your own cluster templates. For example, you can export a cluster template from an existing on-premises HDP cluster and register it in CDP to use it for creating Data Hub clusters.