About Oozie High Availability
In CDH 5, you can configure multiple active Oozie servers against the same
database, providing high availability for Oozie. This is supported in both MRv1 or MRv2
(YARN). You need:
- Multiple Oozie servers, preferably identically configured;
- A database that supports multiple concurrent connections, and (preferably) has HA support;
- A ZooKeeper ensemble comprising at least three ZooKeeper servers;
- A loadbalancer (preferably with HA support, for example HAProxy), Virtual IP, or Round-Robin DNS, to provide a single entry-point for users and for callbacks from the Application Master or JobTracker.
For more information, and installation and configuration instructions, see http://archive.cloudera.com/cdh5/cdh/5/oozie.
For instructions on how to set up Oozie HA with Kerberos see Configuring Oozie HA with Kerberos.
<< Configuring High Availability for Other CDH Components | Cloudera Impala User Guide >> | |