You can enable YARN ResourceManager High Availability using Cloudera
Manager.
When you enable ResourceManager HA in Cloudera Manager, work preserving recovery is
also enabled for the ResourceManager by default.
-
In Cloudera Manager, select the YARN service.
-
Click Actions.
-
Select Enable High Availability.
A screen showing the hosts that are eligible to run a standby
ResourceManager displays. The host where the current ResourceManager is running
is not available as a choice.
-
Select the host where you want the standby ResourceManager to be
installed.
-
Click Continue.
Cloudera Manager proceeds to run a set of commands that stop the YARN
service, add a standby ResourceManager, initialize the ResourceManager high
availability state in ZooKeeper, restart YARN, and redeploy the relevant client
configurations.
ResourceManager HA does not affect the JobHistory Server (JHS). JHS does
not maintain any state, so if the host fails you can simply assign it to a new
host.
If you want to enable process auto-restart, do the
following:
-
In Cloudera Manager, select the YARN service.
-
Click the Configuration tab.
-
Search for restart.
-
Find the Automatically Restart Process property.
-
Click Edit Individual Values.
-
Select the JobHistory Server Default Group option.
-
Click Save Changes.
-
Restart the JobHistory Server role.