Example: Configuration for work preserving recovery
Use work preserving recovery properties to enable and configure this feature.
<property> <name>yarn.resourcemanager.work-preserving-recovery.enabled</name> <value>true</value> <description>Whether to enable work preserving recovery for the Resource Manager.</description> </property> <property> <name>yarn.nodemanager.recovery.enabled</name> <value>true</value> <description>Whether to enable work preserving recovery for the Node Manager.</description> </property> <property> <name>yarn.nodemanager.recovery.dir</name> <value>/home/cloudera/recovery</value> <description>The location for stored state on the Node Manager, if work preserving recovery is enabled.</description> </property> <property> <name>yarn.nodemanager.address</name> <value>0.0.0.0:45454</value> <description>The address of the container manager in the Node Manager.</description> </property> <property> <name>yarn.resourcemanager.store.class</name> <value>org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore</value> <description>The class name of the state-store to be used for saving application/attempt state and the credentials. </description> </property> <property> <name>hadoop.zk.address</name> <value>0.0.0.0:3001,0.0.0.0:3002,0.0.0.0:3003</value> <description>Comma separated list of Host:Port pairs. Each corresponds to a ZooKeeper to be used by the Resource Manager for storing Resource Manager state.</description> </property> <property> <name>yarn.resourcemanager.zk-state-store.parent-path</name> <value>/rmstore</value> <description>The full path of the root znode where Resource Manager state will be stored.</description> </property> <property> <name>hadoop.zk.num-retries</name> <value>500</value> <description>Number of times Resource Manager tries to connect to ZooKeeper server if the connection is lost.</description> </property> <property> <name>hadoop.zk.retry-interval-ms</name> <value>120</value> <description>The interval in milliseconds between retries when connecting to a ZooKeeper server.</description> </property> <property> <name>hadoop.zk.timeout-ms</name> <value>10</value> <description> ZooKeeper session timeout in milliseconds. Session expiration is managed by the ZooKeeper cluster itself and not by the client. This value is used by the cluster to determine when the client's session expires. Expirations happens when the cluster does not hear from the client within the specified session timeout period (that is, no heartbeat). </description> </property> <property> <name>hadoop.zk.acl</name> <value>world:anyone:rwcda</value> <description>ACLs to be used for setting permissions on ZooKeeper znodes.</description> </property>