Troubleshooting Multi-AZ deployments

Troubleshoot various scenarios which you might encounter while deploying Multi-AZ on your COD environment.

Unable to join the cluster automatically

Condition

HBase Region Servers do not join the cluster automatically after the availability zones are recovered.

Cause

The availability zones and servers are offline for too long and the Master and Region Server processes are stopped.

Solution

  1. Log in to Cloudera Manager.
  2. Restart the Master and Region Server processes.

OMID service fails to recover

Condition

OMID service is failing to recover after the availability zones are down.

Cause

Root cause of this problem is yet to be identified.

Solution

  1. Log in to Cloudera Manager.
  2. Restart the OMID service.