Perform the recovery
After you have identified a reference master, you need to copy the master data to the replacement master node. You need to bring the Kudu clusters down. Therefore, identify at least a one-hour maintenance window for this task.
To verify that all masters are working properly, consider performing the
following sanity checks:
-
Using a browser, visit each master’s web UI and navigate to the
/masters
page. All the masters should now be listed there with one master in theLEADER
role and the others in theFOLLOWER
role. The contents of/masters
on each master should be the same. -
Run a Kudu system check (
ksck
) on the cluster using thekudu
command line tool.