Performing maintenance of a single host in the Embedded Container Service cluster

You can perform maintenance on the nodes in your ECS cluster by shutting down the nodes one at a time.

  • The containerized cluster must be configured for ECS Server high availability to reduce the downtime.
  • You must be able to log into the nodes as root or have sudo privileges.
  • The node to be maintained must have a status of Ready. A status of NotReady may suggest the node is having other complicating issues. Run the following command on an ECS server node to verify status of the nodes.
    /var/lib/rancher/rke2/bin/kubectl --kubeconfig=/etc/rancher/rke2/rke2.yaml get nodes
  1. Log in to the Cloudera Manager Admin Console.
  2. Stop the ECS role and the Docker server role on the host.
    • Click the Hosts tab.
    • Select the Host->Action->Stop roles on the host.
  3. Perform the maintenance on the host.
  4. Reboot the host.
  5. Log in to the Cloudera Manager Admin Console.
  6. Click the Action menu next to the ECS cluster and select Start roles on the host.
  7. Click Actions > Refresh ECS Cluster.
  8. Go to the ECS service page and verify that the Vault is not sealed. This information displays in the Health Tests section.
  9. If the Vault is sealed, click Actions > Unseal Vault.

Performing the maintenance of all hosts in the Embedded Container Service cluster

If you want to perform the maintenance of all hosts in the ECS cluster follow below steps:

  1. Log in to the Cloudera Manager Admin Console.
  2. Click the Action menu next to the ECS cluster and select Stop.
  3. Perform the maintenance on all the hosts.
  4. Reboot the hosts.
  5. Log in to the Cloudera Manager Admin Console.
  6. Click the Action menu next to the ECS cluster and select Start.
  7. Click Actions > Refresh ECS Cluster.
  8. Go to the ECS service page and verify that the Vault is not sealed. This information displays in the Health Tests section.
  9. If the Vault is sealed, click Actions > Unseal Vault.