Managing Cloudera Data Science Workbench Hosts

This topic describes how to add and remove hosts on a Cloudera Data Science Workbench deployment.

Adding a Worker Node

Using Cloudera Manager

Perform the following steps to add a new worker node to Cloudera Data Science Workbench.
  1. Add a new host to your cluster. Make sure this is a gateway host and you are not running any services on this host. You might choose to create a host template for worker nodes that will be added to Cloudera Data Science Workbench in the future. The host template must include the Docker Daemon, Worker, and Spark 2 gateway roles.
  2. Assign the Spark 2 gateway role to the new host. For instructions, refer the Cloudera Manager documentation at Adding a Role Instance.
  3. Go to the Cloudera Data Science Workbench service.
  4. Click the Instances tab.
  5. Click Add Role Instances.
  6. Assign the Worker and Docker Daemon roles to the new host. Click Continue.
  7. Review your changes and click Continue. The wizard finishes by performing any actions necessary to prepare the cluster for the new role instances. Do not start the new roles at this point.
  8. The new host must have the following packages installed on it.
    nfs-utils
    libseccomp
    lvm2
    bridge-utils
    libtool-ltdl
    iptables   
    rsync 
    policycoreutils-python 
    selinux-policy-base 
    selinux-policy-targeted 
    ntp 
    ebtables 
    bind-utils 
    nmap-ncat  
    openssl 
    e2fsprogs 
    redhat-lsb-core 
    You can either manually install these packages now, or, go to the Configuration page and enable the Install Required Packages property to allow Cloudera Manager to install them for you as part of the Prepare Node command run in the next step.
  9. Go to the Instances page and select the new host. From the list of available actions, select the Prepare Node command to install the required packages on the new node.
  10. On the Instances page, select the new role instances and click Actions for Selected > Start.

Using Packages

For instructions on how to add a worker node to Cloudera Data Science Workbench, see Installing Cloudera Data Science Workbench on a Worker Node.

Removing a Worker Node

Using Cloudera Manager

Perform the following steps to remove a worker node from Cloudera Data Science Workbench.
  1. Log into the Cloudera Manager Admin Console.
  2. Click the Instances tab.
  3. Select the Docker Daemon and Worker roles on the node to be removed from Cloudera Data Science Workbench.
  4. Select Actions for Selected > Stop and click Stop to confirm the action. Click Close when the process is complete.
  5. On the Instances page, re-select the Docker Daemon and Worker roles that were stopped in the previous step.
  6. Select Actions for Selected > Delete and then click Delete to confirm the action.

Using Packages

To remove a worker node:
  1. On the master node, run the following command to delete the worker node:
    kubectl delete node <worker_node_domain_name>
  2. Reset the worker node.
    cdsw reset

Changing the Domain Name

Cloudera Data Science Workbench allows you to change the domain of the web console.

Using Cloudera Manager

  1. Log into the Cloudera Manager Admin Console.
  2. Go to the Cloudera Data Science Workbench service.
  3. Click the Configuration tab.
  4. Search for the Wildcard DNS Domain property and modify the value to reflect the new domain.
  5. Click Save Changes.
  6. Restart the Cloudera Data Science Workbench service to have the changes go into effect.

Using Packages

  1. Open /etc/cdsw/config/cdsw.conf and set the DOMAIN variable to the new domain name.
    DOMAIN="new-cdsw.company.com"
  2. Run the following commands to have the new domain name go into effect.
    cdsw reset
    cdsw init