Cloudera Data Science Workbench Administration Guide
- Go to the Cloudera Data Science Workbench web application (http://cdsw.company.com) and log in as a site administrator.
- On the left sidebar, click Admin. You will see an array of tabs for all the tasks you can perform as a site administrator.
The rest of this topic describes some common tasks for a Cloudera Data Science Workbench site administrator.
Managing Users
As a site administrator you can add new users, assign or modify privileges for existing users, and monitor user activity on the Cloudera Data Science Workbench deployment.
Adding New Users
To invite new users, navigate to the Invite. This tab will show you a list of all outstanding invitations. Once an invitation has been accepted, the record will no longer show up on this page. The Users tab also displays a list of users of the application. Click on a username to see more details about the user.
tab. Under Invitations, enter the name or email ID of the person you want to invite and clickIf you want new users to join by invitation only, go to the Require invitation to sign up checkbox to require invitation tokens for account creation. By default, invitations are sent from noreply@your-cdsw-domain. To modify this default, see Setting up Email Notifications.
tab and check theAssigning the Site Administrator role to an Existing User
- Sign in to Cloudera Data Science Workbench as a site administrator.
- Click Admin.
- Click the Users tab.
- Click on the username of the user who you want to make a site administrator.
- Select the Site Administrator checkbox.
- Click Update.
Disabling User Accounts
Use the following instructions to disable user accounts. Note that disabled users cannot login and do not count towards named users for licensing.
- Sign in to Cloudera Data Science Workbench as a site administrator.
- Click Admin.
- Click the Users tab.
- Click on the username of the user who you want to disable.
- Select the Disabled checkbox.
- Click Update.
Monitoring Users
The Users tab on the admin dashboard displays the complete list of users. You can see which users are currently active, and when a user last logged in to the Cloudera Data Science Workbench. To modify a user's username, email or permissions, click the Edit button under the Action column.
Monitoring Site Usage
The
tab displays basic information about your deployment, such as the number of users signed up, the number of teams and projects created, memory used, and some average job scheduling and run times. You can also see the version of Cloudera Data Science Workbench you are currently running.On the
tab of the dashboard, you can see the status for all the previous and current sessions running on the workbench.Managing Engine Profiles
On the
page, under Engines Profiles, you can provide default engine profiles to users. These engine profiles provide default CPU and memory configurations for sessions and jobs. Cloudera recommends that all profiles include at least 2 GB of RAM to avoid out of memory errors for common user operations.Adding Custom Engine Images
Cloudera Data Science Workbench site administrators can add libraries and other dependencies to the Docker image in which their engines run. You can whitelist images for use in projects on the
page, under the Engine Images section. Currently, Cloudera Data Science Workbench only supports public Docker images in registries accessible to the Cloudera Data Science Workbench nodes.Project administrators will need to explicitly select which of these white-listed images is installed for their projects. For an example, see Customizing Engine Images.
Customizing the Engine Environment
On the
page, go to the Environmental Variables section to provide values for global environment variables that must be injected into all engines across the deployment.Setting JAVA_HOME
By default, Cloudera Data Science Workbench will attempt to find your Java home. However, if you are using a non-standard path for Java, add JAVA_HOME under the Environmental Variables section. This will set JAVA_HOME and mount the JAVA_HOME directory into engines so that the Java version matches other CDH services.
Configuring External Authentication
Cloudera Data Science Workbench supports external authentication using LDAP and SAML protocols. You can configure LDAP or SAML authentication under the Configuring External Authentication.
tab by following the instructions atSetting up Email Notifications
Go to the
tab to specify an email address for outbound invitations and job notifications.By default, all emails are sent from noreply@your-cdsw-domain. However, if your SMTP domain is different from the Cloudera Data Science Workbench domain, or it does not allow spoofing, you will need to explicitly specify the email address at the No Reply Email field.
Currently, Cloudera Data Science Workbench only sends email notifications when you add teammates to a project, not when you create a new project. Email preferences cannot currently be configured at an individual user level.
Managing License Keys
Cloudera Data Science Workbench is subject to the same license and subscription model as Cloudera Enterprise. To upload a license key, go to the Managing License Keys for Cloudera Data Science Workbench.
tab. For details on the types of licenses and detailed instructions for how to upload a new license key, seeDisabling Analytics Tracking
To help improve the product, Cloudera Data Science Workbench by default collects aggregate usage data by sending limited tracking events to Google Analytics and Cloudera servers. No customer data or personal information is sent as part of these bundles. To disable analytics tracking, go to Send usage data to Cloudera checkbox.
and uncheck the