Getting started as an admin

Refer to this section if you are a CDP admin who is getting started with CDP.


Accessing CDP for the first time

Access the CDP web interface at https://console.cdp.cloudera.com. The first time you log in, use your Salesforce credentials.

Setting up an identity provider

In order to add users from your organization to CDP, set up your identity provider. For instructions, refer to Onboarding users.

Setting up an environment

Register an environment for your organization. An environment determines the specific cloud provider region and virtual network in which resources can be provisioned, and includes the credential that should be used to access the cloud provider account. For instructions, refer to Environments documentation.

Creating environment groups and adding users

Once your environment is up and running, you should create a new group for the environment or assign an existing group to this environment, add users to this group, and then perform user sync. For instructions, refer to Enabling admin and user access to environments.

Onboarding users and groups for cloud storage

The minimal setup for cloud storage defined in environment prerequisites spins up a CDP environment and Data Lake with no end user access to cloud storage. Adding users and groups to a CDP cluster involves ensuring they are properly mapped to IAM roles to access cloud storage. For instructions, refer to Onboarding CDP users and groups for cloud storage.
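Before working through the onboarding instructions, it can help to check which groups still lack a role mapping. The following is a minimal sketch of that check, not CDP's own mechanism: it assumes an AWS environment, and the group names, account ID, role ARNs, and the `unmapped_groups` helper are all hypothetical and exist only for illustration.

```python
import re

# Hypothetical group names and IAM role ARNs, for illustration only.
GROUP_TO_ROLE = {
    "data-engineers": "arn:aws:iam::123456789012:role/cdp-data-engineer-role",
    "data-analysts": "arn:aws:iam::123456789012:role/cdp-data-analyst-role",
}

# General shape of an AWS IAM role ARN:
# arn:<partition>:iam::<12-digit account ID>:role/<role name or path>
ROLE_ARN = re.compile(r"^arn:aws[a-z-]*:iam::\d{12}:role/[\w+=,.@/-]+$")


def unmapped_groups(groups, mapping):
    """Return, sorted, the groups that have no role or a malformed role ARN."""
    return sorted(g for g in groups if not ROLE_ARN.match(mapping.get(g, "")))


if __name__ == "__main__":
    # Groups that still need a mapping before they can reach cloud storage.
    print(unmapped_groups(["data-engineers", "data-scientists"], GROUP_TO_ROLE))
```

A group reported by this check would have no end user access to cloud storage until it is mapped to a role as described in the onboarding instructions.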

Setting up Ranger authorization for your Data Lake

Once your environment is up and running, you should log in to Ranger and create policies for access to specific tables and databases. You can either log in to Hive first and create resources and then create policies for them in Ranger, or you can create Ranger policies in advance.

For instructions on how to access your Data Lake cluster, refer to Accessing Data Lake services. For instructions on how to set up authorization in Ranger, refer to Using Ranger to provide authorization documentation.

Provisioning compute resources

After performing these steps, you are ready to start provisioning compute resources (Data Hub clusters, Data Warehouses, and Machine Learning workspaces). For more information, refer to the Data Hub, Data Warehouse, and Machine Learning documentation.

Registering your existing clusters

You can optionally register your existing CDH, HDP, or HDF clusters in CDP if you would like to generate a workload, data movement, and compute capacity plan and replicate your data. For instructions, refer to Managing classic clusters.