Creating Data Hub Clusters in Hybrid Environments

Learn how to create Cloudera Data Hub clusters in hybrid environments.

After registering your hybrid environment, you can create Cloudera Data Hub clusters that enable cloud bursting from your on-premises environment.

You need one of the following roles:

  • DataHubCreator
  • EnvironmentAdmin at the scope of the environment where the Cloudera Data Hub cluster is running, or
  • Owner of the environment

  1. Navigate to your hybrid environment.
  2. Select the Data Hubs tab on the environment details page.
  3. Click Create Data Hub.
  4. Select the Hybrid Data Engineering: HA: Apache Spark3, Apache Hive definition. The definition automatically matches the version of the on-premises base cluster.
  5. Provide the Cluster Name.

    The name must be 5 to 40 characters long, must start with a letter, and can include only lowercase letters, numbers, and hyphens.

  6. Optionally, add tags that Cloudera Data Hub should apply to your cloud resources.

    Click Add, and then enter a key and a value for the tag. Repeat this step for each additional tag. For more information about tags, refer to Tags in our documentation.

  7. Optionally, click Advanced Options to modify advanced cluster settings. For more information on these options, refer to Advanced cluster options.
  8. Optionally, enable Autoscaling on this cluster to leverage cloud elasticity.
  9. Click Provision Cluster.

You will be redirected to the Cloudera Data Hub cluster dashboard. When your cluster is ready, its status will change to Running.
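
If you script cluster creation or want to check a name before entering it, the cluster name rule from the Cluster Name step can be expressed as a regular expression. The following is a minimal sketch, not part of the Cloudera product: the function name `is_valid_cluster_name` and the pattern are illustrative assumptions derived only from the rule stated above (5 to 40 characters, starts with a letter, only lowercase letters, numbers, and hyphens). The rule does not say whether a name may end with a hyphen, so this sketch allows it.

```python
import re

# Assumed pattern built from the documented rule:
#   - first character: a lowercase letter
#   - remaining 4 to 39 characters: lowercase letters, digits, or hyphens
#   - total length: 5 to 40 characters
_CLUSTER_NAME_PATTERN = re.compile(r"[a-z][a-z0-9-]{4,39}")


def is_valid_cluster_name(name: str) -> bool:
    """Return True if the name satisfies the documented naming rule.

    Hypothetical helper for illustration; the console performs its own
    validation, which may apply additional checks not described here.
    """
    # fullmatch requires the whole string to match, so trailing
    # newlines or extra characters are rejected.
    return _CLUSTER_NAME_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["hybrid-de-01", "abcd", "1cluster", "Hybrid-DE"]:
        print(candidate, is_valid_cluster_name(candidate))
```

A local check like this only catches formatting mistakes early. The console remains the authority on whether a name is accepted.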