Enable Cloudera DataFlow for your environment

Before you can deploy flow definitions, you must enable Cloudera DataFlow for a Cloudera Public Cloud environment. Enabling Cloudera DataFlow prepares an active, healthy Cloudera Public Cloud environment for use with the service.

Before you begin, make sure that the following prerequisites are met:

  • You have a cloud provider account and meet the infrastructure and network requirements.
  • You have a healthy Cloudera Public Cloud environment, with FreeIPA and the data lake running and healthy.
  • You have the DFAdmin role for the Cloudera Public Cloud environment for which you want to enable Cloudera DataFlow.

  1. Navigate to Cloudera DataFlow by selecting DataFlow from the Cloudera Public Cloud Home Page.
  2. In Cloudera DataFlow, navigate to Environments. For the environment you want to enable, click Enable to launch the Enable Environment dialog.
  3. In the Enable Environment dialog, provide the following information:
    • DataFlow Capacity – Specify the minimum and maximum size of the Kubernetes cluster.
    • Networking
      • Specify whether a public endpoint should be deployed to access Cloudera DataFlow components via the internet.
      • Specify a list of source IP address ranges that are allowed to connect to the Kubernetes API server.
  4. Click Enable. Enabling Cloudera DataFlow can take up to one hour.
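
Because enabling can take up to one hour, it may be worth sanity-checking the values from step 3 before you click Enable. The following is a minimal sketch of such a check, not part of Cloudera DataFlow: the class and function names are hypothetical, and the rules it applies (non-negative sizes, maximum not below minimum, well-formed CIDR ranges) are assumptions rather than documented product constraints. It uses only the Python standard library.

```python
import ipaddress
from dataclasses import dataclass, field


@dataclass
class EnableEnvironmentSettings:
    """Hypothetical container for the values entered in the Enable Environment dialog."""
    min_nodes: int                      # DataFlow Capacity: minimum cluster size
    max_nodes: int                      # DataFlow Capacity: maximum cluster size
    public_endpoint: bool               # Networking: deploy a public endpoint?
    api_server_allowed_cidrs: list[str] = field(default_factory=list)


def validate_settings(settings):
    """Return a list of problems; an empty list means no problem was found.

    The checks are assumptions made for illustration, not product rules.
    """
    problems = []
    if settings.min_nodes < 0:
        problems.append("minimum cluster size must not be negative")
    if settings.max_nodes < settings.min_nodes:
        problems.append("maximum cluster size must not be below the minimum")
    for cidr in settings.api_server_allowed_cidrs:
        try:
            # strict=True rejects ranges with host bits set, such as 10.0.0.1/24
            ipaddress.ip_network(cidr, strict=True)
        except ValueError:
            problems.append(f"invalid IP address range: {cidr}")
    return problems


if __name__ == "__main__":
    candidate = EnableEnvironmentSettings(
        min_nodes=3,
        max_nodes=5,
        public_endpoint=False,
        api_server_allowed_cidrs=["203.0.113.0/24"],
    )
    for problem in validate_settings(candidate):
        print(problem)
```

The check runs entirely locally and does not contact Cloudera DataFlow; the values still have to be entered in the Enable Environment dialog.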

When you have finished enabling Cloudera DataFlow for an environment, proceed by giving users permission to import and deploy flow definitions.