QuickstartPDF version

Enable Cloudera Data Flow for your environment

Before you can deploy flow definitions, you must enable Cloudera Data Flow for a Cloudera on cloud environment. Enabling Cloudera Data Flow for an environment means that you are preparing an active and healthy Cloudera on cloud environment for use with Cloudera Data Flow.

  • You have a cloud provider account and meet the infrastructure and network requirements.
  • You have a healthy Cloudera on cloud environment, with FreeIPA and the data lake running and healthy.
  • You have the DFAdmin role for the Cloudera on cloud environment for which you want to enable Cloudera Data Flow.
  1. Navigate to Cloudera Data Flow, by selecting DataFlow from the Cloudera on cloud Home Page.
  2. In Cloudera Data Flow, navigate to Environments, and click Enable to launch the Enable Environment dialog for the environment you want to enable.
  3. From Enable Environment, provide the following information:
    • DataFlow Capacity – Specifies Kubernetes cluster minimum and maximum size
    • Networking
    • Specify whether a public endpoint should be deployed to access CDF components via the internet.
    • A list of source IP address ranges which are allowed to connect to the Kubernetes API server.
  4. Click Enable. Enabling CDF can take up to one hour.

When you have finished enabling Cloudera Data Flow for an environment, proceed by giving users permission to import and deploy flow definitions.

We want your opinion

How can we improve this page?

What kind of feedback do you have?