Defining your Cloudera on cloud data flow

To move data between cloud environments using NiFi site-to-site communication, you need a data flow in Cloudera on cloud that can receive data from the Cloudera Base on premises data flow. To create this data flow, configure a process group, an input port, and an output port.

Before you begin, ensure that you have prepared your clusters, set up your network configurations, and configured your truststores.

  1. From your Cloudera on cloud NiFi cluster, create a Process Group that performs the operations you want on the data received from, and returned to, the Cloudera Base on premises cluster.
  2. Drag an Input Port onto the NiFi canvas.
    You must use this port to receive data from the Cloudera Base on premises NiFi cluster.
  3. Drag an Output Port onto the NiFi canvas.
    You must use this port to make data available for download to the Cloudera Base on premises cluster.
  4. Connect your Cloudera on cloud data flow components.

    Ensure that you have specified the public endpoints of your NiFi nodes in the Cloudera on cloud cluster.

  5. Start your data flow and ensure that both the input and output ports are running.
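If you prefer to script steps 2, 3, and 5 instead of using the canvas, the request bodies can be sketched as follows. This is a minimal sketch, not a definitive implementation: it assumes the NiFi REST API accepts a port entity at `POST /nifi-api/process-groups/{id}/input-ports` (or `/output-ports`) and a run-status entity at `PUT /nifi-api/input-ports/{id}/run-status`. The helper names, port names, and canvas positions are hypothetical, and the sketch only builds and checks JSON structures; it makes no network calls and does not handle authentication.

```python
import json


def port_payload(name, x=0.0, y=0.0):
    """Build the body for creating an input or output port.

    Assumption: a new component is created with revision version 0.
    """
    return {
        "revision": {"version": 0},
        "component": {"name": name, "position": {"x": x, "y": y}},
    }


def run_status_payload(version, state="RUNNING"):
    """Build the body for changing a port's run status.

    `version` is the current revision of the port, as returned by NiFi.
    """
    return {"revision": {"version": version}, "state": state}


def ports_running(port_entities):
    """Return True only if every given port entity reports RUNNING."""
    return all(e["component"]["state"] == "RUNNING" for e in port_entities)


if __name__ == "__main__":
    # Hypothetical port names, for illustration only.
    print(json.dumps(port_payload("From on premises"), indent=2))
    print(json.dumps(port_payload("To on premises", x=400.0), indent=2))
    print(json.dumps(run_status_payload(version=1), indent=2))
```

A check such as `ports_running` mirrors step 5: pass it the input and output port entities fetched from your cluster and confirm that it returns `True` before you continue.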

After you complete your Cloudera on cloud data flow, configure Apache Ranger to allow NiFi site-to-site communication.