DataFlow Deployments provides a cloud-native runtime to run your Apache NiFi flows through auto- scaling Kubernetes clusters. It also provides a centralized monitoring and alerting capability that results in improved Software Development Life Cycle (SDLC) for developers.
Get started with Cloudera DataFlow quickly by walking through a few simple steps. Ensure that administrators have access to CDP, enable DataFlow for an environment, give users access to CDF, and then add to the Catalog and deploy the Hello World ReadyFlow.
Learn about the infrastructure requirements of CDP and Cloudera DataFlow with supported cloud providers.
You can enable the DataFlow service for any supported environment you have registered with CDP. Once DataFlow has been enabled for an environment, you can start deploying flow definitions to it.
ReadyFlows are predefined, out-of-the-box data flows that can be immediately deployed by providing a set of required parameters.
Before you can deploy a flow definition in Cloudera DataFlow (CDF), you need to develop your data flow logic in a development environment using Apache NiFi. Download the NiFi flow as a flow definition, import it to Cloudera DataFlow and finally, deploy the flow. The flow definition acts as a configuration logic for your flow deployments. It enables you to deploy your data flow without the need to maintain cluster infrastructure. You can deploy the same flow to multiple environments in Cloudera DataFlow.
Your flow definitions are stored in the Catalog. This is where you manage the CDF flow definition lifecycle from import through versioning to deletion.
Inbound Connection Endpoints allow you to stream data from an external source application to a DataFlow deployment.
Learn about DataFlow security features and tasks, including authorization.
Learn about identifying and addressing errors throughout the lifecycle of a DataFlow deployment.