What's New in Cloudera Data Flow for Data Hub

This section lists major features and updates for Cloudera Data Flow for Data Hub.

August 17, 2020

This release includes the Technical Preview for Streaming Analytics Heavy Duty cluster definitions. The Streaming Analytics Heavy Duty templates include Flink, Zookeeper, HDFS, and YARN with RocksDB as state backend. There are two template options added in this release:

  • Streaming Analytics Light Duty for Azure
  • Streaming Analytics Heavy Duty for AWS
  • Streaming Analytics Heavy Duty for Azure

July 31, 2020

Refresh on NiFi 1.11.4

The Flow Management templates available in CDP Public Cloud have been updated with a refreshed version of Apache NiFi 1.11.4. They include a number of dependency upgrades to ensure that NiFi successfully integrates with Cloudera Runtime components.

Technical Preview for Streaming Analytics clusters

This release includes the Technical Preview for Streaming Analytics cluster definitions. The Streaming Analytics templates include Flink, Zookeeper, HDFS and YARN. There are two template options available:

  • Streaming Analytics Light Duty for AWS

Streaming Analytics offers real-time stream processing and stream analytics with low-latency and high-scaling capabilities powered by Apache Flink.

The Streaming Analytics templates offer Apache Flink that works out of the box in secure environment on CDP Public Cloud. The following features are supported in Streaming Analytics for Data Hub:
  • Data source reading from Kafka
  • Data sinks writing to Kafka, HBase and Kudu
  • Apache Atlas integration
  • SQL/Table API and SQL Client