CDP Public Cloud: June 2024 Release Summary

The CDP Public Cloud Release Summary summarizes major features introduced in CDP Public Cloud Management Console, Data Hub, and data services.

Data Catalog

This release of Data Catalog contains only fixes and updates that prepare Data Catalog for the changes in the upcoming 3.0.0 release.

Data Engineering

This release (1.21.0-h2) of the Cloudera Data Engineering (CDE) service on CDP Public Cloud introduces the following changes.

Azure and AWS service backup and restore

Previously, if Airflow jobs were running on a Virtual Cluster (VC) when you performed a cluster backup and restore, the CDE service and the VC were restored, but the Airflow jobs on the VC were not.

This issue has been fixed. When you perform a backup and restore, the Airflow jobs are now restored as well.

In-place upgrade using custom runtime images and public Docker registry

Previously, if you performed an in-place upgrade from CDE version 1.19 or 1.20 to version 1.21 with a custom runtime image that uses a public Docker registry without a credential, the upgrade failed.

This issue has been fixed. You can now perform the in-place upgrade with a custom runtime image that uses a public Docker registry without a credential.

DataFlow

The June 2024 releases (2.8.0-h1-b1 and 2.8.0-h2-b2) of Cloudera DataFlow (CDF) on CDP Public Cloud introduce bug fixes only. See CDF Release Notes for more information.

Machine Learning

Version 2.0.45-b81 contains fixes only. Version 2.0.45-b82 introduces the following new feature:

  • You can now delete the Model Endpoint under AI Inference from the Cloudera Machine Learning UI.

For fixed issues, see CML release notes.

Management Console

This release of the Management Console service introduces the following changes:

Support for Hyderabad and Calgary AWS regions

The Asia Pacific Hyderabad AWS region and the Calgary AWS region are now supported. You can register CDP environments and provision Data Hubs in these regions. See updated Supported AWS regions.

Support for Madrid Google Cloud region

The Madrid Google Cloud region is now supported. You can register CDP environments and provision Data Hubs in that region. See updated Supported GCP regions.

Replication Manager

This release of the Replication Manager service introduces the following HBase replication policy enhancements:

Replicate data through a network load balancer (NLB)

During the HBase replication policy creation process, you can choose to specify an NLB if the source CDH 5.16.2 cluster uses the NLB to communicate with ZooKeeper and RegionServers of the destination Cloudera Manager of COD clusters.

Specify the NLB details after you enable the Select Destination > Replicate via a Network Load Balancer option during the HBase replication policy creation process.

For more information, see Support matrix for CDP Replication Manager and Creating HBase replication policies.

Specify maximum number of tables to process in parallel

You can specify the maximum number of tables to process in parallel during the initial snapshot export and import step of an HBase replication policy run. Set this number with the Initial Snapshot Settings > Maximum Parallel Snapshots option when you create the HBase replication policy. If you do not enter a value, Replication Manager chooses an appropriate value, based on the resources in the source and target clusters, to optimize performance.

For more information, see Support matrix for CDP Replication Manager and Creating HBase replication policies.