August 2023

This release of the Data Hub service introduces the following changes:

Support for Local SSDs in GCP (August 21, 2023)

Data Hub now supports using Local SSDs as storage in GCP. During Data Hub cluster creation in CDP, you can navigate to advanced Hardware and Storage options and select to use "Local scratch disk (SSD)" with certain instance types. Prior to using Local SSDs with CDP, make sure to review Google Cloud Platform documentation related to Local SSDs and familiarize yourself with the applicable restrictions and limitations (such as 12 TB maximum capacity, limited configurations).

Data Hub database upgrade and default major version change (August 30, 2023)

Newly-deployed Data Hub clusters on AWS or GCP with Cloudera Runtime 7.2.7 or above are now configured to use a PostgreSQL version 14 database by default.

Newly-deployed Data Hub clusters on Azure with Cloudera Runtime 7.2.7 or above will continue to use a PostgreSQL version 11 database by default.

The database for Data Hub clusters on AWS and GCP can now be upgraded to PostgreSQL version 14. If your AWS or GCP Data Hub cluster requires an upgrade to PostgreSQL 14, you will receive a notification in the Management Console UI.

Cloudera strongly recommends that the database upgrade to PostgreSQL 14 for AWS and GCP clusters is performed on all clusters running PostgreSQL version 11 by November 9, 2023.

A database upgrade to PostgreSQL 14 for Azure Data Hubs will be available in the future. Any Data Hub clusters on Azure that require a database upgrade will be upgraded from PostgreSQL 10 to PostgreSQL 11.

For more information, see Upgrading Data Lake/Data Hub database

Support for autoscaling Data Hub clusters on Azure (August 30, 2023)

Data Hub now supports autoscaling for clusters provisioned in Azure. For more information see Autoscaling clusters.