CDP Public Cloud Preview Features

The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.

Data Hub

Fine-grained Access Control from ABFS File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue.
Fine-grained Access Control from S3 File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.

Data Engineering

CDE In-place Upgrades
published: 2021-04-27; modified: 2021-04-27
Cloudera Data Engineering (CDE) now supports upgrades from CDE 1.14 on both AWS and Azure.

DataFlow

CDF Service Upgrade
published: 2022-06-28; modified: 2022-06-28
Cloudera DataFlow (CDF) now supports upgrades from CDF 2.0.0 on both AWS and Azure.

Data Warehouse

Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for AWS environments
published: 2022-04-27; modified: 2022-04-27
Learn how to enable fine-grained access to S3 buckets from Hue in Cloudera Data Warehouse.
Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for Azure environments
published: 2022-04-27; modified: 2022-04-27
Learn how to enable fine-grained access to ADLS Gen2 containers from Hue in Cloudera Data Warehouse.
Add Access to External S3 Buckets for CDW Clusters on AWS
published: 2021-05-12; modified: 2021-05-25
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS.
Azure Spot instances for Virtual Warehouses
published: 2021-09-28; modified: 2021-09-28
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance.
Enable SSO for JDBC/ODBC Connections to Virtual Warehouses
published: 2021-05-21; modified: 2021-05-25
Enable single sign-on (SSO) for third-party BI tool connections to Virtual Warehouses that use JDBC and ODBC.
Managed Storage Access for AWS
published: 2021-10-21; modified: 2022-02-01
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for AWS.
Managed storage access for Azure
published: 2021-10-21; modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for Azure.

Governance

Integrating CDP Data Catalog with AWS Glue Data Catalog
published: 2021-08-09; modified: 2021-12-08
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities.
Navigating to tables and databases in Hue using Data Catalog
published: 2021-08-07; modified: 2021-08-07
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services.
Support for CDP Private Cloud Base clusters in Data Catalog
published: 2022-02-24; modified: 2022-04-06
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters.
Supporting High Availability for Profiler services
published: 2021-08-07; modified: 2021-08-07
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature.
Transitioning Profiler Manager Service into SDX
published: 2022-02-24; modified: 2022-02-24
The Profiler Manager Service is moved to the SDX infrastructure.
Using the Download CSV option
published: 2022-02-24; modified: 2022-02-24
Using the selected data lake, the search result for the current query can be downloaded.

Machine Learning

PBJ Workbench
published: 2022-04-21; modified: 2022-04-28
The PBJ Workbench features a Jupyter Notebook editor pre-packaged with a runtime image. Data Scientists can easily choose this runtime image when launching a session, and then they can use the familiar Jupyter environment in their Cloudera Machine Learning workspace.
Data Discovery and Exploration
published: 2022-04-21; modified: 2022-04-21
Data Discovery and Exploration enables you to connect to data sources, explore them with SQL commands, and build visualizations and dashboards with that data, all from within CML.
ML Workspace Backup and Restore
published: 2022-02-10; modified: 2022-05-09
Cloudera Machine Learning Workspace Backup and Restore enables you to backup and restore workspace data and metadata to protect against system failures.
Private Cluster Support
published: 2022-01-06; modified: 2022-01-06
Private Clusters provide a simple way to create a secure cluster, where the API server and the workloads themselves only rely on private IP addresses that are not accessible from the internet.
Experiments with MLflow
published: 2021-10-27; modified: 2021-10-27
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments.
CMK Encryption on AWS
published: 2021-08-10; modified: 2022-02-10
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.

Management Console

Azure VM Encryption at Host
published: 2022-06-06; modified: 2022-06-06
Description: You can optionally enable encryption at host for Data Lake, FreeIPA, and Data Hubs. Currently, you need to enable it individually for each Virtual Machine (VM) on Azure Portal.
Data Lake Scaling
published: 2022-05-11; modified: 2022-05-11
Data Lake scaling is the process of scaling up a light duty Data Lake to the medium duty form factor, which has greater resiliency than light duty and can service a larger number of clients.
New UI for adding a CDP Private Cloud Base cluster
published: 2022-03-29; modified: 2022-03-29
Register a CDP Private Cloud Base cluster as a classic cluster using Cloudera Manager and Knox endpoints so that you can use this cluster in Replication Manager and Data Catalog services.
Public Endpoint Access Gateway for GCP
published: 2021-12-17; modified: 2021-12-17
You can enable Public Endpoint Access Gateway for GCP during GCP environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.

Replication Manager

Snapshot Policies in Replication Manager
published: 2022-02-25; modified: 2022-02-25
You can create HDFS and HBase snapshot policies in Replication Manager to schedule taking snapshots of snapshottable HDFS directories and HBase tables at regular intervals. An HDFS directory is snapshottable after it has been enabled for snapshots, or because a parent directory is enabled for snapshots in Cloudera Manager.