CDP Public Cloud Preview Features
The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.
- Fine-grained Access Control from ABFS File Browser in Hue
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue.
- Fine-grained Access Control from S3 File Browser in Hue
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.
- Using Custom Spark Runtime Docker Images via API/CLI
Learn how to run Spark jobs using custom Spark runtime Docker images via API/CLI.
- CDE In-place Upgrades
Cloudera Data Engineering (CDE) now supports upgrades from CDE 1.14 on both AWS and Azure.
- CDF Service Upgrade
Cloudera DataFlow (CDF) now supports upgrades from CDF 2.0.0 on both AWS and Azure.
- Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for AWS environments
Learn how to enable fine-grained access to S3 buckets from Hue in Cloudera Data Warehouse.
- Enabling Fine-grained Access Control for Hue in Cloudera Data Warehouse for Azure environments
Learn how to enable fine-grained access to ADLS Gen2 containers from Hue in Cloudera Data Warehouse.
- Add Access to External S3 Buckets for CDW Clusters on AWS
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS.
- Azure Spot instances for Virtual Warehouses
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance.
- Reserving nodes for auto-scaling
To speed up Virtual Warehouse startup and autoscaling, keep some number of compute instances on standby. You configure extra buffer nodes to stand by, ready to join a new compute or autoscaled cluster.
- Integrating CDP Data Catalog with AWS Glue Data Catalog
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities.
- Navigating to tables and databases in Hue using Data Catalog
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services.
- Support for CDP Private Cloud Base clusters in Data Catalog
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters.
- Supporting High Availability for Profiler services
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature.
- Transitioning Profiler Manager Service into SDX
The Profiler Manager Service is moved to the SDX infrastructure.
- Using the Download CSV option
Using the selected data lake, the search result for the current query can be downloaded.
- PBJ Workbench
The PBJ Workbench features a Jupyter Notebook editor pre-packaged with a runtime image. Data Scientists can easily choose this runtime image when launching a session, and then they can use the familiar Jupyter environment in their Cloudera Machine Learning workspace.
- Private Cluster Support
Private Clusters provide a simple way to create a secure cluster, where the API server and the workloads themselves only rely on private IP addresses that are not accessible from the internet.
- Experiments with MLflow
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments.
- CMK Encryption on AWS
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.
- Azure VM Encryption at Host
Description: You can optionally enable encryption at host for Data Lake, FreeIPA, and Data Hubs. Currently, you need to enable it individually for each Virtual Machine (VM) on Azure Portal.
- Data Lake Scaling
Data Lake scaling is the process of scaling up a light duty Data Lake to the medium duty form factor, which has greater resiliency than light duty and can service a larger number of clients.
- New UI for adding a CDP Private Cloud Base cluster
Register a CDP Private Cloud Base cluster as a classic cluster using Cloudera Manager and Knox endpoints so that you can use this cluster in Replication Manager and Data Catalog services.
- Public Endpoint Access Gateway for GCP
You can enable Public Endpoint Access Gateway for GCP during GCP environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.
- Snapshot Policies in Replication Manager
You can create HDFS and HBase snapshot policies in Replication Manager to schedule taking snapshots of snapshottable HDFS directories and HBase tables at regular intervals. An HDFS directory is snapshottable after it has been enabled for snapshots, or because a parent directory is enabled for snapshots in Cloudera Manager.