CDP Public Cloud Preview Features

The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.

Onboarding

cdpctl
published: 2021-08-30; modified: 2021-09-15
The cdpctl CLI provides the ability to check your cloud provider configuration and verify that it is ready to be used with CDP Public Cloud to register a CDP environment.

Data Hub

Upgrading Data Hubs
published: 2021-10-29; modified: 2021-10-26
You can upgrade a Data Hub cluster in one of three ways: Runtime and Cloudera Manager major/minor version upgrades, maintenance/“hotfix” upgrades, and OS upgrades.
Fine-grained Access Control from S3 File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.
Fine-grained Access Control from ABFS File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue.

Data Warehouse

Managed storage access for Azure
published: 2021-10-21; modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for Azure.
Managed storage access for AWS
published: 2021-10-21; modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for AWS.
Add Access to External S3 Buckets for CDW Clusters on AWS
published: 2021-05-12; modified: 2021-05-25
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS.
Specifying Custom Environment Names in Cloudera Data Warehouse
published: 2021-04-14; modified: 2021-04-14
Learn how to set custom environment names for your AWS or Azure cloud resources in CDW.
Enable SSO for JDBC/ODBC Connections to Virtual Warehouses
published: 2021-05-21; modified: 2021-05-25
Enable single sign-on (SSO) for third-party BI tool connections to Virtual Warehouses that use JDBC and ODBC.
Hue: The Next Generation SQL Assistant for Hive in CDW
published: 2021-06-02; modified: 2021-06-10
The one-stop SQL assistant for Hive/LLAP workloads in CDW with combined capabilities of Data Analytics Studio (DAS) and Hue.
Enabling Multi-tenancy in Cloudera Data Warehouse
published: 2021-06-02; modified: 2021-06-02
Achieve tenant isolation by creating a multi-tenant environment in CDW.
Configure Impala Virtual Warehouses on AWS Environments to Spill to S3
published: 2021-06-14; modified: 2021-06-14
Impala Virtual Warehouses on AWS environments can now be configured to write temporary data (spill) to S3 by specifying the S3 URI when you are creating the Virtual Warehouse.
Enabling private CDW environment using Azure Kubernetes Service
published: 2021-05-03; modified: 2021-07-29
Azure Kubernetes Service (AKS) simplifies container-based application deployment and management.
Visualizing Data in Cloudera Data Warehouse Public Cloud
published: 2021-08-27; modified: 2021-08-27
CDW integrates Data Visualization for building graphic representations of data, dashboards, and visual applications.
Azure Spot instances for Virtual Warehouses
published: 2021-09-28; modified: 2021-09-28
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance.

Governance

Support for CDP Private Cloud Base clusters in Data Catalog
published: 2021-08-06; modified: 2021-08-06
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters.
Supporting High Availability for Profiler services
published: 2021-08-07; modified: 2021-08-07
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature.
Navigating to tables and databases in Hue using Data Catalog
published: 2021-08-07; modified: 2021-08-07
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services.
Integrating CDP Data Catalog with AWS Glue Data Catalog
published: 2021-08-09; modified: 2021-08-09
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities.

Machine Learning

Experiments with MLflow
published: 2021-10-27; modified: 2021-10-27
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments.
CMK Encryption on AWS
published: 2021-08-10; modified: 2021-11-30
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.
ML Discovery & Exploration
published: 2021-08-31; modified: 2021-09-16
Cloudera Machine Learning Discovery and Exploration accelerates the ML development workflow with preconfigured data connections and readily available code snippets.

Management Console

Data Lake Upgrade
published: 2021-06-03; modified: 2021-10-27
When new versions of Cloudera Runtime/Cloudera Manager are available for the Data Lake service, you can initiate a Data Lake upgrade. You may also have the option to upgrade to a new OS image.
Workload Password Policies
published: 2021-04-27; modified: 2021-10-28
In order to bring your workload password complexity requirements in line with company policy, you can manage your FreeIPA password policies via CDP CLI.
Using Customer Managed Keys for Encrypting Azure Managed Disks and Database
published: 2021-05-13; modified: 2021-11-24
By default, local disks attached to Azure VMs are encrypted with server-side encryption (SSE) using Platform Managed Keys (PMK). This feature introduces CDP support for SSE with Customer Managed Keys (CMK) for CDP environments (Data Lake and FreeIPA) and Data Hubs.
Using Customer Managed Encryption Keys for Encrypting GCP Disks and Database
published: 2021-11-17; modified: 2021-11-17
By default, a Google-managed encryption key is used to encrypt disks and the Cloud SQL database used by the Data Lake, FreeIPA, and Data Hubs, but during environment registration you can optionally configure CDP to use a customer-managed encryption key (CMEK) instead.
Fine-grained Access Control for Amazon S3
published: 2021-06-15; modified: 2021-06-15
Ranger Authorization Service (RAZ) for Amazon S3 applies Ranger's fine-grained access control policies to CDP's access to Amazon S3 containers, directories, and files and can be controlled with admin-level access to CDP alone when enabled in a AWS CDP Public Cloud environment.
Fine-grained Access Control for ADLS Gen2
published: 2021-08-10; modified: 2021-09-07
Ranger Authorization Service (RAZ) for Azure Data Lake Storage (ADLS) Gen2 applies Ranger's fine-grained access control policies to CDP's access to ADLS Gen2 containers, directories, and files and can be controlled with admin-level access to CDP alone.
Public Endpoint Access Gateway for Azure
published: 2021-07-23; modified: 2021-07-27
You can enable Public Endpoint Access Gateway for Azure during Azure environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.