CDP Public Cloud Preview Features
The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.
The cdpctl CLI provides the ability to check your cloud provider configuration and verify that it is ready to be used with CDP Public Cloud to register a CDP environment.
- Upgrading Data Hubs
You can upgrade a Data Hub cluster in one of three ways: Runtime and Cloudera Manager major/minor version upgrades, maintenance/“hotfix” upgrades, and OS upgrades.
- Fine-grained Access Control from S3 File Browser in Hue
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.
- Fine-grained Access Control from ABFS File Browser in Hue
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue.
- Managed storage access for Azure
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for Azure.
- Managed storage access for AWS
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for AWS.
- Add Access to External S3 Buckets for CDW Clusters on AWS
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS.
- Specifying Custom Environment Names in Cloudera Data Warehouse
Learn how to set custom environment names for your AWS or Azure cloud resources in CDW.
- Enable SSO for JDBC/ODBC Connections to Virtual Warehouses
Enable single sign-on (SSO) for third-party BI tool connections to Virtual Warehouses that use JDBC and ODBC.
- Hue: The Next Generation SQL Assistant for Hive in CDW
The one-stop SQL assistant for Hive/LLAP workloads in CDW with combined capabilities of Data Analytics Studio (DAS) and Hue.
- Enabling Multi-tenancy in Cloudera Data Warehouse
Achieve tenant isolation by creating a multi-tenant environment in CDW.
- Configure Impala Virtual Warehouses on AWS Environments to Spill to S3
Impala Virtual Warehouses on AWS environments can now be configured to write temporary data (spill) to S3 by specifying the S3 URI when you are creating the Virtual Warehouse.
- Enabling private CDW environment using Azure Kubernetes Service
Azure Kubernetes Service (AKS) simplifies container-based application deployment and management.
- Visualizing Data in Cloudera Data Warehouse Public Cloud
CDW integrates Data Visualization for building graphic representations of data, dashboards, and visual applications.
- Azure Spot instances for Virtual Warehouses
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance.
- Support for CDP Private Cloud Base clusters in Data Catalog
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters.
- Supporting High Availability for Profiler services
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature.
- Navigating to tables and databases in Hue using Data Catalog
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services.
- Integrating CDP Data Catalog with AWS Glue Data Catalog
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities.
- Experiments with MLflow
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments.
- CMK Encryption on AWS
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.
- ML Discovery & Exploration
Cloudera Machine Learning Discovery and Exploration accelerates the ML development workflow with preconfigured data connections and readily available code snippets.
- Data Lake Upgrade
When new versions of Cloudera Runtime/Cloudera Manager are available for the Data Lake service, you can initiate a Data Lake upgrade. You may also have the option to upgrade to a new OS image.
- Workload Password Policies
In order to bring your workload password complexity requirements in line with company policy, you can manage your FreeIPA password policies via CDP CLI.
- Using Customer Managed Keys for Encrypting Azure Managed Disks and Database
By default, local disks attached to Azure VMs are encrypted with server-side encryption (SSE) using Platform Managed Keys (PMK). This feature introduces CDP support for SSE with Customer Managed Keys (CMK) for CDP environments (Data Lake and FreeIPA) and Data Hubs.
- Using Customer Managed Encryption Keys for Encrypting GCP Disks and Database
By default, a Google-managed encryption key is used to encrypt disks and the Cloud SQL database used by the Data Lake, FreeIPA, and Data Hubs, but during environment registration you can optionally configure CDP to use a customer-managed encryption key (CMEK) instead.
- Fine-grained Access Control for Amazon S3
Ranger Authorization Service (RAZ) for Amazon S3 applies Ranger's fine-grained access control policies to CDP's access to Amazon S3 containers, directories, and files and can be controlled with admin-level access to CDP alone when enabled in a AWS CDP Public Cloud environment.
- Fine-grained Access Control for ADLS Gen2
Ranger Authorization Service (RAZ) for Azure Data Lake Storage (ADLS) Gen2 applies Ranger's fine-grained access control policies to CDP's access to ADLS Gen2 containers, directories, and files and can be controlled with admin-level access to CDP alone.
- Public Endpoint Access Gateway for Azure
You can enable Public Endpoint Access Gateway for Azure during Azure environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.