Compute Instance Types

Learn about the supported AWS and Azure compute instance types that you can select while creating a Virtual Warehouse.

Supported AWS compute instance types

Cloudera Data Warehouse supports the following AWS compute instance types (Hive and Impala executors):

Instance type Processor Usage Virtual Warehouse Support
r7gd.4xlarge ARM Compute Impala
r6gd.4xlarge ARM Compute Impala
r6id.4xlarge Intel Compute Hive and Impala
r5d.4xlarge Intel Compute (default) Hive and Impala
r5ad.4xlarge AMD Compute Hive and Impala
r5dn.4xlarge Intel Compute Hive and Impala
m5.2xlarge Intel Shared services Hive and Impala

In the Cloudera Data Warehouse environment, instances for shared service components are set up within a Kubernetes (K8s) cluster. The setup begins with three m5.2xlarge instances running the Cloudera Data Warehouse service, but the K8s cluster is capable of autoscaling, automatically adding more instances if necessary to handle increased demand. Additionally, an Amazon Relational Database Service (RDS) (db.r5.large) running PostgreSQL is created to store user metadata for Hue and Data Visualization services. In total, three shared db.r5.large nodes are used for this purpose. For more information, see Always active, shared services.

Supported Azure compute instance types

Cloudera Data Warehouse supports the following Azure compute instance types (Hive and Impala executors):

Azure VM Processor Type Usage Virtual Warehouse Support
Standard_E16pds_v5 ARM Compute Impala
Standard_E16_v3 Intel Compute Hive and Impala
Standard_E16ds_v4 Intel Compute (default) Hive and Impala
Standard_E16ads_v5 AMD Compute Hive and Impala
Standard_E16ds_v5 Intel Compute Hive and Impala
Standard_D8s_v4 intel Shared services (default) Hive and Impala
Standard_D8as_v5 Intel Shared services, used with AMD compute instance Standard_E16ads_v5 Hive and Impala

Three instances are added to the cluster as needed for shared services (always on components). Three shared nodes are dStandard_E2s_v3 MemoryOptimized using flexserver, for the Amazon Relational Database Service (RDS). These shared nodes are used for Hue and Data Visualization user metadata. For more information, see Always active, shared services.