Compute Instance Types
Learn about the supported AWS and Azure compute instance types that you can select while creating a Virtual Warehouse.
Supported AWS compute instance types
Cloudera Data Warehouse supports the following AWS compute instance types (Hive and Impala executors):
Instance type | Processor | Usage | Virtual Warehouse Support |
---|---|---|---|
r7gd.4xlarge | ARM | Compute | Impala |
r6gd.4xlarge | ARM | Compute | Impala |
r6id.4xlarge | Intel | Compute | Hive and Impala |
r5d.4xlarge | Intel | Compute (default) | Hive and Impala |
r5ad.4xlarge | AMD | Compute | Hive and Impala |
r5dn.4xlarge | Intel | Compute | Hive and Impala |
m5.2xlarge | Intel | Shared services | Hive and Impala |
In the Cloudera Data Warehouse environment, instances for shared service components are set up within a Kubernetes (K8s) cluster. The setup begins with three m5.2xlarge instances running the Cloudera Data Warehouse service, but the K8s cluster is capable of autoscaling, automatically adding more instances if necessary to handle increased demand. Additionally, an Amazon Relational Database Service (RDS) (db.r5.large) running PostgreSQL is created to store user metadata for Hue and Data Visualization services. In total, three shared db.r5.large nodes are used for this purpose. For more information, see Always active, shared services.
Supported Azure compute instance types
Cloudera Data Warehouse supports the following Azure compute instance types (Hive and Impala executors):
Azure VM | Processor Type | Usage | Virtual Warehouse Support |
---|---|---|---|
Standard_E16pds_v5 | ARM | Compute | Impala |
Standard_E16_v3 | Intel | Compute | Hive and Impala |
Standard_E16ds_v4 | Intel | Compute (default) | Hive and Impala |
Standard_E16ads_v5 | AMD | Compute | Hive and Impala |
Standard_E16ds_v5 | Intel | Compute | Hive and Impala |
Standard_D8s_v4 | intel | Shared services (default) | Hive and Impala |
Standard_D8as_v5 | Intel | Shared services, used with AMD compute instance Standard_E16ads_v5 | Hive and Impala |
Three instances are added to the cluster as needed for shared services (always on components). Three shared nodes are dStandard_E2s_v3 MemoryOptimized using flexserver, for the Amazon Relational Database Service (RDS). These shared nodes are used for Hue and Data Visualization user metadata. For more information, see Always active, shared services.