Activating an AWS environment from CDW
To use an AWS environment for Cloudera Data Warehouse (CDW) Public Cloud you must first activate it.
- Resource group
- Compute instances, which are virtual machine scale sets
- Load balancer(s)
- Public IP address(es)
- Network security group
- Disk(s)
CDW supports the EC2 instances as cluster nodes. CDP supports the following AWS compute
instance types (Hive and Impala executors), which you select during environment activation:
Instance type | Processor | Usage |
---|---|---|
r5d.4xlarge | Intel | Compute (default) |
r5ad.4xlarge | AMD | Compute |
r5dn.4xlarge | Intel | Compute |
m5.2xlarge | Intel | Shared services |
Instances are added to the cluster as needed for shared services (always on components). The CDW service will initially activate 3 shared m5.2xlarge instances to run the service in your environment. Additionally, CDW will activate 1 db.r5.large Amazon RDS instance using Postgres to manage Hue and Data Visualization user metadata.Three shared nodes are db.r5.large using Postgres, for the Amazon Relational Database Service (RDS). These shared nodes are used for Hue and Data Visualization user metadata. For more information, see Always active, shared services.
- Obtain the DWAdmin role.
- Review the AWS environment requirements.