Adding access to external S3 buckets for Cloudera Data Warehouse clusters on AWS
This section explains how to add external S3 buckets to Cloudera Data Warehouse (CDW) service clusters running in AWS environments, and how to query the data in them.
When you create a Virtual Warehouse in the CDW service, a cluster is created in your AWS account. This cluster has two S3 buckets: one for managed data and one for external data. The CDW service creates both buckets using the following naming convention:
For example, if you specified the bucket name dwx-data when you registered your environment with Management Console, the managed data S3 bucket might be named something like:
Continuing the above scenario, where you specified dwx-data as the bucket name during environment registration, the external S3 bucket might be named:
Access to these two buckets is controlled by AWS instance profiles. To give your CDW service cluster access to additional external S3 buckets, you must edit the instance profile to grant read/write permissions on those buckets.
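As a rough illustration of what read/write permissions on an additional bucket can look like, the following Python sketch builds an IAM policy document for a single bucket and prints it as JSON. The helper name build_bucket_policy and the bucket name my-additional-bucket are hypothetical, and the list of S3 actions is an assumption about what a typical read/write workload needs, not the exact set that the CDW service requires. Check the policy against your own security requirements before adding it to the IAM role behind the instance profile.

```python
import json


def build_bucket_policy(bucket_name: str) -> dict:
    """Return an IAM policy document granting read/write access to one S3 bucket.

    The action list is an assumed, typical read/write set; adjust as needed.
    """
    bucket_arn = f"arn:aws:s3:::{bucket_name}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Bucket-level actions apply to the bucket ARN itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket", "s3:GetBucketLocation"],
                "Resource": bucket_arn,
            },
            {
                # Object-level actions apply to the keys inside the bucket.
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject", "s3:DeleteObject"],
                "Resource": f"{bucket_arn}/*",
            },
        ],
    }


if __name__ == "__main__":
    # "my-additional-bucket" is a placeholder; substitute your own bucket name.
    print(json.dumps(build_bucket_policy("my-additional-bucket"), indent=2))
```

The policy uses two statements because listing applies to the bucket itself, while reading, writing, and deleting apply to the objects under it, so the two groups of actions need different resource ARNs.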