Adding a new Virtual Warehouse
This topic describes the procedure for creating a new Virtual Warehouse in Cloudera Data Warehouse (CDW) Public Cloud.
A Virtual Warehouse is an instance of compute resources that is equivalent to a cluster. A Virtual Warehouse provides access to the data in tables and views in the data lake that correlates to a specific Database Catalog. Virtual Warehouses can only lookup the Database Catalog that they have been configured to access.
When you create a Virtual Warehouse, a cluster is created in your AWS account. This cluster has two buckets. One bucket is used for managed data and the other is used for external data.
Required role: DWAdmin
Before you create a new Virtual Warehouse, determine what is the number of concurrent queries or users your Virtual Warehouse must serve during peak periods. This information helps you determine what size of Virtual Warehouse you need. Choose the size based on the number of nodes you typically use for clusters in an on-premises deployment. Also consider the complexity of your queries and the size of the data sets that they access. Larger sized warehouses with more executor nodes can cache more data, which enhances performance.
Virtual Warehouse sizes you can choose from:
|Virtual Warehouse Size||Number of Nodes|
|Custom||Enter a value between '1' and '100'|
- Log in to the CDP web interface and navigate to the Data Warehouse service.
- In the Data Warehouse service, click Virtual Warehouses in the left navigation panel.
- On the Virtual Warehouses page, click Add New.
In the New Virtual Warehouse dialog box, specify a Name, its
Type (Hive or Impala), which Database
Catalog it queries,User Groups that can access
endpoints, keys and values for Tagging the Virtual Warehouse, and
- When you specify a Size, you can configure auto-scaling thresholds. For more information about tuning auto-scaling, see the links in the "Related information" section at the bottom of this page.
- Click Create to create the new Virtual Warehouse.