Creating a Database Catalog

When you activate an environment in CDW, a default Database Catalog is created. You can create a new Database Catalog from the Data Warehouse UI.

If you register an environment and then activate the environment from Environments, the Database Catalog gives you access from CDW to an SDX Data Lake, as indicated above.

If you activate an environment from the CDW service, the Database Catalog gives you access from CDW to a DWX Data Lake.

If you activate an environment from the CDW service, a default Database Catalog is created automatically and named after your environment.

You can configure a CDW resource template, which fulfills all requirements for a Database Catalog (dbc) for the type of Virtual Warehouse you have, for example a Hive or Impala Virtual Warehouse:
  • Default resources
  • Medium resources
  • Large resources

The template you configure for the Database Catalog determines the container size of Hive Metastore (HMS) for storing your workload metadata and other resources. To avoid unnecessary cloud expenses, do not change the template configuration unless you experience Java heap issues.

Given Ranger permissions, you can access any objects or data sets created in the Data Hub or the Data Engineering clusters from CDW Virtual Warehouses and vice versa. The CDW service sets up the Kubernetes cluster, which provides the computing resources for the Database Catalog. The CDW service uses the existing data lake that was set up for the environment, including all data, metadata, and security. The following procedure shows you steps to create a Database Catalog to replace your default Database Catalog.

You must have activated an environment in CDW.
  1. Log in to the Data Warehouse service as DWAdmin.
  2. Go to the Database Catalogs page and click New Database Catalog.
    The Create Database Catalog modal is displayed.
  3. In Name, specify a Database Catalog name.
  4. Select an environment from the Environments drop-down menu.
  5. Select the Hive version for constructing the Hive Metastore from the Database Catalog Image Version drop-down menu.
  6. Select a resource template from the Resource Template drop-down menu.
  7. Optional: Select the Load Demo Data to use sample airline data in Hue.
  8. Click Create Database Catalog.