Creating a Database Catalog
When you activate an environment in CDW, a default Database Catalog is created. You can create a new Database Catalog from the Data Warehouse UI.
If you register an environment and then activate the environment from Environments, the Database Catalog gives you access from CDW to an SDX Data Lake, as indicated above.
If you activate an environment from the CDW service, the Database Catalog gives you access from CDW to a DWX Data Lake.
If you activate an environment from the CDW service, a default Database Catalog is created automatically and named after your environment.
- Default resources
- Medium resources
- Large resources
The template you configure for the Database Catalog determines the container size of Hive Metastore (HMS) for storing your workload metadata and other resources. To avoid unnecessary cloud expenses, do not change the template configuration unless you experience Java heap issues.
Given Ranger permissions, you can access any objects or data sets created in the Data Hub or the Data Engineering clusters from CDW Virtual Warehouses and vice versa. The CDW service sets up the Kubernetes cluster, which provides the computing resources for the Database Catalog. The CDW service uses the existing data lake that was set up for the environment, including all data, metadata, and security. The following procedure shows you steps to create a Database Catalog to replace your default Database Catalog.