(Deprecated) Creating a connection to Cloudera Data Warehouse for Cloudera Data Warehouse Operator
Learn how to create an Airflow connection to an existing Cloudera Data Warehouse virtual warehouse before running workloads with the Cloudera Data Warehouse Operator.
The following steps are for using the Airflow service provided with each Cloudera Data Engineering virtual cluster. For information about using your own Airflow deployment, see Using Cloudera Data Engineering with an external Apache Airflow deployment.
To determine the Cloudera Data Warehouse hostname to use for the connection, perform the following steps:
- In the Cloudera Management Console, click the Data Warehouse tile and click Overview.
- In the Virtual Warehouses column, locate the Hive or Impala warehouse you want to connect to.
- Click the actions menu icon next to the selected warehouse, and then click Copy JDBC URL.
- Paste the URL into a text editor, and make note of the hostname. For example, in the following JDBC URL, the hostname is hs2-aws-2-hive.env-k5ip0r.dw.ylcu-atmi.cloudera.site:
  jdbc:hive2://hs2-aws-2-hive.env-k5ip0r.dw.ylcu-atmi.cloudera.site/default;transportMode=http;httpPath=cliservice;ssl=true;retries=3;
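If you prefer not to pick the hostname out by eye, the extraction can be scripted. The following is a minimal sketch in Python that uses only the standard library. The function name `jdbc_hostname` is a hypothetical helper introduced for illustration; it is not part of Airflow or any Cloudera tooling. It assumes the copied URL has the `jdbc:<scheme>://<host>/...` shape shown in the example above.

```python
from urllib.parse import urlsplit


def jdbc_hostname(jdbc_url: str) -> str:
    """Return the hostname from a JDBC URL such as jdbc:hive2://host/db;opts.

    Hypothetical helper for illustration only.
    """
    prefix = "jdbc:"
    if not jdbc_url.startswith(prefix):
        raise ValueError("expected a URL starting with 'jdbc:'")

    # Drop the leading "jdbc:" so the remainder parses like an ordinary URL
    # (for example, hive2://host/default;transportMode=http;...).
    parts = urlsplit(jdbc_url[len(prefix):])

    # urlsplit separates the host from the path and any port; the
    # semicolon-delimited JDBC options stay in the path and are ignored here.
    if not parts.hostname:
        raise ValueError("no hostname found in JDBC URL")
    return parts.hostname


if __name__ == "__main__":
    url = (
        "jdbc:hive2://hs2-aws-2-hive.env-k5ip0r.dw.ylcu-atmi.cloudera.site/"
        "default;transportMode=http;httpPath=cliservice;ssl=true;retries=3;"
    )
    print(jdbc_hostname(url))
```

The printed value is the hostname to use when you fill in the connection details.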
To create a connection to an existing Cloudera Data Warehouse virtual warehouse using the embedded Airflow UI, perform the following steps: