Creating a connection to run jobs on other CDE Virtual Clusters

Learn how to create a CDE type connection to run jobs on other CDE Virtual Clusters.

The following steps are for using the Airflow service provided with each CDE Virtual Cluster. For information about using your own Airflow deployment, see Using Cloudera Data Engineering with an external Apache Airflow deployment.

To create a connection to an existing CDE Virtual Cluster using the embedded Airflow UI, perform the following steps:

  1. In the Cloudera Data Platform (CDP) console, click the Data Engineering tile. The CDE Home page displays.
  2. Click Administration in the left navigation menu and select the service containing the Virtual Cluster that you are using.
  3. In the Virtual Clusters column, click Cluster Details for the Virtual Cluster.
  4. Click AIRFLOW UI.
  5. From the Airflow UI, click the Connection link from the Admin menu.
  6. Click the plus sign to add a new record and fill the following fields:
    • Conn Id: Create a unique connection identifier. For example, cde_runtime_api.
    • Conn Type: Select Cloudera Data Engineering.
    • Virtual Cluster API endpoint: Enter the target Virtual Clusters Jobs API URL.
    • CDP Access Key: Enter the CDP access key of the account for running jobs on the CDE Virtual Cluster.
    • CDP Private Key: Enter the CDP private key associated to the CDP Access Key that you have entered.
    • Extra: Extra arguments must in JSON format. Available extra parameters are as follows:
      • Proxy: Optional. Translates to https_proxy/HTTPS_PROXY environment variables. The default value is None.
      • Region: Optional. CDP Control Plane region ("us-west-1", "eu-1" or "ap-1") is inferred automatically, if not specified.

        For more information about the available extra arguments, see the GitHub page.

  7. Click Save.