Creating a CML data connection to an Impala data warehouse

Learn how to connect natively to data stored in Impala when using CDP Data Visualization in Cloudera Machine Learning (CML).

You must connect to your data prior to using the data modeling and visualization framework of CDP Data Visualization. The following steps show you how to create a new CML data connection to an Impala data warehouse.

  1. On the main navigation bar, click Data.

    The Data view appears, open on the Datasets tab.

  2. In the sidebar, click New Connection.
    The Create New Data Connection modal window appears.
    Create new connection
  3. Select the Impala Connection type from the drop-down list and enter the hostname or IP address of the running coordinator. You can get the coordinator hostname from the JDBC URL of the Impala DW.
  4. Use port 443.
  5. Click the Advanced tab and make the selections below:
  6. Use your workload username and password as credentials.
  7. Test and then Save the connection.
You have set up a connection to a running Impala DW.