Integrating third-party tools

To use third-party BI tools, such as Tableau, with the Cloudera Data Warehouse service, you must configure the connection between your BI tool and the service. These instructions configure a connection with Tableau, but you can use the same JDBC JAR file and URL to connect other BI tools.

Before you can use your BI tool with the Data Warehouse service, you must create a Database Catalog that is populated with data. You can populate the Database Catalog with sample data when you create it. You must also create a Virtual Warehouse and configure it to connect to that Database Catalog.

  1. Log in to the CDP web interface and navigate to the Data Warehouse service.
  2. In the Data Warehouse service, click Virtual Warehouses in the left navigation panel.
  3. Download the latest JDBC driver from the Cloudera Downloads page (recommended). Alternatively, on the Virtual Warehouses page, click the options menu for the warehouse you want to connect to your BI tool and select Download JDBC Jar. Then install the JAR file.

    See the Tableau documentation for the location where you must place the JAR file.

  4. In Tableau, select Data > New Data Source. The Connect dialog box appears.
  5. In the Connect dialog box, in the More... list of servers, search for Cloudera Hadoop and open the Cloudera Hadoop dialog box.
  6. On the Overview page of the Data Warehouse service, open the options menu for the Virtual Warehouse you want to connect to Tableau and click Copy JDBC URL.

    This copies a URL similar to the following to your system clipboard:

    
    jdbc:hive2://<your_virtual_warehouse>.<your_environment>.<dwx.company.com>/default;transportMode=http;httpPath=cliservice;ssl=true;retries=3
                  
  7. Paste the URL into a text editor, then copy the host name portion (the text between jdbc:hive2:// and /default) into the Server field of the Tableau Cloudera Hadoop dialog box:
    
    <your_virtual_warehouse>.<your_environment>.<dwx.company.com>
                  
  8. In the Tableau Cloudera Hadoop dialog box, set the remaining options:
    • Port: 443
    • Type: HiveServer2
    • Authentication: Username and Password
    • Transport: HTTP
    • Username: Username you use to connect to the CDP Data Warehouse service
    • Password: Password you use to connect to the CDP Data Warehouse service
    • HTTP Path: cliservice
    • Require SSL: Selected
  9. Click Sign In.
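
Most of the values entered in steps 7 and 8 can be read directly from the copied JDBC URL. The following Python sketch shows one way to split such a URL and map its parts to the Tableau dialog box fields. It is an illustration only, not part of the Data Warehouse service or Tableau: the function names parse_hive_jdbc_url and tableau_fields and the host name vw1.env1.dwx.example.com are hypothetical, and the sketch assumes the URL has the same shape as the example in step 6.

```python
def parse_hive_jdbc_url(url: str) -> dict:
    """Split a jdbc:hive2 URL into host, port, database, and parameters."""
    prefix = "jdbc:hive2://"
    if not url.startswith(prefix):
        raise ValueError("expected a URL starting with jdbc:hive2://")
    rest = url[len(prefix):]

    # Text before the first ';' is host[:port]/database.
    # Text after it is a ';'-separated list of key=value pairs.
    authority_and_db, _, param_str = rest.partition(";")
    authority, _, database = authority_and_db.partition("/")
    host, _, port = authority.partition(":")
    params = dict(p.split("=", 1) for p in param_str.split(";") if "=" in p)

    return {"host": host, "port": port, "database": database, "params": params}


def tableau_fields(url: str) -> dict:
    """Map the parsed URL to the fields of the Cloudera Hadoop dialog box."""
    parsed = parse_hive_jdbc_url(url)
    params = parsed["params"]
    return {
        "Server": parsed["host"],
        # The copied URL carries no port, so fall back to 443 from step 8.
        "Port": parsed["port"] or "443",
        "Type": "HiveServer2",
        "Transport": params.get("transportMode", "").upper(),
        "HTTP Path": params.get("httpPath", ""),
        "Require SSL": params.get("ssl", "false").lower() == "true",
    }


# Hypothetical URL with the same structure as the one copied in step 6.
example = (
    "jdbc:hive2://vw1.env1.dwx.example.com/default;"
    "transportMode=http;httpPath=cliservice;ssl=true;retries=3"
)
fields = tableau_fields(example)
for name, value in fields.items():
    print(f"{name}: {value}")
```

The user name and password are not part of the URL, so you still enter them by hand as described in step 8.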