Configuring a CDW-Kudu connection
The steps for making the connection involve setting a Impala coordinator property to the fully qualfied domain names (FQDNs) of the Kudu master hosts.
- You must meet the prerequisites listed in the "Connecting CDW and Kudu".
- You need to have gathered the FQDNs of Kudu master hosts as described in the previous topic, "Gathering Kudu master FQDNs".
- From the Management Console or CDP landing page, navigate to Data Warehouses.
- In Virtual Warehouses, select your Impala Virtual Warehouse, and click .
In Details, click
If you have a new Virtual Warehouse created using version 2023.0.13-20, select
flagfile from the drop-down list, and set the
kudu_master_hosts key to a list of all Kudu master FQDNs.
New Virtual Warehouse example:
This screenshot shows the three FQDNs, oversimplified for readability.
kudu_master_hosts=kudu-master1.sandbox.a.cloudera.site:7051,kudu-master2.sandbox.a.cloudera.site:7051,kudu-master3.sandbox.a.cloudera.site:7051Existing Virtual Warehouse:If you upgraded an existing Virtual Warehouse to version 2023.0.13.20, you must perform two configurations:
- Click +, and in Add Custom Configurations, set kudu_master_hosts to the FQDN of a single master host as shown below:
- Select flagfile from the drop-down list, and add the FQDNs of other Kudu master hosts to the value of kudu_master_hosts.
Apply the change.
When the Virtual Warehouse finishes updating, you can query Kudu tables from Hue, an Impala shell, or an ODBC/JDBC client. For example:
CREATE TABLE my_first_kudu_table ( id BIGINT, name STRING, PRIMARY KEY(id) ) PARTITION BY HASH PARTITIONS 16 STORED AS KUDU; Insert into my_first_kudu_table values (101,’A’),(102,’B’),(103,’C’); Select * from my_first_kudu_table;