Adding Kudu as Catalog

You need to add Kudu as a Catalog using the Streaming SQL Console in SQL Stream Builder (SSB) to create Kudu tables with Flink DDL.

  • Make sure that you have Kudu service on your cluster.
  • Make sure that you have the right permissions set in Ranger.
  1. Navigate to the Streaming SQL Console.
    1. Navigate to Management Console > Environments, and select the environment where you have created your cluster.
    2. Select the Streaming Analytics cluster from the list of Data Hub clusters.
    3. Select Streaming SQL Console from the list of services.
    The Streaming SQL Console opens in a new window.
  2. Click Data Providers from the main menu.
  3. Click Register Catalog.
    The Add Catalog window appears.
  4. Add a Name to your catalog.
  5. Select Kudu from the Catalog Type drop-down.
  6. Add the Host URL of Kudu Masters.
    1. Go to your cluster in Cloudera Manager.
    2. Select Kudu from the list of services.
    3. Click on Instances.
    4. Copy the Hostname of the Master Default Group.
    5. Add the default port of Kudu after the hostname.
  7. Click on Add Filter.
    1. Provide a Database and Table filter if you want to select specific tables to use from the catalog.
  8. Click on Validate.
  9. If the validation is successful, click Add Tables.
Kudu is added as a Catalog and ready to be used in Flink DDL. The already existing tables in Kudu are automatically imported to SSB.