Creating Kafka tables using the Console wizard

After registering a Kafka data provider, you can use the Add table wizard in Streaming SQL Console to create a Kafka table.

  • Make sure that you have registered Kafka as a Data Provider.
  • Make sure that you have created topics in Kafka.
  • Make sure that the Kafka topic contains data.
  • Make sure that you have the right permissions set in Ranger.
  1. Navigate to the Streaming SQL Console.
    1. Navigate to Management Console > Environments, and select the environment where you have created your cluster.
    2. Select the Streaming Analytics cluster from the list of Data Hub clusters.
    3. Select Streaming SQL Console from the list of services.
    The Streaming SQL Console opens in a new window.
  2. Select Console from the left-side menu.
  3. Go to the Tables tab.
  4. Select Add table > Apache Kafka.
    The Kafka Table window appears.
  5. Provide a Name for the Table.
  6. Select a registered Kafka provider as Kafka cluster.
  7. Select a Kafka topic from the list.
  8. Select the Data format.
    • You can select JSON as the data format.
    • You can select AVRO as the data format.
  9. Determine the Schema for the Kafka table.
    1. To define the schema manually, add a customized schema to the Schema Definition field.
    2. To infer the schema automatically, click Detect Schema, which reads a sample of the JSON messages.
  10. Customize your Kafka Table with the following options:
    1. Configure the Event Time if you do not want to use the Kafka Timestamps.
      1. Clear the Use Kafka Timestamps checkbox.
      2. Provide the name of the Input Timestamp Column.
      3. Add a name for the Event Time Column.
      4. Enter a value for Watermark Seconds.
    2. To configure an Input Transform, add the code using the Transformations tab.
    3. Configure any required Kafka properties using the Properties tab.
    For more information about how to configure the Kafka table, see the Configuring Kafka tables section.
  11. Select Save Changes.
The Kafka table is ready to be used in a SQL job, either in the FROM clause or in an INSERT INTO statement.
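
To give a rough idea of what inferring a schema from sample JSON messages involves (step 9), the following minimal Python sketch collects the observed type of each top-level field across a few messages. It is only an illustration and not the Console's actual Detect Schema implementation. The function names, the type names, and the sample messages are hypothetical, and the sketch assumes every message is a JSON object.

```python
import json


def infer_type(value):
    """Map a decoded JSON value to a simple, illustrative type name."""
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "boolean"
    if isinstance(value, int):
        return "long"
    if isinstance(value, float):
        return "double"
    if isinstance(value, str):
        return "string"
    if value is None:
        return "null"
    if isinstance(value, list):
        return "array"
    return "record"  # remaining case: a nested JSON object


def infer_schema(sample_messages):
    """Collect the observed types of every top-level field in the samples."""
    fields = {}
    for raw in sample_messages:
        record = json.loads(raw)  # assumes each message is a JSON object
        for name, value in record.items():
            fields.setdefault(name, set()).add(infer_type(value))
    return {name: sorted(types) for name, types in fields.items()}


# Hypothetical sample messages, standing in for data read from a topic.
samples = [
    '{"sensor_id": 7, "temp": 21.5, "ts": "2024-01-01T00:00:00Z"}',
    '{"sensor_id": 8, "temp": null, "ts": "2024-01-01T00:00:05Z"}',
]
schema = infer_schema(samples)
```

A field that appears with more than one type, such as `temp` here, shows why the topic should already contain representative data before a schema is inferred from it.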
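
The event-time settings in step 10 can be pictured with a small simulation. The sketch below assumes that Watermark Seconds acts as a fixed delay subtracted from the largest event time seen so far, which is how a bounded out-of-orderness watermark is commonly defined. That interpretation, the function name, and the sample timestamps are assumptions made for illustration, not a description of the Console's internals.

```python
def track_watermark(event_times, watermark_seconds):
    """Return (event_time, watermark, behind) for each event, in arrival order.

    The watermark is modelled as the largest event time seen so far minus
    watermark_seconds. An event is flagged as behind when it is older than
    the watermark established by the events that arrived before it.
    """
    max_seen = None
    results = []
    for ts in event_times:
        behind = max_seen is not None and ts < max_seen - watermark_seconds
        max_seen = ts if max_seen is None else max(max_seen, ts)
        results.append((ts, max_seen - watermark_seconds, behind))
    return results


# Hypothetical event times in seconds, arriving slightly out of order.
events = [100, 104, 101, 110, 105]
results = track_watermark(events, watermark_seconds=3)
```

In this simplified model, the event with time 101 is still accepted because it is within 3 seconds of the largest time seen (104), while the event with time 105 arrives after the watermark has advanced to 107 and is flagged as behind. A larger Watermark Seconds value tolerates more disorder at the cost of later results.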