Analyzing your data with Kudu
After preparing your environment, you need to create a Kudu table, and also connect Flink with Kudu by providing the Kudu master. You also need to choose a source to which you connect Flink in Data Hub. After generating data to your source, Flink applies the computations you have added in your application design. The results are redirected to your Kudu sink.
- You have a CDP Public Cloud environment.
- You have a CDP username (it can be your own CDP user or a CDP machine user) and
a password set to access Data Hub clusters.
The predefined resource role of this user is at least EnvironmentUser. This resource role provides the ability to view Data Hub clusters and set the FreeIPA password for the environment.
- Your user is synchronized to the CDP Public Cloud environment.
- You have a Streaming Analytics cluster.
- You have a Real-time Data Mart cluster in the same Data Hub environment as the Streaming Analytics cluster.
- Your CDP user has the correct permissions set up in Ranger allowing access to Kudu.
- You obtained the Kudu Master hosts:
- Go to .
- Search for your environment from the list of available environments.
- Select the Data Hub cluster within your environment from the list of available clusters.
- Select Kudu Master from the list of Services.
- Click Masters.
- Copy the host information from the list of Live Masters.