Sampling data for a running job

You can sample data from a running job to the Streaming SQL Console without writing it to any sink. This lets you inspect the data and confirm that the job is running correctly and producing the results you expect.

Sampling the results to your browser allows you to inspect the queried data and iterate on your query. You can sample up to 100 rows in the Results tab under the SQL Editor on the Console page. Even if you do not specify a sink for the SQL job, the results automatically appear in the Results tab.

  1. Navigate to the Streaming SQL Console.
    1. Go to your cluster in Cloudera Manager.
    2. Select SQL Stream Builder from the list of services.
    3. Click SQLStreamBuilder Console.
    The Streaming SQL Console opens in a new window.
  2. Create a new job or select an existing job on the Getting Started window.
  3. Execute the job.
    The samples are automatically polled to the Console under the Results tab until the number of samples reaches the maximum number of results or you stop the job.
Sample results are displayed in the results window. If no data matches the SQL query, sampling stops after a few attempts.
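
The stopping behavior described above can be pictured with a short conceptual sketch. This is not how the Streaming SQL Console is implemented, and it does not call any SQL Stream Builder API. The function collect_samples, its parameters, and the value 3 for "a few attempts" are all hypothetical names and numbers chosen for illustration. The sketch also assumes that only consecutive empty polls count toward the attempt limit, which the text does not specify.

```python
from typing import Callable, List

MAX_SAMPLES = 100      # row limit mentioned in the text
MAX_EMPTY_POLLS = 3    # hypothetical stand-in for "a few attempts"


def collect_samples(
    poll: Callable[[], List[dict]],
    is_stopped: Callable[[], bool] = lambda: False,
    max_samples: int = MAX_SAMPLES,
    max_empty_polls: int = MAX_EMPTY_POLLS,
) -> List[dict]:
    """Poll for sample rows until one of three conditions holds:
    the sample limit is reached, the job is stopped, or several
    consecutive polls return no rows."""
    samples: List[dict] = []
    empty_polls = 0
    while len(samples) < max_samples and not is_stopped():
        batch = poll()
        if not batch:
            # No rows matched the query on this attempt.
            empty_polls += 1
            if empty_polls >= max_empty_polls:
                break
            continue
        # Assumption: a non-empty poll resets the empty-attempt counter.
        empty_polls = 0
        # Never keep more rows than the sample limit allows.
        samples.extend(batch[: max_samples - len(samples)])
    return samples
```

For example, passing a poll function that always returns an empty list ends the loop after three attempts with no samples, which mirrors the case where no data matches the query.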