Starting your dataflow

Once started, the GenerateFlowFile processor continuously generates flow files containing random data, according to its configured schedule and content settings. To confirm that you have built a dataflow that can push data into Ozone, start the dataflow and verify that data is moving through it.

  1. Select the processors that you want to start.
  2. Click the Start icon in the Actions toolbar.

    Alternatively, right-click a single processor and choose Start from the context menu.

    The GenerateFlowFile processor should generate 1 MB of data every minute, and the generated data is written to the "Directory" you configured earlier.

  3. Verify that the files have been written to the target directory by running the following command:
    ozone fs -ls ofs://ozone1/vol1/buck1/
    
    Found 4 items
    -rw-rw-rw-   3 systest systest    1048576 2020-10-27 18:05 ofs://ozone1/vol1/buck1/02e3bf6a-e419-4a12-9354-90e33f80f598
    -rw-rw-rw-   3 systest systest    1048576 2020-10-27 18:05 ofs://ozone1/vol1/buck1/a17b240a-6239-4262-b265-f1fc6af77882
    -rw-rw-rw-   3 systest systest    1048576 2020-10-27 18:05 ofs://ozone1/vol1/buck1/a6147d8c-91f6-41d6-a99d-7afb985dc96d
    -rw-rw-rw-   3 systest systest    1048576 2020-10-27 18:05 ofs://ozone1/vol1/buck1/f5610e4f-7b80-4a28-92ac-1646916a2324
    
  4. Adjust the configuration of the GenerateFlowFile processor, the data ingest processor, and any other connected processors to match your use case and dataflow requirements.
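
The check in step 3 can also be scripted. The following is a minimal sketch, not part of the product: check_listing is a hypothetical helper name, and it assumes the listing has the same column layout as the sample output above, with the file size in the fifth column. It reads the listing on standard input, counts the file entries, and reports how many have the expected size.

```shell
# check_listing (hypothetical helper): reads "ozone fs -ls" output on stdin
# and checks that every file entry has the expected size in bytes.
# Assumption: the size is the fifth whitespace-separated column, as in the
# sample listing shown in step 3.
check_listing() {
  expected_size="$1"
  awk -v want="$expected_size" '
    BEGIN { ok = 0; total = 0 }
    # File entries start with "-" in the permissions column; the
    # "Found N items" header line is skipped.
    $1 ~ /^-/ { total++; if ($5 == want) ok++ }
    END {
      printf "%d of %d files are %d bytes\n", ok, total, want
      # Exit 0 only if at least one file was listed and all sizes match.
      exit !(total > 0 && ok == total)
    }
  '
}

# Example usage against the path from step 3 (requires the ozone CLI):
#   ozone fs -ls ofs://ozone1/vol1/buck1/ | check_listing 1048576
```

With the 1 MB setting described above, the expected size is 1048576 bytes. The function returns a non-zero exit status when the listing is empty or any file has a different size, so it can be used in a script or a scheduled check.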