Tutorial: customizing a ReadyFlow
Learn how to create a draft using a ReadyFlow as a template.
This tutorial shows you how you can open a ReadyFlow from the ReadyFlow Gallery and create a customized flow design using the ReadyFlow as a template. This tutorial uses the Hello World ReadyFlow, a simple flow design that retrieves the latest changes from Wikipedia through invoking the Wikipedia API. The flow converts JSON events to Avro, then filters and routes the events to two different processors which merge events together, and finally a file is written to local disk.
You will learn about the following actions:
- Opening a ReadyFlow as a draft on the Flow Design Canvas
- Creating a Controller Service
- Changing the configuration of processors
- Running a Test Session
- Publishing a draft to the Catalog as a flow definition
The Hello World ReadyFlow that you are about to customize can be deployed without any external dependencies and does not require any parameter values during deployment. Still, you must meet the following prerequisites before you can start building your first draft:
-
You must have an enabled and healthy Cloudera Data Flow environment.
-
You must be assigned the
DFDeveloper
role granting you access to the Flow Designer. -
You must be assigned the
DFCatalogAdmin
orDFCatalogViewer
role granting you access to the Catalog. You will need this authorization to publish your draft as a flow definition to the Catalog. -
You must be assigned the
DFFlowAdmin
role for the environment where you want to deploy the flow definition.