Terminology

This tutorial uses the following terms and concepts that you should be familiar with.

Term Definition
Apache NiFi Low-code data ingestion tool built to automate the flow of data between systems
Flow Represents data flow logic that was developed using Apache NiFi
Processor Component in the data flow that perform work combining data routing, transformation, and mediation between systems
Cloudera Public Cloud Cloudera’s data management platform in the cloud
Cloudera DataFlow Service Cloudera DataFlow data service that enables self-serve deployments of Apache NiFi
Cloudera DataFlow function Flow that is uploaded into the Cloudera DataFlow Catalog that can be run in serverless mode by serverless cloud provider services
Cloudera DataFlow Catalog Inventory of flow definitions from which you can initiate new deployments
AWS Lambda Serverless, event-driven compute service that lets you run a Cloudera DataFlow function without provisioning or managing servers