What's new in Cloudera Flow Management

Discover the new functionalities and improvements in Cloudera Flow Management (CFM) 2.1.7 and learn how these new features can benefit you.

The CFM 2.1.7 release is based on Apache NiFi 1.26.0 and also incorporates many Cloudera-exclusive features and improvements.

Here is an overview of what is new in this release:

New processors

CalculateParquetOffsets
CalculateParquetRowGroupOffsets

These processors can be used in combination with ConvertRecord and Parquet Reader to significantly reduce the time required to convert very large Parquet files into another format.

CaptureChangeDebeziumDB2
CaptureChangeDebeziumMySQL
CaptureChangeDebeziumOracle
CaptureChangeDebeziumPostgreSQL
CaptureChangeDebeziumSQLServer

Currently in Technical Preview, these processors leverage the Debezium project to ingest Change Data Capture (CDC) events from external databases.

DecryptContentAge
EncryptContentAge

These new-generation processors are designed for data encryption and decryption. For more information, see Modernizing Streaming Encryption with age in Apache NiFi.

ListenOTLP

This processor enables NiFi to receive OpenTelemetry data from external applications by acting as a destination for OpenTelemetry Protocol (OTLP) agents. For more information about this new processor, see Building OpenTelemetry Collection in Apache NiFi with Netty.

PutClouderaHiveQL
PutClouderaHiveStreaming
PutClouderaORC
SelectClouderaHiveQL
UpdateClouderaHiveTable
TriggerClouderaHiveMetaStoreEvent

These Cloudera-exclusive components are designed to interact with Hive-based components in the Cloudera Data Platform. Because Hive components will no longer be part of Apache NiFi starting with NiFi 2.0, it is highly recommended to switch to these components as soon as possible to make the upgrade or migration to NiFi 2.0 easier.

PutJiraIssue

This processor allows you to create new issues in Jira using the Jira REST API.

PutZendeskTicket

This processor allows you to create Zendesk tickets using the Zendesk API.

ConsumeElasticsearch

This processor consumes new documents from an Elasticsearch index or query by repeatedly running a paginated Range query against a field.
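The range-based consumption pattern described above can be sketched as follows. This is a minimal illustration, not the processor's actual implementation: the function names, the tracking-value handling, and the page size are assumptions made for the example. Only the general shape of an Elasticsearch Range query body is standard.

```python
def build_range_query(field, last_value=None, page_size=100):
    """Build a query body that selects documents newer than last_value.

    Hypothetical helper: on the first run (no tracked value yet) it
    matches everything; afterwards it uses a Range query on the field.
    """
    if last_value is None:
        query = {"match_all": {}}
    else:
        query = {"range": {field: {"gt": last_value}}}
    # Sorting on the tracked field keeps pages in a stable, increasing order.
    return {"query": query, "sort": [{field: {"order": "asc"}}], "size": page_size}


def next_tracking_value(field, hits, last_value=None):
    """Return the highest value of the tracked field seen in a page of hits."""
    values = [hit["_source"][field] for hit in hits]
    return max(values) if values else last_value
```

Each run would send the body built from the last tracked value, then store the new maximum so that the next run only picks up documents added since.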

FilterAttribute

This processor filters the attributes of a FlowFile, either by retaining specified attributes and removing the rest, or by removing specified attributes and retaining the rest.
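The two filtering behaviors can be illustrated with a short sketch. It models FlowFile attributes as a plain dictionary. The function name and the "retain"/"remove" mode strings are assumptions for this example, not the processor's configuration or code.

```python
def filter_attributes(attributes, names, mode="retain"):
    """Return a filtered copy of a FlowFile-like attribute map.

    Hypothetical helper: mode "retain" keeps only the named attributes,
    mode "remove" drops the named attributes and keeps everything else.
    """
    selected = set(names)
    if mode == "retain":
        return {k: v for k, v in attributes.items() if k in selected}
    if mode == "remove":
        return {k: v for k, v in attributes.items() if k not in selected}
    raise ValueError(f"unknown mode: {mode}")


attrs = {"filename": "data.csv", "source": "sftp", "debug.trace": "abc"}
kept = filter_attributes(attrs, ["filename", "source"], mode="retain")
cleaned = filter_attributes(attrs, ["debug.trace"], mode="remove")
```

In this example both calls produce the same result, which shows that the two behaviors are complementary ways of expressing the same outcome.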

PackageFlowFile

This processor packages FlowFile attributes and content into an output FlowFile that can be exported from NiFi and imported back into NiFi, preserving the original attributes and content.

PublishSlack

This processor allows you to post a message to the specified Slack channel.

New controller services

ActiveMQJMSConnectionFactoryProvider

This controller service allows you to interact with ActiveMQ without the need to deploy the JMS client on all NiFi nodes.

ClouderaHiveConnectionPool

This controller service allows you to interact with Hive without the need to deploy the required dependencies on all NiFi nodes.

DatabaseTableSchemaRegistry

This controller service enables you to retrieve the schema associated with a table from an external database. This allows you to validate the data going through NiFi against that schema before pushing the data into this table.
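The idea of validating records against a table's schema before writing to that table can be sketched as follows. This is a conceptual illustration only: it uses an in-memory SQLite database from the Python standard library as a stand-in for the external database, and the helper names and the validation rules (required fields and unknown fields) are assumptions, not the controller service's behavior.

```python
import sqlite3


def table_schema(conn, table):
    """Read column names, types, and NOT NULL flags for a table.

    The table name is interpolated directly, so this sketch must only be
    used with trusted table names.
    """
    rows = conn.execute(f"PRAGMA table_info({table})").fetchall()
    # Each row is (cid, name, type, notnull, default_value, pk).
    return {
        name: {"type": col_type, "required": bool(notnull)}
        for _, name, col_type, notnull, _, _ in rows
    }


def validate(record, schema):
    """Return a list of problems found when checking a record against a schema."""
    errors = []
    for name, column in schema.items():
        if column["required"] and record.get(name) is None:
            errors.append(f"missing required field: {name}")
    for name in record:
        if name not in schema:
            errors.append(f"unknown field: {name}")
    return errors


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER NOT NULL, name TEXT)")
schema = table_schema(conn, "users")
```

A record that passes this kind of check can be written to the table with less risk of a rejected insert. A record that fails can be routed elsewhere for inspection.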

ImpalaConnectionPool

This controller service allows you to interact with Impala without the need to deploy the required dependencies on all NiFi nodes.

RabbitMQJMSConnectionFactoryProvider

This controller service allows you to interact with RabbitMQ without the need to deploy the JMS client on all NiFi nodes.

ProtobufReader

This record reader allows you to read Protobuf data with the record-based components.

YamlTreeReader

This record reader allows you to read YAML data with the record-based components.