Supported NiFi processors

Learn about the processors supported in Flow Management Data Hub clusters using Apache NiFi 1 or NiFi 2 in Cloudera DataFlow for Data Hub 7.3.1.400.

For optimal performance and reliable support, use only supported processors in production environments and avoid deploying unsupported ones.

Additional processors are developed and tested by the community but are not officially supported by Cloudera. Processors may be excluded for various reasons, including insufficient reliability, incomplete test coverage, community declaration of non-production readiness, or deviations from Cloudera best practices.

By adhering to the above guidelines, you can maintain stable and reliable workflows in your production environments.
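One way to apply this guidance is to check a flow against the supported list before you promote it to production. The following Python sketch is illustrative only and is not a Cloudera tool. It assumes the flow is available as a JSON document whose root (or its "flowContents" entry) holds "processors" and nested "processGroups" lists, and that each processor carries a fully qualified "type" string. Those field names, the function names, and the SUPPORTED subset are assumptions made for this example; replace the subset with the full list for your NiFi version from this page.

```python
# Hypothetical helper: flag processors in a flow that are not on the supported list.
# The JSON field names used here ("flowContents", "processors", "processGroups",
# "type") are assumptions about the exported flow layout; adjust them as needed.

# Illustrative subset only; copy the full list for your NiFi version from this page.
SUPPORTED = {"GetFile", "PutFile", "UpdateAttribute", "RouteOnAttribute", "LogAttribute"}


def processor_types(group):
    """Yield the short class name of every processor in a group, recursively."""
    for proc in group.get("processors", []):
        # "org.apache.nifi.processors.standard.GetFile" -> "GetFile"
        yield proc["type"].rsplit(".", 1)[-1]
    for child in group.get("processGroups", []):
        yield from processor_types(child)


def unsupported_processors(flow, supported=SUPPORTED):
    """Return the sorted names of processors in the flow that are not in `supported`."""
    root = flow.get("flowContents", flow)
    return sorted(set(processor_types(root)) - set(supported))


if __name__ == "__main__":
    # Made-up sample flow: one supported processor and one custom processor.
    sample = {
        "flowContents": {
            "processors": [{"type": "org.apache.nifi.processors.standard.GetFile"}],
            "processGroups": [
                {"processors": [{"type": "com.example.MyCustomProcessor"}]}
            ],
        }
    }
    print(unsupported_processors(sample))
```

An empty result means every processor in the flow appears in the list you supplied. Any name that is returned should be reviewed against the list for your release before the flow is deployed.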

NiFi 1.28.1 in Cloudera Flow Management 2.2.9.400

AttributesToCSV GetElasticsearch PutDropbox
AttributesToJSON GetFile PutDynamoDB
Base64EncodeContent GetFTP PutDynamoDBRecord
CalculateParquetOffsets GetGcpVisionAnnotateFilesOperationStatus PutElasticsearchHttp1
CalculateParquetRowGroupOffsets GetGcpVisionAnnotateImagesOperationStatus PutElasticsearchHttpRecord1
CalculateRecordStats GetHBase PutElasticsearchJson
CaptureChangeDebeziumDB2 [Technical Preview] GetHDFS PutElasticsearchRecord1
CaptureChangeDebeziumMySQL [Technical Preview] GetHDFSFileInfo PutEmail
CaptureChangeDebeziumOracle [Technical Preview] GetHDFSSequenceFile PutFile
CaptureChangeDebeziumPostgreSQL [Technical Preview] GetHTMLElement PutFTP1
CaptureChangeDebeziumSQLServer [Technical Preview] GetHTTP PutGCSObject
CaptureChangeMySQL GetHubSpot PutGoogleDrive
CompressContent1, 2 GetIgniteCache PutGridFS
ConnectWebSocket GetJiraIssue PutHBaseCell
ConsumeAMQP GetJMSQueue PutHBaseJSON
ConsumeAzureEventHub GetJMSTopic PutHBaseRecord1
ConsumeElasticsearch GetMongoRecord PutHDFS
ConsumeEWS GetSFTP PutHive3QL
ConsumeGCPubSub GetShopify PutHive3Streaming
ConsumeGCPubSubLite GetSNMP PutHiveQL
ConsumeJMS GetSnowflakeIngestStatus PutHiveStreaming
ConsumeKafka_1_0 GetSolr PutHTMLElement
ConsumeKafka_2_0 GetSplunk PutIceberg [Technical Preview]
ConsumeKafka_2_6 GetSQS PutIcebergCDC
ConsumeKafka2CDP GetTCP PutInfluxDB
ConsumeKafka2RecordCDP GetTwitter PutJiraIssue
ConsumeKafkaRecord_1_0 GetWorkdayReport PutJMS1
ConsumeKafkaRecord_2_0 GetZendesk PutKinesisFirehose
ConsumeKafkaRecord_2_6 HandleHttpRequest PutKinesisStream
ConsumeKinesisStream HandleHttpResponse PutKudu
ConsumeMQTT1 HashAttribute PutLambda
ConsumeTwitter HashContent PutMongoRecord
ConsumeWindowsEventLog IdentifyMimeType PutORC1
ControlRate InvokeAWSGatewayApi PutParquet
ConvertAvroSchema InvokeGRPC PutRecord
ConvertAvroToJSON InvokeGRPC PutRedisHashRecord [Technical Preview]
ConvertAvroToORC InvokeHTTP PutRiemann
ConvertAvroToParquet InvokeScriptedProcessor PutS3Object
ConvertCharacterSet JoinEnrichment PutSalesforceObject
ConvertCSVToAvro JoltTransformJSON PutSFTP
ConvertJSONToAvro JoltTransformRecord PutSmbFile
ConvertJSONToSQL JSLTTransformJSON PutSnowflakeInternalStage
ConvertProtobuf JsonQueryElasticsearch PutSNS
ConvertRecord ListAzureBlobStorage PutSolrContentStream
CountText [2.1.7 SP2+ only] ListAzureBlobStorage_v12 PutSolrRecord
CreateHadoopSequenceFile ListAzureDataLakeStorage PutSplunk
CryptographicHashAttribute ListBoxFile PutSplunkHTTP
CryptographicHashContent ListCDPObjectStore PutSQL
DecryptContent ListDatabaseTables PutSQS1
DecryptContentAge ListDropbox PutSyslog
DecryptContentCompatibility ListenBeats PutTCP
DecryptContentPGP ListenFTP PutUDP
DeduplicateRecord ListenGRPC* PutWebSocket
DeleteAzureBlobStorage ListenGRPC* PutZendeskTicket
DeleteAzureBlobStorage_v12 ListenHTTP QueryAirtableTable
DeleteAzureDataLakeStorage ListenNetFlow QueryCassandra
DeleteByQueryElasticsearch ListenOTLP QueryDatabaseTable1
DeleteCDPObjectStore ListenRELP QueryDatabaseTableRecord
DeleteDynamoDB ListenSyslog QueryElasticsearchHttp
DeleteGCSObject ListenTCP QueryRecord
DeleteGridFS ListenTCPRecord QuerySalesforceObject
DeleteHBaseCells ListenTrapSNMP QuerySolr
DeleteHBaseRow ListenUDP QuerySplunkIndexingStatus
DeleteHDFS ListenUDPRecord QueryWhois
DeleteS3Object ListenWebSocket RemoveRecordField
DeleteSQS ListFile ReplaceText
DetectDuplicate ListFTP ReplaceTextWithMapping
DistributeLoad ListGCSBucket ResizeImage1
DuplicateFlowFile ListGoogleDrive RetryFlowFile
EncodeContent ListHDFS RouteHL7
EncryptContent2 ListS3 RouteOnAttribute
EncryptContentAge ListSFTP RouteOnContent
EncryptContentPGP ListSmb RouteText
EnforceOrder LogAttribute SampleRecord
EvaluateJsonPath LogMessage ScanAccumulo
EvaluateXPath LookupAttribute ScanAttribute1
EvaluateXQuery LookupRecord ScanContent
ExecuteGroovyScript MergeContent ScanHBase
ExecuteInfluxDBQuery MergeRecord1 ScriptedFilterRecord
ExecuteProcess ModifyCompression ScriptedPartitionRecord
ExecuteScript ModifyHTMLElement ScriptedTransformRecord
ExecuteSQL MonitorActivity ScriptedValidateRecord
ExecuteSQLRecord MoveAzureDataLakeStorage ScrollElasticsearchHttp
ExecuteStateless1, 2 MoveHDFS SearchElasticsearch
ExecuteStreamCommand Notify SegmentContent
ExtractAvroMetadata PackageFlowFile SelectClouderaHiveQL
ExtractGrok PaginatedJsonQueryElasticsearch SelectHive3QL1
ExtractHL7Attributes ParseCEF1 SelectHiveQL
ExtractImageMetadata ParseEvtx SendTrapSNMP
ExtractRecordSchema ParseSyslog SetSNMP
ExtractText PartitionRecord SignContentPGP
FetchAzureBlobStorage PostHTTP SplitAvro
FetchAzureBlobStorage_v12 PublishAMQP SplitContent
FetchAzureDataLakeStorage PublishGCPubSub1 SplitJson1
FetchBoxFile PublishGCPubSubLite1 SplitRecord1
FetchCDPObjectStore PublishJMS1 SplitText1
FetchDistributedMapCache PublishKafka_1_0 SplitXml
FetchDropbox PublishKafka_2_0 StartAwsPollyJob
FetchElasticsearchHttp PublishKafka_2_6 StartAwsTextractJob
FetchFile PublishKafka2CDP StartAwsTranscribeJob
FetchFTP PublishKafka2RecordCDP StartAwsTranslateJob
FetchGCSObject PublishKafkaRecord_1_0 StartGcpVisionAnnotateFilesOperation
FetchGoogleDrive PublishKafkaRecord_2_0 StartGcpVisionAnnotateImagesOperation
FetchGridFS PublishKafkaRecord_2_6 StartSnowflakeIngest
FetchHBaseRow PublishMQTT TagS3Object
FetchHDFS PublishSlack TailFile
FetchParquet PutAccumuloRecord1 TransformXml
FetchS3Object PutAzureBlobStorage TriggerClouderaHiveMetaStoreEvent
FetchSFTP PutAzureBlobStorage_v12 TriggerHiveMetaStoreEvent
FetchSmb PutAzureCosmosDBRecord UnpackContent
FilterAttribute PutAzureDataLakeStorage1 UpdateAttribute
FlattenJson PutAzureEventHub UpdateByQueryElasticsearch
ForkEnrichment PutAzureQueueStorage1 UpdateClouderaHiveTable
ForkRecord PutAzureQueueStorage_v12 UpdateCounter
GenerateFlowFile PutBigQuery UpdateDatabaseTable
GenerateRecord PutBigQueryBatch UpdateDeltaLakeTable [Technical Preview]
GenerateTableFetch PutBigQueryStreaming UpdateHive3Table
GeoEnrichIP PutBoxFile UpdateHiveTable
GeoEnrichIPRecord PutCassandraQL1 UpdateRecord
GeohashRecord PutCassandraRecord1 ValidateCsv
GetAsanaObject PutCDPObjectStore ValidateJson
GetAwsPollyJobStatus PutClouderaHiveQL ValidateRecord
GetAwsTextractJobStatus PutClouderaHiveStreaming ValidateXml
GetAwsTranscribeJobStatus PutClouderaORC VerifyContentMAC
GetAwsTranslateJobStatus PutCloudWatchMetric VerifyContentPGP
GetAzureEventHub PutCouchbaseKey Wait
GetAzureQueueStorage PutDatabaseRecord1 YandexTranslate
GetAzureQueueStorage_v12 PutDistributedMapCache
GetCouchbaseKey1

Footnotes

  • 1 – indicates a memory-intensive processor
  • 2 – indicates a CPU-intensive processor

NiFi 2.3.0 in Cloudera Flow Management 4.2.1.400

AttributesToCSV GetBoxGroupMembers PutElasticsearchJson
AttributesToJSON GetCouchbaseKey1 PutElasticsearchRecord1
CalculateParquetOffsets GetElasticsearch PutEmail
CalculateParquetRowGroupOffsets GetFile PutFile
CalculateRecordStats GetFTP PutFTP1
CaptureChangeDebeziumDB2 [Technical Preview] GetGcpVisionAnnotateFilesOperationStatus PutGCSObject
CaptureChangeDebeziumMongoDB [Technical Preview] GetGcpVisionAnnotateImagesOperationStatus PutGoogleDrive
CaptureChangeDebeziumMySQL [Technical Preview] GetHBase PutGridFS
CaptureChangeDebeziumOracle GetHDFS PutHBaseCell
CaptureChangeDebeziumPostgreSQL GetHDFSFileInfo PutHBaseJSON
CaptureChangeDebeziumSQLServer [Technical Preview] GetHDFSSequenceFile PutHBaseRecord1
CaptureChangeMySQL GetHubSpot PutHDFS
ChunkDocument GetJiraIssue PutHive3QL
CompressContent1, 2 GetMongoRecord PutHive3Streaming
ConnectWebSocket GetS3ObjectTags PutIceberg
ConsumeAMQP GetSFTP PutIcebergCDC
ConsumeAzureEventHub GetShopify PutJiraIssue
ConsumeBoxEnterpriseEvents GetSNMP PutKinesisFirehose
ConsumeBoxEvents GetSnowflakeIngestStatus PutKinesisStream
ConsumeElasticsearch GetSolr PutKudu
ConsumeGCPubSub GetSplunk PutLambda
ConsumeGCPubSubLite GetSQS PutMongoBulkOperations
ConsumeJMS GetWorkdayReport PutMongoRecord
ConsumeKafka_2_6 GetZendesk PutORC1
ConsumeKafka2CDP HandleHttpRequest PutParquet
ConsumeKafka2RecordCDP HandleHttpResponse PutPinecone
ConsumeKafkaRecord_2_6 IdentifyMimeType PutPLC [Technical Preview]
ConsumeKinesisStream InvokeAWSGatewayApi PutRecord
ConsumeMQTT1 InvokeGRPC PutRedisHashRecord [Technical Preview]
ConsumePLC [Technical Preview] InvokeHTTP PutS3Object
ConsumeSlack InvokeScriptedProcessor PutSalesforceObject
ConsumeTwitter JoinEnrichment PutSFTP
ConsumeWindowsEventLog JoltTransformJSON PutSmbFile
ControlRate JoltTransformRecord PutSnowflakeInternalStage
ConvertAvroToJSON JSLTTransformJSON PutSNS
ConvertAvroToParquet JsonQueryElasticsearch PutSolrContentStream
ConvertCharacterSet ListAzureBlobStorage_v12 PutSolrRecord
ConvertJSONToSQL ListAzureDataLakeStorage PutSplunk
ConvertProtobuf ListBoxFile PutSplunkHTTP
ConvertRecord ListBoxFileInfo PutSQL
CopyAzureBlobStorage_v12 ListCDPObjectStore PutSQS1
CountText ListDatabaseTables PutSyslog
CreateHadoopSequenceFile ListDropbox PutTCP
CryptographicHashContent ListenBeats PutUDP
DecryptContent ListenFTP PutWebSocket
DecryptContentAge ListenGRPC PutZendeskTicket
DecryptContentCompatibility ListenHTTP QueryAirtableTable
DecryptContentPGP ListenNetFlow QueryCassandra
DeduplicateRecord ListenOTLP QueryChroma
DeleteAzureBlobStorage_v12 ListenRELP QueryDatabaseTable1
DeleteAzureDataLakeStorage ListenSlack QueryDatabaseTableRecord
DeleteByQueryElasticsearch ListenSyslog QueryPinecone
DeleteCDPObjectStore ListenTCP QueryRecord
DeleteDynamoDB ListenTCPRecord QuerySalesforceObject
DeleteGCSObject ListenTrapSNMP QuerySolr
DeleteGridFS ListenUDP QuerySplunkIndexingStatus
DeleteHBaseCells ListenUDPRecord QueryWhois
DeleteHBaseRow ListenWebSocket RemoveRecordField
DeleteHDFS ListFile RenameRecordField
DeleteS3Object ListFTP ReplaceText
DeleteSQS ListGCSBucket ReplaceTextWithMapping
DetectDuplicate ListGoogleDrive ResizeImage1
DistributeLoad ListHDFS RetryFlowFile
DuplicateFlowFile ListS3 RouteHL7
EncodeContent ListSFTP RouteOnAttribute
EncryptContentAge ListSmb RouteOnContent
EncryptContentPGP LogAttribute RouteText
EnforceOrder LogMessage SampleRecord
EvaluateJsonPath LookupAttribute SawmillTransformJSON
EvaluateXPath LookupRecord SawmillTransformRecord
EvaluateXQuery MergeContent ScanAccumulo
ExecuteGroovyScript MergeRecord1 ScanAttribute1
ExecuteProcess ModifyCompression ScanContent
ExecuteScript MonitorActivity ScanHBase
ExecuteSQL MoveAzureDataLakeStorage ScriptedFilterRecord
ExecuteSQLRecord MoveHDFS ScriptedPartitionRecord
ExecuteStateless1, 2 Notify ScriptedTransformRecord
ExecuteStreamCommand PackageFlowFile ScriptedValidateRecord
ExtractAvroMetadata PaginatedJsonQueryElasticsearch SearchElasticsearch
ExtractGrok ParseCEF1 SegmentContent
ExtractHL7Attributes ParseDocument SelectClouderaHiveQL
ExtractImageMetadata ParseEvtx SelectHive3QL1
ExtractRecordSchema ParseSyslog SendTrapSNMP
ExtractText PartitionRecord SetSNMP
FetchAzureBlobStorage_v12 PromptAzureOpenAI SignContentPGP
FetchAzureDataLakeStorage PromptBedrock SplitAvro
FetchBoxFile PromptChatGPT SplitContent
FetchBoxFileInfo PromptClaude SplitJson1
FetchBoxFileRepresentation PromptOpenAI SplitRecord1
FetchCDPObjectStore PublishAMQP SplitText1
FetchDistributedMapCache PublishGCPubSub1 SplitXml
FetchDropbox PublishGCPubSubLite1 StartAwsPollyJob
FetchFile PublishJMS1 StartAwsTextractJob
FetchFTP PublishKafka_2_6 StartAwsTranscribeJob
FetchGCSObject PublishKafka2CDP StartAwsTranslateJob
FetchGoogleDrive PublishKafka2RecordCDP StartGcpVisionAnnotateFilesOperation
FetchGridFS PublishKafkaRecord_2_6 StartGcpVisionAnnotateImagesOperation
FetchHBaseRow PublishMQTT StartSnowflakeIngest
FetchHDFS PublishSlack TagS3Object
FetchParquet PutAccumuloRecord1 TailFile
FetchPLC [Technical Preview] PutAzureBlobStorage_v12 TokenCount
FetchS3Object PutAzureCosmosDBRecord TransformXml
FetchSFTP PutAzureDataLakeStorage1 TriggerClouderaHiveMetaStoreEvent
FetchSmb PutAzureEventHub TriggerHiveMetaStoreEvent
FilterAttribute PutAzureQueueStorage_v12 UnpackContent
FlattenJson PutBigQuery UpdateAttribute
ForkEnrichment PutBoxFile UpdateByQueryElasticsearch
ForkRecord PutCassandraQL1 UpdateClouderaHiveTable
GenerateFlowFile PutCassandraRecord1 UpdateCounter
GenerateRecord PutCDPObjectStore UpdateDatabaseTable
GenerateTableFetch PutChroma UpdateDeltaLakeTable [Technical Preview]
GeoEnrichIP PutClouderaHiveQL UpdateHive3Table
GeoEnrichIPRecord PutClouderaHiveStreaming UpdateRecord
GeohashRecord PutClouderaORC ValidateCsv
GetAsanaObject PutCloudWatchMetric ValidateJson
GetAwsPollyJobStatus PutCouchbaseKey ValidateRecord
GetAwsTextractJobStatus PutDatabaseRecord1 ValidateXml
GetAwsTranscribeJobStatus PutDistributedMapCache VerifyContentMAC
GetAwsTranslateJobStatus PutDropbox VerifyContentPGP
GetAzureEventHub PutDynamoDB Wait
GetAzureQueueStorage_v12 PutDynamoDBRecord YandexTranslate
GetBoxFileCollaborators

Footnotes

  • 1 – indicates a memory-intensive processor
  • 2 – indicates a CPU-intensive processor