Supported NiFi processors

Cloudera Flow Management is shipped with Apache NiFi and includes a set of processors, most of which are supported by Cloudera. You should be familiar with the available supported processors, and avoid using any unsupported processors in production environments.

Additional processors are developed and tested by the Cloudera community but are not officially supported by Cloudera. Processors are excluded for a variety of reasons, including insufficient reliability or incomplete test case coverage, declaration of non-production readiness by the community at large, and feature deviation from Cloudera best practices.

AttributesToCSV GetElasticsearch PutDropbox
AttributesToJSON GetFile PutDynamoDB
Base64EncodeContent GetFTP PutDynamoDBRecord
CalculateParquetOffsets GetGcpVisionAnnotateFilesOperationStatus PutElasticsearchHttp1
CalculateParquetRowGroupOffsets GetGcpVisionAnnotateImagesOperationStatus PutElasticsearchHttpRecord1
CalculateRecordStats GetHBase PutElasticsearchJson
CaptureChangeDebeziumDB2 [Technical Preview] GetHDFS PutElasticsearchRecord1
CaptureChangeDebeziumMySQL [Technical Preview] GetHDFSFileInfo PutEmail
CaptureChangeDebeziumOracle [Technical Preview] GetHDFSSequenceFile PutFile
CaptureChangeDebeziumPostgreSQL [Technical Preview] GetHTMLElement PutFTP1
CaptureChangeDebeziumSQLServer [Technical Preview] GetHTTP PutGCSObject
CaptureChangeMySQL GetHubSpot PutGoogleDrive
CompressContent1, 2 GetIgniteCache PutGridFS
ConnectWebSocket GetJiraIssue PutHBaseCell
ConsumeAMQP GetJMSQueue PutHBaseJSON
ConsumeAzureEventHub GetJMSTopic PutHBaseRecord1
ConsumeElasticsearch GetMongoRecord PutHDFS
ConsumeEWS GetSFTP PutHive3QL
ConsumeGCPubSub GetShopify PutHive3Streaming
ConsumeGCPubSubLite GetSNMP PutHiveQL
ConsumeJMS GetSnowflakeIngestStatus PutHiveStreaming
ConsumeKafka_1_0 GetSolr PutHTMLElement
ConsumeKafka_2_0 GetSplunk PutIceberg [Technical Preview]
ConsumeKafka_2_6 GetSQS PutIcebergCDC
ConsumeKafka2CDP GetTCP PutInfluxDB
ConsumeKafka2RecordCDP GetTwitter PutJiraIssue
ConsumeKafkaRecord_1_0 GetWorkdayReport PutJMS1
ConsumeKafkaRecord_2_0 GetZendesk PutKinesisFirehose
ConsumeKafkaRecord_2_6 HandleHttpRequest PutKinesisStream
ConsumeKinesisStream HandleHttpResponse PutKudu
ConsumeMQTT1 HashAttribute PutLambda
ConsumeTwitter HashContent PutMongoRecord
ConsumeWindowsEventLog IdentifyMimeType PutORC1
ControlRate InvokeAWSGatewayApi PutParquet
ConvertAvroSchema InvokeGRPC PutRecord
ConvertAvroToJSON InvokeGRPC PutRedisHashRecord [Technical Preview]
ConvertAvroToORC InvokeHTTP PutRiemann
ConvertAvroToParquet InvokeScriptedProcessor PutS3Object
ConvertCharacterSet JoinEnrichment PutSalesforceObject
ConvertCSVToAvro JoltTransformJSON PutSFTP
ConvertJSONToAvro JoltTransformRecord PutSmbFile
ConvertJSONToSQL JSLTTransformJSON PutSnowflakeInternalStage
ConvertProtobuf JsonQueryElasticsearch PutSNS
ConvertRecord ListAzureBlobStorage PutSolrContentStream
CountText [2.1.7 SP2+ only] ListAzureBlobStorage_v12 PutSolrRecord
CreateHadoopSequenceFile ListAzureDataLakeStorage PutSplunk
CryptographicHashAttribute ListBoxFile PutSplunkHTTP
CryptographicHashContent ListCDPObjectStore PutSQL
DecryptContent ListDatabaseTables PutSQS1
DecryptContentAge ListDropbox PutSyslog
DecryptContentCompatibility ListenBeats PutTCP
DecryptContentPGP ListenFTP PutUDP
DeduplicateRecord ListenGRPC* PutWebSocket
DeleteAzureBlobStorage ListenGRPC* PutZendeskTicket
DeleteAzureBlobStorage_v12 ListenHTTP QueryAirtableTable
DeleteAzureDataLakeStorage ListenNetFlow QueryCassandra
DeleteByQueryElasticsearch ListenOTLP QueryDatabaseTable1
DeleteCDPObjectStore ListenRELP QueryDatabaseTableRecord
DeleteDynamoDB ListenSyslog QueryElasticsearchHttp
DeleteGCSObject ListenTCP QueryRecord
DeleteGridFS ListenTCPRecord QuerySalesforceObject
DeleteHBaseCells ListenTrapSNMP QuerySolr
DeleteHBaseRow ListenUDP QuerySplunkIndexingStatus
DeleteHDFS ListenUDPRecord QueryWhois
DeleteS3Object ListenWebSocket RemoveRecordField
DeleteSQS ListFile ReplaceText
DetectDuplicate ListFTP ReplaceTextWithMapping
DistributeLoad ListGCSBucket ResizeImage1
DuplicateFlowFile ListGoogleDrive RetryFlowFile
EncodeContent ListHDFS RouteHL7
EncryptContent2 ListS3 RouteOnAttribute
EncryptContentAge ListSFTP RouteOnContent
EncryptContentPGP ListSmb RouteText
EnforceOrder LogAttribute SampleRecord
EvaluateJsonPath LogMessage ScanAccumulo
EvaluateXPath LookupAttribute ScanAttribute1
EvaluateXQuery LookupRecord ScanContent
ExecuteGroovyScript MergeContent ScanHBase
ExecuteInfluxDBQuery MergeRecord1 ScriptedFilterRecord
ExecuteProcess ModifyCompression ScriptedPartitionRecord
ExecuteScript ModifyHTMLElement ScriptedTransformRecord
ExecuteSQL MonitorActivity ScriptedValidateRecord
ExecuteSQLRecord MoveAzureDataLakeStorage ScrollElasticsearchHttp
ExecuteStateless1,2 MoveHDFS SearchElasticsearch
ExecuteStreamCommand Notify SegmentContent
ExtractAvroMetadata PackageFlowFile SelectClouderaHiveQL
ExtractGrok PaginatedJsonQueryElasticsearch SelectHive3QL1
ExtractHL7Attributes ParseCEF1 SelectHiveQL
ExtractImageMetadata ParseEvtx SendTrapSNMP
ExtractRecordSchema ParseSyslog SetSNMP
ExtractText PartitionRecord SignContentPGP
FetchAzureBlobStorage PostHTTP SplitAvro
FetchAzureBlobStorage_v12 PublishAMQP SplitContent
FetchAzureDataLakeStorage PublishGCPubSub1 SplitJson1
FetchBoxFile PublishGCPubSubLite1 SplitRecord1
FetchCDPObjectStore PublishJMS1 SplitText1
FetchDistributedMapCache PublishKafka_1_0 SplitXml
FetchDropbox PublishKafka_2_0 StartAwsPollyJob
FetchElasticsearchHttp PublishKafka_2_6 StartAwsTextractJob
FetchFile PublishKafka2CDP StartAwsTranscribeJob
FetchFTP PublishKafka2RecordCDP StartAwsTranslateJob
FetchGCSObject PublishKafkaRecord_1_0 StartGcpVisionAnnotateFilesOperation
FetchGoogleDrive PublishKafkaRecord_2_0 StartGcpVisionAnnotateImagesOperation
FetchGridFS PublishKafkaRecord_2_6 StartSnowflakeIngest
FetchHBaseRow PublishMQTT TagS3Object
FetchHDFS PublishSlack TailFile
FetchParquet PutAccumuloRecord1 TransformXml
FetchS3Object PutAzureBlobStorage TriggerClouderaHiveMetaStoreEvent
FetchSFTP PutAzureBlobStorage_v12 TriggerHiveMetaStoreEvent
FetchSmb PutAzureCosmosDBRecord UnpackContent
FilterAttribute PutAzureDataLakeStorage1 UpdateAttribute
FlattenJson PutAzureEventHub UpdateByQueryElasticsearch
ForkEnrichment PutAzureQueueStorage1 UpdateClouderaHiveTable
ForkRecord PutAzureQueueStorage_v12 UpdateCounter
GenerateFlowFile PutBigQuery UpdateDatabaseTable
GenerateRecord PutBigQueryBatch UpdateDeltaLakeTable [Technical Preview]
GenerateTableFetch PutBigQueryStreaming UpdateHive3Table
GeoEnrichIP PutBoxFile UpdateHiveTable
GeoEnrichIPRecord PutCassandraQL1 UpdateRecord
GeohashRecord PutCassandraRecord1 ValidateCsv
GetAsanaObject PutCDPObjectStore ValidateJson
GetAwsPollyJobStatus PutClouderaHiveQL ValidateRecord
GetAwsTextractJobStatus PutClouderaHiveStreaming ValidateXml
GetAwsTranscribeJobStatus PutClouderaORC VerifyContentMAC
GetAwsTranslateJobStatus PutCloudWatchMetric VerifyContentPGP
GetAzureEventHub PutCouchbaseKey Wait
GetAzureQueueStorage PutDatabaseRecord1 YandexTranslate
GetAzureQueueStorage_v12 PutDistributedMapCache
GetCouchbaseKey1

Footnotes

  • 1 – indicates a memory intensive processor
  • 2 – indicates a CPU intensive processor
  • * – there are two ListenGRPC processors available, one is provided by Apache and the other is provided by Cloudera