Supported NiFi processors

Cloudera Flow Management (CFM) is shipped with Apache NiFi and includes a set of processors, most of which are supported by Cloudera. You should be familiar with the available supported processors, and avoid using any unsupported processors in production environments.

Additional processors are developed and tested by the Cloudera community but are not officially supported by Cloudera. Processors are excluded for a variety of reasons, including insufficient reliability or incomplete test case coverage, declaration of non-production readiness by the community at large, and feature deviation from Cloudera best practices.

AttributesToCSV GetElasticsearch PutDropbox
AttributesToJSON GetFile PutDynamoDB
Base64EncodeContent GetFTP PutDynamoDBRecord
CalculateParquetOffsets GetGcpVisionAnnotateFilesOperationStatus PutElasticsearchHttp1
CalculateParquetRowGroupOffsets GetGcpVisionAnnotateImagesOperationStatus PutElasticsearchHttpRecord1
CalculateRecordStats GetHBase PutElasticsearchJson
CaptureChangeDebeziumDB2 [Technical Preview] GetHDFS PutElasticsearchRecord1
CaptureChangeDebeziumMySQL [Technical Preview] GetHDFSFileInfo PutEmail
CaptureChangeDebeziumOracle [Technical Preview] GetHDFSSequenceFile PutFile
CaptureChangeDebeziumPostgreSQL [Technical Preview] GetHTMLElement PutFTP1
CaptureChangeDebeziumSQLServer [Technical Preview] GetHTTP PutGCSObject
CaptureChangeMySQL GetHubSpot PutGoogleDrive
CompressContent1, 2 GetIgniteCache PutGridFS
ConnectWebSocket GetJiraIssue PutHBaseCell
ConsumeAMQP GetJMSQueue PutHBaseJSON
ConsumeAzureEventHub GetJMSTopic PutHBaseRecord1
ConsumeElasticsearch GetMongoRecord PutHDFS
ConsumeEWS GetSFTP PutHive3QL
ConsumeGCPubSub GetShopify PutHive3Streaming
ConsumeGCPubSubLite GetSNMP PutHiveQL
ConsumeJMS GetSnowflakeIngestStatus PutHiveStreaming
ConsumeKafka_1_0 GetSolr PutHTMLElement
ConsumeKafka_2_0 GetSplunk PutIceberg [Technical Preview]
ConsumeKafka_2_6 GetSQS PutIcebergCDC
ConsumeKafka2CDP GetTCP PutInfluxDB
ConsumeKafka2RecordCDP GetTwitter PutJiraIssue
ConsumeKafkaRecord_1_0 GetWorkdayReport PutJMS1
ConsumeKafkaRecord_2_0 GetZendesk PutKinesisFirehose
ConsumeKafkaRecord_2_6 HandleHttpRequest PutKinesisStream
ConsumeKinesisStream HandleHttpResponse PutKudu
ConsumeMQTT1 HashAttribute PutLambda
ConsumeTwitter HashContent PutMongoRecord
ConsumeWindowsEventLog IdentifyMimeType PutORC1
ControlRate InvokeAWSGatewayApi PutParquet
ConvertAvroSchema InvokeGRPC PutRecord
ConvertAvroToJSON InvokeGRPC PutRedisHashRecord [Technical Preview]
ConvertAvroToORC InvokeHTTP PutRiemann
ConvertAvroToParquet InvokeScriptedProcessor PutS3Object
ConvertCharacterSet JoinEnrichment PutSalesforceObject
ConvertCSVToAvro JoltTransformJSON PutSFTP
ConvertJSONToAvro JoltTransformRecord PutSmbFile
ConvertJSONToSQL JSLTTransformJSON PutSnowflakeInternalStage
ConvertProtobuf JsonQueryElasticsearch PutSNS
ConvertRecord ListAzureBlobStorage PutSolrContentStream
CreateHadoopSequenceFile ListAzureBlobStorage_v12 PutSolrRecord
CryptographicHashAttribute ListAzureDataLakeStorage PutSplunk
CryptographicHashContent ListBoxFile PutSplunkHTTP
DecryptContent ListCDPObjectStore PutSQL
DecryptContentAge ListDatabaseTables PutSQS1
DecryptContentCompatibility ListDropbox PutSyslog
DecryptContentPGP ListenBeats PutTCP
DeduplicateRecord ListenFTP PutUDP
DeleteAzureBlobStorage ListenGRPC* PutWebSocket
DeleteAzureBlobStorage_v12 ListenGRPC* PutZendeskTicket
DeleteAzureDataLakeStorage ListenHTTP QueryAirtableTable
DeleteByQueryElasticsearch ListenNetFlow QueryCassandra
DeleteCDPObjectStore ListenOTLP QueryDatabaseTable1
DeleteDynamoDB ListenRELP QueryDatabaseTableRecord
DeleteGCSObject ListenSyslog QueryElasticsearchHttp
DeleteGridFS ListenTCP QueryRecord
DeleteHBaseCells ListenTCPRecord QuerySalesforceObject
DeleteHBaseRow ListenTrapSNMP QuerySolr
DeleteHDFS ListenUDP QuerySplunkIndexingStatus
DeleteS3Object ListenUDPRecord QueryWhois
DeleteSQS ListenWebSocket RemoveRecordField
DetectDuplicate ListFile ReplaceText
DistributeLoad ListFTP ReplaceTextWithMapping
DuplicateFlowFile ListGCSBucket ResizeImage1
EncodeContent ListGoogleDrive RetryFlowFile
EncryptContent2 ListHDFS RouteHL7
EncryptContentAge ListS3 RouteOnAttribute
EncryptContentPGP ListSFTP RouteOnContent
EnforceOrder ListSmb RouteText
EvaluateJsonPath LogAttribute SampleRecord
EvaluateXPath LogMessage ScanAccumulo
EvaluateXQuery LookupAttribute ScanAttribute1
ExecuteGroovyScript LookupRecord ScanContent
ExecuteInfluxDBQuery MergeContent ScanHBase
ExecuteProcess MergeRecord1 ScriptedFilterRecord
ExecuteScript ModifyCompression ScriptedPartitionRecord
ExecuteSQL ModifyHTMLElement ScriptedTransformRecord
ExecuteSQLRecord MonitorActivity ScriptedValidateRecord
ExecuteStateless1,2 MoveAzureDataLakeStorage ScrollElasticsearchHttp
ExecuteStreamCommand MoveHDFS SearchElasticsearch
ExtractAvroMetadata Notify SegmentContent
ExtractGrok PackageFlowFile SelectClouderaHiveQL
ExtractHL7Attributes PaginatedJsonQueryElasticsearch SelectHive3QL1
ExtractImageMetadata ParseCEF1 SelectHiveQL
ExtractRecordSchema ParseEvtx SendTrapSNMP
ExtractText ParseSyslog SetSNMP
FetchAzureBlobStorage PartitionRecord SignContentPGP
FetchAzureBlobStorage_v12 PostHTTP SplitAvro
FetchAzureDataLakeStorage PublishAMQP SplitContent
FetchBoxFile PublishGCPubSub1 SplitJson1
FetchCDPObjectStore PublishGCPubSubLite1 SplitRecord1
FetchDistributedMapCache PublishJMS1 SplitText1
FetchDropbox PublishKafka_1_0 SplitXml
FetchElasticsearchHttp PublishKafka_2_0 StartAwsPollyJob
FetchFile PublishKafka_2_6 StartAwsTextractJob
FetchFTP PublishKafka2CDP StartAwsTranscribeJob
FetchGCSObject PublishKafka2RecordCDP StartAwsTranslateJob
FetchGoogleDrive PublishKafkaRecord_1_0 StartGcpVisionAnnotateFilesOperation
FetchGridFS PublishKafkaRecord_2_0 StartGcpVisionAnnotateImagesOperation
FetchHBaseRow PublishKafkaRecord_2_6 StartSnowflakeIngest
FetchHDFS PublishMQTT TagS3Object
FetchParquet PublishSlack TailFile
FetchS3Object PutAccumuloRecord1 TransformXml
FetchSFTP PutAzureBlobStorage TriggerClouderaHiveMetaStoreEvent
FetchSmb PutAzureBlobStorage_v12 TriggerHiveMetaStoreEvent
FilterAttribute PutAzureCosmosDBRecord UnpackContent
FlattenJson PutAzureDataLakeStorage1 UpdateAttribute
ForkEnrichment PutAzureEventHub UpdateByQueryElasticsearch
ForkRecord PutAzureQueueStorage1 UpdateClouderaHiveTable
GenerateFlowFile PutAzureQueueStorage_v12 UpdateCounter
GenerateRecord PutBigQuery UpdateDatabaseTable
GenerateTableFetch PutBigQueryBatch UpdateDeltaLakeTable [Technical Preview]
GeoEnrichIP PutBigQueryStreaming UpdateHive3Table
GeoEnrichIPRecord PutBoxFile UpdateHiveTable
GeohashRecord PutCassandraQL1 UpdateRecord
GetAsanaObject PutCassandraRecord1 ValidateCsv
GetAwsPollyJobStatus PutCDPObjectStore ValidateJson
GetAwsTextractJobStatus PutClouderaHiveQL ValidateRecord
GetAwsTranscribeJobStatus PutClouderaHiveStreaming ValidateXml
GetAwsTranslateJobStatus PutClouderaORC VerifyContentMAC
GetAzureEventHub PutCloudWatchMetric VerifyContentPGP
GetAzureQueueStorage PutCouchbaseKey Wait
GetAzureQueueStorage_v12 PutDatabaseRecord1 YandexTranslate
GetCouchbaseKey1 PutDistributedMapCache

Footnotes

  • 1 – indicates a memory intensive processor
  • 2 – indicates a CPU intensive processor
  • * – there are two ListenGRPC processors available, one is provided by Apache and the other is provided by Cloudera