Supported NiFi processors

Cloudera Flow Management (CFM) is shipped with Apache NiFi and includes a set of processors, most of which are supported by Cloudera. You should be familiar with the available supported processors, and avoid using any unsupported processors in production environments.

Additional processors are developed and tested by the Cloudera community but are not officially supported by Cloudera. Processors are excluded for a variety of reasons, including insufficient reliability or incomplete test case coverage, declaration of non-production readiness by the community at large, and feature deviation from Cloudera best practices.

CFM 2.1.4 supports the following NiFi processors:

AttributesToCSV GetElasticsearch PutDynamoDB1
AttributesToJSON GetFile PutDynamoDBRecord
Base64EncodeContent GetFTP PutElasticsearchHttp1
CalculateRecordStats GetHBase PutElasticsearchHttpRecord
CaptureChangeMySQL GetHDFS PutElasticsearchJson
CompressContent1,2 GetHDFSFileInfo PutElasticsearchRecord
ConnectWebSocket GetHDFSSequenceFile PutEmail1
ConsumeAMQP GetHTMLElement PutFile
ConsumeAzureEventHub GetHTTP PutFTP
ConsumeEWS GetIgniteCache PutGCSObject
ConsumeGCPubSub GetJMSQueue PutGridFS
ConsumeGCPubSubLite GetJMSTopic PutHBaseCell1
ConsumeJMS GetMongoRecord PutHBaseJSON
ConsumeKafka_1_0 GetSFTP PutHBaseRecord
ConsumeKafka_2_0 GetSNMP PutHDFS
ConsumeKafka_2_6 GetSolr PutHive3QL
ConsumeKafka2CDP GetSplunk PutHive3Streaming
ConsumeKafka2RecordCDP GetSQS PutHiveQL
ConsumeKafkaRecord_1_0 GetTCP PutHiveStreaming
ConsumeKafkaRecord_2_0 GetTwitter PutHTMLElement
ConsumeKafkaRecord_2_6 HandleHttpRequest PutInfluxDB
ConsumeKinesisStream HandleHttpResponse PutJMS
ConsumeMQTT1 HashAttribute PutKinesisFirehose
ConsumeWindowsEventLog HashContent PutKinesisStream
ControlRate IdentifyMimeType PutKudu
ConvertAvroSchema InvokeAWSGatewayApi PutLambda
ConvertAvroToJSON InvokeGRPC PutMongoRecord
ConvertAvroToORC InvokeHTTP PutORC
ConvertAvroToParquet InvokeScriptedProcessor PutParquet
ConvertCharacterSet JoinEnrichment PutRecord
ConvertCSVToAvro JoltTransformJSON PutRiemann
ConvertJSONToAvro JoltTransformRecord PutS3Object
ConvertJSONToSQL JsonQueryElasticsearch PutSFTP
ConvertRecord ListAzureBlobStorage PutSNS
CreateHadoopSequenceFile ListAzureBlobStorage_v12 PutSolrContentStream
CryptographicHashAttribute ListAzureDataLakeStorage PutSolrRecord
CryptographicHashContent ListCDPObjectStore PutSplunk
DecryptContentPGP ListDatabaseTables PutSplunkHTTP1
DeduplicateRecord ListenFTP PutSQL
DeleteAzureBlobStorage ListenGRPC PutSQS
DeleteAzureBlobStorage_v12 ListenHTTP PutSyslog
DeleteAzureDataLakeStorage ListenRELP PutTCP
DeleteByQueryElasticsearch ListenSyslog PutUDP
DeleteCDPObjectStore ListenTCP PutWebSocket1
DeleteDynamoDB ListenTCPRecord QueryCassandra
DeleteGCSObject ListenTrapSNMP QueryDatabaseTable
DeleteGridFS ListenUDP QueryDatabaseTableRecord
DeleteHBaseCells ListenUDPRecord QueryElasticsearchHttp
DeleteHBaseRow ListenWebSocket QueryRecord
DeleteHDFS ListFile QuerySalesforceObject
DeleteS3Object ListFTP QuerySolr
DeleteSQS ListGCSBucket QuerySplunkIndexingStatus
DetectDuplicate ListHDFS QueryWhois
DistributeLoad ListS3 ReplaceText1
DuplicateFlowFile ListSFTP ReplaceTextWithMapping
EncryptContent2 LogAttribute ResizeImage
EncryptContentPGP LogMessage RetryFlowFile
EnforceOrder LookupAttribute RouteHL7
EvaluateJsonPath LookupRecord RouteOnAttribute
EvaluateXPath MergeContent1 RouteOnContent
EvaluateXQuery MergeRecord RouteText
ExecuteGroovyScript ModifyHTMLElement SampleRecord1
ExecuteInfluxDBQuery MonitorActivity ScanAccumulo
ExecuteProcess MoveAzureDataLakeStorage ScanAttribute
ExecuteScript Notify ScanContent
ExecuteSQL PaginatedJsonQueryElasticsearch ScanHBase
ExecuteSQLRecord ParseCEF ScriptedFilterRecord
ExecuteStateless ParseEvtx ScriptedPartitionRecord
ExecuteStreamCommand ParseSyslog ScriptedTransformRecord
ExtractAvroMetadata PartitionRecord ScriptedValidateRecord
ExtractGrok PostHTTP ScrollElasticsearchHttp
ExtractHL7Attributes PublishAMQP1 SearchElasticsearch
ExtractImageMetadata PublishGCPubSub1 SegmentContent
ExtractText PublishGCPubSubLite SelectHive3QL
FetchAzureBlobStorage PublishJMS1 SelectHiveQL
FetchAzureBlobStorage_v12 PublishKafka_1_0 SendTrapSNMP
FetchAzureDataLakeStorage PublishKafka_2_0 SetSNMP
FetchCDPObjectStore PublishKafka_2_6 SignContentPGP
FetchDistributedMapCache PublishKafka2CDP SplitAvro1
FetchElasticsearchHttp PublishKafka2RecordCDP SplitContent1
FetchFile PublishKafkaRecord_1_0 SplitJson1
FetchFTP PublishKafkaRecord_2_0 SplitRecord
FetchGCSObject PublishKafkaRecord_2_6 SplitText1
FetchGridFS PublishMQTT1 SplitXml1
FetchHBaseRow PutAccumuloRecord TagS3Object
FetchHDFS PutAzureBlobStorage TailFile
FetchParquet PutAzureBlobStorage_v12 TransformXml
FetchS3Object PutAzureCosmosDBRecord1 UnpackContent
FetchSFTP PutAzureDataLakeStorage UpdateAttribute
FlattenJson PutAzureEventHub1 UpdateByQueryElasticsearch
ForkEnrichment PutAzureQueueStorage UpdateCounter
ForkRecord PutBigQueryBatch UpdateHive3Table
GenerateFlowFile PutBigQueryStreaming1 UpdateHiveTable
GenerateTableFetch PutCassandraQL1 UpdateRecord
GeoEnrichIP PutCassandraRecord ValidateCsv
GeoEnrichIPRecord PutCDPObjectStore ValidateRecord
GeohashRecord PutCloudWatchMetric ValidateXml
GetAzureEventHub PutCouchbaseKey1 VerifyContentPGP
GetAzureQueueStorage PutDatabaseRecord Wait
GetCouchbaseKey1 PutDistributedMapCache YandexTranslate

Footnotes

  • 1 – indicates a memory intensive processor
  • 2 – indicates a CPU intensive processor