-
Processors
-
AttributeRollingWindow 2.3.0.4.10.0.0-147
-
AttributesToCSV 2.3.0.4.10.0.0-147
-
AttributesToJSON 2.3.0.4.10.0.0-147
-
CalculateParquetOffsets 2.3.0.4.10.0.0-147
-
CalculateParquetRowGroupOffsets 2.3.0.4.10.0.0-147
-
CalculateRecordStats 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumDB2 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumMongoDB 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumMySQL 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumOracle 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumPostgreSQL 2.3.0.4.10.0.0-147
-
CaptureChangeDebeziumSQLServer 2.3.0.4.10.0.0-147
-
CaptureChangeMySQL 2.3.0.4.10.0.0-147
-
CompressContent 2.3.0.4.10.0.0-147
-
ConnectWebSocket 2.3.0.4.10.0.0-147
-
ConsumeAMQP 2.3.0.4.10.0.0-147
-
ConsumeAzureEventHub 2.3.0.4.10.0.0-147
-
ConsumeBoxEnterpriseEvents 2.3.0.4.10.0.0-147
-
ConsumeBoxEvents 2.3.0.4.10.0.0-147
-
ConsumeElasticsearch 2.3.0.4.10.0.0-147
-
ConsumeGCPubSub 2.3.0.4.10.0.0-147
-
ConsumeIMAP 2.3.0.4.10.0.0-147
-
ConsumeJMS 2.3.0.4.10.0.0-147
-
ConsumeKafka 2.3.0.4.10.0.0-147
-
ConsumeKafka_2_6 2.3.0.4.10.0.0-147
-
ConsumeKafka2CDP 2.3.0.4.10.0.0-147
-
ConsumeKafka2RecordCDP 2.3.0.4.10.0.0-147
-
ConsumeKafkaRecord_2_6 2.3.0.4.10.0.0-147
-
ConsumeKinesisStream 2.3.0.4.10.0.0-147
-
ConsumeMQTT 2.3.0.4.10.0.0-147
-
ConsumePLC 2.3.0.4.10.0.0-147
-
ConsumePOP3 2.3.0.4.10.0.0-147
-
ConsumeSlack 2.3.0.4.10.0.0-147
-
ConsumeTwitter 2.3.0.4.10.0.0-147
-
ConsumeWindowsEventLog 2.3.0.4.10.0.0-147
-
ControlRate 2.3.0.4.10.0.0-147
-
ConvertAvroToParquet 2.3.0.4.10.0.0-147
-
ConvertCharacterSet 2.3.0.4.10.0.0-147
-
ConvertProtobuf 2.3.0.4.10.0.0-147
-
ConvertRecord 2.3.0.4.10.0.0-147
-
CopyAzureBlobStorage_v12 2.3.0.4.10.0.0-147
-
CopyS3Object 2.3.0.4.10.0.0-147
-
CountText 2.3.0.4.10.0.0-147
-
CreateHadoopSequenceFile 2.3.0.4.10.0.0-147
-
CryptographicHashContent 2.3.0.4.10.0.0-147
-
DebugFlow 2.3.0.4.10.0.0-147
-
DecryptContentAge 2.3.0.4.10.0.0-147
-
DecryptContentPGP 2.3.0.4.10.0.0-147
-
DeduplicateRecord 2.3.0.4.10.0.0-147
-
DeleteAzureBlobStorage_v12 2.3.0.4.10.0.0-147
-
DeleteAzureDataLakeStorage 2.3.0.4.10.0.0-147
-
DeleteByQueryElasticsearch 2.3.0.4.10.0.0-147
-
DeleteCDPObjectStore 2.3.0.4.10.0.0-147
-
DeleteDynamoDB 2.3.0.4.10.0.0-147
-
DeleteFile 2.3.0.4.10.0.0-147
-
DeleteGCSObject 2.3.0.4.10.0.0-147
-
DeleteGridFS 2.3.0.4.10.0.0-147
-
DeleteHBaseCells 2.3.0.4.10.0.0-147
-
DeleteHBaseRow 2.3.0.4.10.0.0-147
-
DeleteHDFS 2.3.0.4.10.0.0-147
-
DeleteMongo 2.3.0.4.10.0.0-147
-
DeleteS3Object 2.3.0.4.10.0.0-147
-
DeleteSFTP 2.3.0.4.10.0.0-147
-
DeleteSQS 2.3.0.4.10.0.0-147
-
DetectDuplicate 2.3.0.4.10.0.0-147
-
DistributeLoad 2.3.0.4.10.0.0-147
-
DuplicateFlowFile 2.3.0.4.10.0.0-147
-
EncodeContent 2.3.0.4.10.0.0-147
-
EncryptContentAge 2.3.0.4.10.0.0-147
-
EncryptContentPGP 2.3.0.4.10.0.0-147
-
EnforceOrder 2.3.0.4.10.0.0-147
-
EvaluateJsonPath 2.3.0.4.10.0.0-147
-
EvaluateXPath 2.3.0.4.10.0.0-147
-
EvaluateXQuery 2.3.0.4.10.0.0-147
-
ExecuteGraphQuery 2.3.0.4.10.0.0-147
-
ExecuteGraphQueryRecord 2.3.0.4.10.0.0-147
-
ExecuteGroovyScript 2.3.0.4.10.0.0-147
-
ExecuteProcess 2.3.0.4.10.0.0-147
-
ExecuteScript 2.3.0.4.10.0.0-147
-
ExecuteSparkInteractive 2.3.0.4.10.0.0-147
-
ExecuteSQL 2.3.0.4.10.0.0-147
-
ExecuteSQLRecord 2.3.0.4.10.0.0-147
-
ExecuteStreamCommand 2.3.0.4.10.0.0-147
-
ExtractAvroMetadata 2.3.0.4.10.0.0-147
-
ExtractDocumentText 2.3.0.4.10.0.0-147
-
ExtractEmailAttachments 2.3.0.4.10.0.0-147
-
ExtractEmailHeaders 2.3.0.4.10.0.0-147
-
ExtractGrok 2.3.0.4.10.0.0-147
-
ExtractHL7Attributes 2.3.0.4.10.0.0-147
-
ExtractImageMetadata 2.3.0.4.10.0.0-147
-
ExtractMediaMetadata 2.3.0.4.10.0.0-147
-
ExtractRecordSchema 2.3.0.4.10.0.0-147
-
ExtractText 2.3.0.4.10.0.0-147
-
FetchAzureBlobStorage_v12 2.3.0.4.10.0.0-147
-
FetchAzureDataLakeStorage 2.3.0.4.10.0.0-147
-
FetchBoxFile 2.3.0.4.10.0.0-147
-
FetchBoxFileInfo 2.3.0.4.10.0.0-147
-
FetchBoxFileRepresentation 2.3.0.4.10.0.0-147
-
FetchCDPObjectStore 2.3.0.4.10.0.0-147
-
FetchDistributedMapCache 2.3.0.4.10.0.0-147
-
FetchDropbox 2.3.0.4.10.0.0-147
-
FetchFile 2.3.0.4.10.0.0-147
-
FetchFTP 2.3.0.4.10.0.0-147
-
FetchGCSObject 2.3.0.4.10.0.0-147
-
FetchGoogleDrive 2.3.0.4.10.0.0-147
-
FetchGridFS 2.3.0.4.10.0.0-147
-
FetchHBaseRow 2.3.0.4.10.0.0-147
-
FetchHDFS 2.3.0.4.10.0.0-147
-
FetchParquet 2.3.0.4.10.0.0-147
-
FetchPLC 2.3.0.4.10.0.0-147
-
FetchS3Object 2.3.0.4.10.0.0-147
-
FetchSFTP 2.3.0.4.10.0.0-147
-
FetchSmb 2.3.0.4.10.0.0-147
-
FilterAttribute 2.3.0.4.10.0.0-147
-
FlattenJson 2.3.0.4.10.0.0-147
-
ForkEnrichment 2.3.0.4.10.0.0-147
-
ForkRecord 2.3.0.4.10.0.0-147
-
GenerateFlowFile 2.3.0.4.10.0.0-147
-
GenerateRecord 2.3.0.4.10.0.0-147
-
GenerateTableFetch 2.3.0.4.10.0.0-147
-
GeoEnrichIP 2.3.0.4.10.0.0-147
-
GeoEnrichIPRecord 2.3.0.4.10.0.0-147
-
GeohashRecord 2.3.0.4.10.0.0-147
-
GetAsanaObject 2.3.0.4.10.0.0-147
-
GetAwsPollyJobStatus 2.3.0.4.10.0.0-147
-
GetAwsTextractJobStatus 2.3.0.4.10.0.0-147
-
GetAwsTranscribeJobStatus 2.3.0.4.10.0.0-147
-
GetAwsTranslateJobStatus 2.3.0.4.10.0.0-147
-
GetAzureEventHub 2.3.0.4.10.0.0-147
-
GetAzureQueueStorage_v12 2.3.0.4.10.0.0-147
-
GetBoxFileCollaborators 2.3.0.4.10.0.0-147
-
GetBoxGroupMembers 2.3.0.4.10.0.0-147
-
GetCouchbaseKey 2.3.0.4.10.0.0-147
-
GetDynamoDB 2.3.0.4.10.0.0-147
-
GetElasticsearch 2.3.0.4.10.0.0-147
-
GetFile 2.3.0.4.10.0.0-147
-
GetFileResource 2.3.0.4.10.0.0-147
-
GetFTP 2.3.0.4.10.0.0-147
-
GetGcpVisionAnnotateFilesOperationStatus 2.3.0.4.10.0.0-147
-
GetGcpVisionAnnotateImagesOperationStatus 2.3.0.4.10.0.0-147
-
GetHBase 2.3.0.4.10.0.0-147
-
GetHDFS 2.3.0.4.10.0.0-147
-
GetHDFSEvents 2.3.0.4.10.0.0-147
-
GetHDFSFileInfo 2.3.0.4.10.0.0-147
-
GetHDFSSequenceFile 2.3.0.4.10.0.0-147
-
GetHubSpot 2.3.0.4.10.0.0-147
-
GetJiraIssue 2.3.0.4.10.0.0-147
-
GetMongo 2.3.0.4.10.0.0-147
-
GetMongoRecord 2.3.0.4.10.0.0-147
-
GetS3ObjectMetadata 2.3.0.4.10.0.0-147
-
GetS3ObjectTags 2.3.0.4.10.0.0-147
-
GetSFTP 2.3.0.4.10.0.0-147
-
GetShopify 2.3.0.4.10.0.0-147
-
GetSlackReaction 2.3.0.4.10.0.0-147
-
GetSmbFile 2.3.0.4.10.0.0-147
-
GetSNMP 2.3.0.4.10.0.0-147
-
GetSnowflakeIngestStatus 2.3.0.4.10.0.0-147
-
GetSolr 2.3.0.4.10.0.0-147
-
GetSplunk 2.3.0.4.10.0.0-147
-
GetSQS 2.3.0.4.10.0.0-147
-
GetTCP 2.3.0.4.10.0.0-147
-
GetWorkdayReport 2.3.0.4.10.0.0-147
-
GetZendesk 2.3.0.4.10.0.0-147
-
HandleHttpRequest 2.3.0.4.10.0.0-147
-
HandleHttpResponse 2.3.0.4.10.0.0-147
-
IdentifyMimeType 2.3.0.4.10.0.0-147
-
InvokeGRPC 2.3.0.4.10.0.0-147
-
InvokeHTTP 2.3.0.4.10.0.0-147
-
InvokeScriptedProcessor 2.3.0.4.10.0.0-147
-
ISPEnrichIP 2.3.0.4.10.0.0-147
-
JoinEnrichment 2.3.0.4.10.0.0-147
-
JoltTransformJSON 2.3.0.4.10.0.0-147
-
JoltTransformRecord 2.3.0.4.10.0.0-147
-
JSLTTransformJSON 2.3.0.4.10.0.0-147
-
JsonQueryElasticsearch 2.3.0.4.10.0.0-147
-
ListAzureBlobStorage_v12 2.3.0.4.10.0.0-147
-
ListAzureDataLakeStorage 2.3.0.4.10.0.0-147
-
ListBoxFile 2.3.0.4.10.0.0-147
-
ListBoxFileInfo 2.3.0.4.10.0.0-147
-
ListCDPObjectStore 2.3.0.4.10.0.0-147
-
ListDatabaseTables 2.3.0.4.10.0.0-147
-
ListDropbox 2.3.0.4.10.0.0-147
-
ListenBeats 2.3.0.4.10.0.0-147
-
ListenFTP 2.3.0.4.10.0.0-147
-
ListenGRPC 2.3.0.4.10.0.0-147
-
ListenHTTP 2.3.0.4.10.0.0-147
-
ListenNetFlow 2.3.0.4.10.0.0-147
-
ListenOTLP 2.3.0.4.10.0.0-147
-
ListenSlack 2.3.0.4.10.0.0-147
-
ListenSyslog 2.3.0.4.10.0.0-147
-
ListenTCP 2.3.0.4.10.0.0-147
-
ListenTrapSNMP 2.3.0.4.10.0.0-147
-
ListenUDP 2.3.0.4.10.0.0-147
-
ListenUDPRecord 2.3.0.4.10.0.0-147
-
ListenWebSocket 2.3.0.4.10.0.0-147
-
ListFile 2.3.0.4.10.0.0-147
-
ListFTP 2.3.0.4.10.0.0-147
-
ListGCSBucket 2.3.0.4.10.0.0-147
-
ListGoogleDrive 2.3.0.4.10.0.0-147
-
ListHBaseRegions 2.3.0.4.10.0.0-147
-
ListHDFS 2.3.0.4.10.0.0-147
-
ListS3 2.3.0.4.10.0.0-147
-
ListSFTP 2.3.0.4.10.0.0-147
-
ListSmb 2.3.0.4.10.0.0-147
-
LogAttribute 2.3.0.4.10.0.0-147
-
LogMessage 2.3.0.4.10.0.0-147
-
LookupAttribute 2.3.0.4.10.0.0-147
-
LookupRecord 2.3.0.4.10.0.0-147
-
MergeContent 2.3.0.4.10.0.0-147
-
MergeRecord 2.3.0.4.10.0.0-147
-
ModifyBytes 2.3.0.4.10.0.0-147
-
ModifyCompression 2.3.0.4.10.0.0-147
-
MonitorActivity 2.3.0.4.10.0.0-147
-
MoveAzureDataLakeStorage 2.3.0.4.10.0.0-147
-
MoveHDFS 2.3.0.4.10.0.0-147
-
Notify 2.3.0.4.10.0.0-147
-
PackageFlowFile 2.3.0.4.10.0.0-147
-
PaginatedJsonQueryElasticsearch 2.3.0.4.10.0.0-147
-
ParseEvtx 2.3.0.4.10.0.0-147
-
ParseNetflowv5 2.3.0.4.10.0.0-147
-
ParseSyslog 2.3.0.4.10.0.0-147
-
ParseSyslog5424 2.3.0.4.10.0.0-147
-
PartitionRecord 2.3.0.4.10.0.0-147
-
PublishAMQP 2.3.0.4.10.0.0-147
-
PublishGCPubSub 2.3.0.4.10.0.0-147
-
PublishJMS 2.3.0.4.10.0.0-147
-
PublishKafka 2.3.0.4.10.0.0-147
-
PublishKafka_2_6 2.3.0.4.10.0.0-147
-
PublishKafka2CDP 2.3.0.4.10.0.0-147
-
PublishKafka2RecordCDP 2.3.0.4.10.0.0-147
-
PublishKafkaRecord_2_6 2.3.0.4.10.0.0-147
-
PublishMQTT 2.3.0.4.10.0.0-147
-
PublishSlack 2.3.0.4.10.0.0-147
-
PutAccumuloRecord 2.3.0.4.10.0.0-147
-
PutAzureBlobStorage_v12 2.3.0.4.10.0.0-147
-
PutAzureCosmosDBRecord 2.3.0.4.10.0.0-147
-
PutAzureDataExplorer 2.3.0.4.10.0.0-147
-
PutAzureDataLakeStorage 2.3.0.4.10.0.0-147
-
PutAzureEventHub 2.3.0.4.10.0.0-147
-
PutAzureQueueStorage_v12 2.3.0.4.10.0.0-147
-
PutBigQuery 2.3.0.4.10.0.0-147
-
PutBoxFile 2.3.0.4.10.0.0-147
-
PutCassandraQL 2.3.0.4.10.0.0-147
-
PutCassandraRecord 2.3.0.4.10.0.0-147
-
PutCDPObjectStore 2.3.0.4.10.0.0-147
-
PutClouderaHiveQL 2.3.0.4.10.0.0-147
-
PutClouderaHiveStreaming 2.3.0.4.10.0.0-147
-
PutClouderaORC 2.3.0.4.10.0.0-147
-
PutCloudWatchMetric 2.3.0.4.10.0.0-147
-
PutCouchbaseKey 2.3.0.4.10.0.0-147
-
PutDatabaseRecord 2.3.0.4.10.0.0-147
-
PutDistributedMapCache 2.3.0.4.10.0.0-147
-
PutDropbox 2.3.0.4.10.0.0-147
-
PutDynamoDB 2.3.0.4.10.0.0-147
-
PutDynamoDBRecord 2.3.0.4.10.0.0-147
-
PutElasticsearchJson 2.3.0.4.10.0.0-147
-
PutElasticsearchRecord 2.3.0.4.10.0.0-147
-
PutEmail 2.3.0.4.10.0.0-147
-
PutFile 2.3.0.4.10.0.0-147
-
PutFTP 2.3.0.4.10.0.0-147
-
PutGCSObject 2.3.0.4.10.0.0-147
-
PutGoogleDrive 2.3.0.4.10.0.0-147
-
PutGridFS 2.3.0.4.10.0.0-147
-
PutHBaseCell 2.3.0.4.10.0.0-147
-
PutHBaseJSON 2.3.0.4.10.0.0-147
-
PutHBaseRecord 2.3.0.4.10.0.0-147
-
PutHDFS 2.3.0.4.10.0.0-147
-
PutIceberg 2.3.0.4.10.0.0-147
-
PutIcebergCDC 2.3.0.4.10.0.0-147
-
PutIoTDBRecord 2.3.0.4.10.0.0-147
-
PutJiraIssue 2.3.0.4.10.0.0-147
-
PutKinesisFirehose 2.3.0.4.10.0.0-147
-
PutKinesisStream 2.3.0.4.10.0.0-147
-
PutKudu 2.3.0.4.10.0.0-147
-
PutLambda 2.3.0.4.10.0.0-147
-
PutMongo 2.3.0.4.10.0.0-147
-
PutMongoBulkOperations 2.3.0.4.10.0.0-147
-
PutMongoRecord 2.3.0.4.10.0.0-147
-
PutParquet 2.3.0.4.10.0.0-147
-
PutPLC 2.3.0.4.10.0.0-147
-
PutRecord 2.3.0.4.10.0.0-147
-
PutRedisHashRecord 2.3.0.4.10.0.0-147
-
PutS3Object 2.3.0.4.10.0.0-147
-
PutSalesforceObject 2.3.0.4.10.0.0-147
-
PutSFTP 2.3.0.4.10.0.0-147
-
PutSmbFile 2.3.0.4.10.0.0-147
-
PutSnowflakeInternalStage 2.3.0.4.10.0.0-147
-
PutSNS 2.3.0.4.10.0.0-147
-
PutSolrContentStream 2.3.0.4.10.0.0-147
-
PutSolrRecord 2.3.0.4.10.0.0-147
-
PutSplunk 2.3.0.4.10.0.0-147
-
PutSplunkHTTP 2.3.0.4.10.0.0-147
-
PutSQL 2.3.0.4.10.0.0-147
-
PutSQS 2.3.0.4.10.0.0-147
-
PutSyslog 2.3.0.4.10.0.0-147
-
PutTCP 2.3.0.4.10.0.0-147
-
PutUDP 2.3.0.4.10.0.0-147
-
PutWebSocket 2.3.0.4.10.0.0-147
-
PutZendeskTicket 2.3.0.4.10.0.0-147
-
QueryAirtableTable 2.3.0.4.10.0.0-147
-
QueryAzureDataExplorer 2.3.0.4.10.0.0-147
-
QueryCassandra 2.3.0.4.10.0.0-147
-
QueryDatabaseTable 2.3.0.4.10.0.0-147
-
QueryDatabaseTableRecord 2.3.0.4.10.0.0-147
-
QueryIoTDBRecord 2.3.0.4.10.0.0-147
-
QueryRecord 2.3.0.4.10.0.0-147
-
QuerySalesforceObject 2.3.0.4.10.0.0-147
-
QuerySolr 2.3.0.4.10.0.0-147
-
QuerySplunkIndexingStatus 2.3.0.4.10.0.0-147
-
RemoveRecordField 2.3.0.4.10.0.0-147
-
RenameRecordField 2.3.0.4.10.0.0-147
-
ReplaceText 2.3.0.4.10.0.0-147
-
ReplaceTextWithMapping 2.3.0.4.10.0.0-147
-
ResizeImage 2.3.0.4.10.0.0-147
-
RetryFlowFile 2.3.0.4.10.0.0-147
-
RouteHL7 2.3.0.4.10.0.0-147
-
RouteOnAttribute 2.3.0.4.10.0.0-147
-
RouteOnContent 2.3.0.4.10.0.0-147
-
RouteText 2.3.0.4.10.0.0-147
-
RunMongoAggregation 2.3.0.4.10.0.0-147
-
SampleRecord 2.3.0.4.10.0.0-147
-
SawmillTransformJSON 2.3.0.4.10.0.0-147
-
SawmillTransformRecord 2.3.0.4.10.0.0-147
-
ScanAccumulo 2.3.0.4.10.0.0-147
-
ScanAttribute 2.3.0.4.10.0.0-147
-
ScanContent 2.3.0.4.10.0.0-147
-
ScanHBase 2.3.0.4.10.0.0-147
-
ScriptedFilterRecord 2.3.0.4.10.0.0-147
-
ScriptedPartitionRecord 2.3.0.4.10.0.0-147
-
ScriptedTransformRecord 2.3.0.4.10.0.0-147
-
ScriptedValidateRecord 2.3.0.4.10.0.0-147
-
SearchElasticsearch 2.3.0.4.10.0.0-147
-
SegmentContent 2.3.0.4.10.0.0-147
-
SelectClouderaHiveQL 2.3.0.4.10.0.0-147
-
SendTrapSNMP 2.3.0.4.10.0.0-147
-
SetSNMP 2.3.0.4.10.0.0-147
-
SignContentPGP 2.3.0.4.10.0.0-147
-
SplitAvro 2.3.0.4.10.0.0-147
-
SplitContent 2.3.0.4.10.0.0-147
-
SplitExcel 2.3.0.4.10.0.0-147
-
SplitJson 2.3.0.4.10.0.0-147
-
SplitPCAP 2.3.0.4.10.0.0-147
-
SplitRecord 2.3.0.4.10.0.0-147
-
SplitText 2.3.0.4.10.0.0-147
-
SplitXml 2.3.0.4.10.0.0-147
-
StartAwsPollyJob 2.3.0.4.10.0.0-147
-
StartAwsTextractJob 2.3.0.4.10.0.0-147
-
StartAwsTranscribeJob 2.3.0.4.10.0.0-147
-
StartAwsTranslateJob 2.3.0.4.10.0.0-147
-
StartGcpVisionAnnotateFilesOperation 2.3.0.4.10.0.0-147
-
StartGcpVisionAnnotateImagesOperation 2.3.0.4.10.0.0-147
-
StartSnowflakeIngest 2.3.0.4.10.0.0-147
-
TagS3Object 2.3.0.4.10.0.0-147
-
TailFile 2.3.0.4.10.0.0-147
-
TransformXml 2.3.0.4.10.0.0-147
-
TriggerClouderaHiveMetaStoreEvent 2.3.0.4.10.0.0-147
-
UnpackContent 2.3.0.4.10.0.0-147
-
UpdateAttribute 2.3.0.4.10.0.0-147
-
UpdateByQueryElasticsearch 2.3.0.4.10.0.0-147
-
UpdateClouderaHiveTable 2.3.0.4.10.0.0-147
-
UpdateCounter 2.3.0.4.10.0.0-147
-
UpdateDatabaseTable 2.3.0.4.10.0.0-147
-
UpdateDeltaLakeTable 2.3.0.4.10.0.0-147
-
UpdateJiraIssue 2.3.0.4.10.0.0-147
-
UpdateRecord 2.3.0.4.10.0.0-147
-
ValidateCsv 2.3.0.4.10.0.0-147
-
ValidateJson 2.3.0.4.10.0.0-147
-
ValidateRecord 2.3.0.4.10.0.0-147
-
ValidateXml 2.3.0.4.10.0.0-147
-
VerifyContentMAC 2.3.0.4.10.0.0-147
-
VerifyContentPGP 2.3.0.4.10.0.0-147
-
Wait 2.3.0.4.10.0.0-147
-
-
Controller Services
-
AccumuloService 2.3.0.4.10.0.0-147
-
ActiveMQJMSConnectionFactoryProvider 2.3.0.4.10.0.0-147
-
ADLSCredentialsControllerService 2.3.0.4.10.0.0-147
-
ADLSCredentialsControllerServiceLookup 2.3.0.4.10.0.0-147
-
ADLSIDBrokerCloudCredentialsProviderControllerService 2.3.0.4.10.0.0-147
-
AmazonGlueSchemaRegistry 2.3.0.4.10.0.0-147
-
ApicurioSchemaRegistry 2.3.0.4.10.0.0-147
-
AvroReader 2.3.0.4.10.0.0-147
-
AvroRecordSetWriter 2.3.0.4.10.0.0-147
-
AvroSchemaRegistry 2.3.0.4.10.0.0-147
-
AWSCredentialsProviderControllerService 2.3.0.4.10.0.0-147
-
AWSIDBrokerCloudCredentialsProviderControllerService 2.3.0.4.10.0.0-147
-
AzureBlobIDBrokerCloudCredentialsProviderControllerService 2.3.0.4.10.0.0-147
-
AzureBlobStorageFileResourceService 2.3.0.4.10.0.0-147
-
AzureCosmosDBClientService 2.3.0.4.10.0.0-147
-
AzureDataLakeStorageFileResourceService 2.3.0.4.10.0.0-147
-
AzureEventHubRecordSink 2.3.0.4.10.0.0-147
-
AzureServiceBusJMSConnectionFactoryProvider 2.3.0.4.10.0.0-147
-
AzureStorageCredentialsControllerService_v12 2.3.0.4.10.0.0-147
-
AzureStorageCredentialsControllerServiceLookup_v12 2.3.0.4.10.0.0-147
-
CassandraDistributedMapCache 2.3.0.4.10.0.0-147
-
CassandraSessionProvider 2.3.0.4.10.0.0-147
-
CdpCredentialsProviderControllerService 2.3.0.4.10.0.0-147
-
CdpOauth2AccessTokenProviderControllerService 2.3.0.4.10.0.0-147
-
CEFReader 2.3.0.4.10.0.0-147
-
CiscoEmblemSyslogMessageReader 2.3.0.4.10.0.0-147
-
ClouderaAttributeSchemaReferenceReader 2.3.0.4.10.0.0-147
-
ClouderaAttributeSchemaReferenceWriter 2.3.0.4.10.0.0-147
-
ClouderaEncodedSchemaReferenceReader 2.3.0.4.10.0.0-147
-
ClouderaEncodedSchemaReferenceWriter 2.3.0.4.10.0.0-147
-
ClouderaHiveConnectionPool 2.3.0.4.10.0.0-147
-
ClouderaSchemaRegistry 2.3.0.4.10.0.0-147
-
CMLLookupService 2.3.0.4.10.0.0-147
-
ConfluentEncodedSchemaReferenceReader 2.3.0.4.10.0.0-147
-
ConfluentEncodedSchemaReferenceWriter 2.3.0.4.10.0.0-147
-
ConfluentSchemaRegistry 2.3.0.4.10.0.0-147
-
CouchbaseClusterService 2.3.0.4.10.0.0-147
-
CouchbaseKeyValueLookupService 2.3.0.4.10.0.0-147
-
CouchbaseMapCacheClient 2.3.0.4.10.0.0-147
-
CouchbaseRecordLookupService 2.3.0.4.10.0.0-147
-
CSVReader 2.3.0.4.10.0.0-147
-
CSVRecordLookupService 2.3.0.4.10.0.0-147
-
CSVRecordSetWriter 2.3.0.4.10.0.0-147
-
DatabaseRecordLookupService 2.3.0.4.10.0.0-147
-
DatabaseRecordSink 2.3.0.4.10.0.0-147
-
DatabaseTableSchemaRegistry 2.3.0.4.10.0.0-147
-
DBCPConnectionPool 2.3.0.4.10.0.0-147
-
DBCPConnectionPoolLookup 2.3.0.4.10.0.0-147
-
DeveloperBoxClientService 2.3.0.4.10.0.0-147
-
DistributedMapCacheLookupService 2.3.0.4.10.0.0-147
-
EBCDICRecordReader 2.3.0.4.10.0.0-147
-
ElasticSearchClientServiceImpl 2.3.0.4.10.0.0-147
-
ElasticSearchLookupService 2.3.0.4.10.0.0-147
-
ElasticSearchStringLookupService 2.3.0.4.10.0.0-147
-
EmailRecordSink 2.3.0.4.10.0.0-147
-
EmbeddedHazelcastCacheManager 2.3.0.4.10.0.0-147
-
ExcelReader 2.3.0.4.10.0.0-147
-
ExternalHazelcastCacheManager 2.3.0.4.10.0.0-147
-
FreeFormTextRecordSetWriter 2.3.0.4.10.0.0-147
-
GCPCredentialsControllerService 2.3.0.4.10.0.0-147
-
GCSFileResourceService 2.3.0.4.10.0.0-147
-
GenericPLC4XConnectionPool 2.3.0.4.10.0.0-147
-
GrokReader 2.3.0.4.10.0.0-147
-
HadoopCatalogService 2.3.0.4.10.0.0-147
-
HadoopDBCPConnectionPool 2.3.0.4.10.0.0-147
-
HazelcastMapCacheClient 2.3.0.4.10.0.0-147
-
HBase_2_ClientMapCacheService 2.3.0.4.10.0.0-147
-
HBase_2_ClientService 2.3.0.4.10.0.0-147
-
HBase_2_RecordLookupService 2.3.0.4.10.0.0-147
-
HikariCPConnectionPool 2.3.0.4.10.0.0-147
-
HiveCatalogService 2.3.0.4.10.0.0-147
-
HttpRecordSink 2.3.0.4.10.0.0-147
-
ImpalaConnectionPool 2.3.0.4.10.0.0-147
-
IPFIXReader 2.3.0.4.10.0.0-147
-
IPLookupService 2.3.0.4.10.0.0-147
-
JASN1Reader 2.3.0.4.10.0.0-147
-
JdbcCatalogService 2.3.0.4.10.0.0-147
-
JettyWebSocketClient 2.3.0.4.10.0.0-147
-
JettyWebSocketServer 2.3.0.4.10.0.0-147
-
JiraRecordSink 2.3.0.4.10.0.0-147
-
JMSConnectionFactoryProvider 2.3.0.4.10.0.0-147
-
JndiJmsConnectionFactoryProvider 2.3.0.4.10.0.0-147
-
JsonConfigBasedBoxClientService 2.3.0.4.10.0.0-147
-
JsonPathReader 2.3.0.4.10.0.0-147
-
JsonRecordSetWriter 2.3.0.4.10.0.0-147
-
JsonTreeReader 2.3.0.4.10.0.0-147
-
Kafka3ConnectionService 2.3.0.4.10.0.0-147
-
KafkaRecordSink_2_6 2.3.0.4.10.0.0-147
-
KerberosKeytabUserService 2.3.0.4.10.0.0-147
-
KerberosPasswordUserService 2.3.0.4.10.0.0-147
-
KerberosTicketCacheUserService 2.3.0.4.10.0.0-147
-
KuduLookupService 2.3.0.4.10.0.0-147
-
LivySessionController 2.3.0.4.10.0.0-147
-
LoggingRecordSink 2.3.0.4.10.0.0-147
-
MapCacheClientService 2.3.0.4.10.0.0-147
-
MapCacheServer 2.3.0.4.10.0.0-147
-
MongoDBControllerService 2.3.0.4.10.0.0-147
-
MongoDBLookupService 2.3.0.4.10.0.0-147
-
Neo4JCypherClientService 2.3.0.4.10.0.0-147
-
ParquetReader 2.3.0.4.10.0.0-147
-
ParquetRecordSetWriter 2.3.0.4.10.0.0-147
-
PEMEncodedSSLContextProvider 2.3.0.4.10.0.0-147
-
PhoenixThickConnectionPool 2.3.0.4.10.0.0-147
-
PhoenixThinConnectionPool 2.3.0.4.10.0.0-147
-
PostgreSQLConnectionPool 2.3.0.4.10.0.0-147
-
PropertiesFileLookupService 2.3.0.4.10.0.0-147
-
ProtobufReader 2.3.0.4.10.0.0-147
-
ProxyPLC4XConnectionPool 2.3.0.4.10.0.0-147
-
RabbitMQJMSConnectionFactoryProvider 2.3.0.4.10.0.0-147
-
ReaderLookup 2.3.0.4.10.0.0-147
-
RecordSetWriterLookup 2.3.0.4.10.0.0-147
-
RecordSinkServiceLookup 2.3.0.4.10.0.0-147
-
RedisConnectionPoolService 2.3.0.4.10.0.0-147
-
RedisDistributedMapCacheClientService 2.3.0.4.10.0.0-147
-
RedshiftConnectionPool 2.3.0.4.10.0.0-147
-
RESTCatalogService 2.3.0.4.10.0.0-147
-
RestLookupService 2.3.0.4.10.0.0-147
-
S3FileResourceService 2.3.0.4.10.0.0-147
-
ScriptedLookupService 2.3.0.4.10.0.0-147
-
ScriptedReader 2.3.0.4.10.0.0-147
-
ScriptedRecordSetWriter 2.3.0.4.10.0.0-147
-
ScriptedRecordSink 2.3.0.4.10.0.0-147
-
SetCacheClientService 2.3.0.4.10.0.0-147
-
SetCacheServer 2.3.0.4.10.0.0-147
-
SimpleCsvFileLookupService 2.3.0.4.10.0.0-147
-
SimpleDatabaseLookupService 2.3.0.4.10.0.0-147
-
SimpleKeyValueLookupService 2.3.0.4.10.0.0-147
-
SimpleRedisDistributedMapCacheClientService 2.3.0.4.10.0.0-147
-
SimpleScriptedLookupService 2.3.0.4.10.0.0-147
-
SiteToSiteReportingRecordSink 2.3.0.4.10.0.0-147
-
SlackRecordSink 2.3.0.4.10.0.0-147
-
SmbjClientProviderService 2.3.0.4.10.0.0-147
-
SnowflakeComputingConnectionPool 2.3.0.4.10.0.0-147
-
StandardAsanaClientProviderService 2.3.0.4.10.0.0-147
-
StandardAzureCredentialsControllerService 2.3.0.4.10.0.0-147
-
StandardDatabaseDialectService 2.3.0.4.10.0.0-147
-
StandardDropboxCredentialService 2.3.0.4.10.0.0-147
-
StandardFileResourceService 2.3.0.4.10.0.0-147
-
StandardHashiCorpVaultClientService 2.3.0.4.10.0.0-147
-
StandardHttpContextMap 2.3.0.4.10.0.0-147
-
StandardJiraCredentialService 2.3.0.4.10.0.0-147
-
StandardJsonSchemaRegistry 2.3.0.4.10.0.0-147
-
StandardKustoIngestService 2.3.0.4.10.0.0-147
-
StandardKustoQueryService 2.3.0.4.10.0.0-147
-
StandardOauth2AccessTokenProvider 2.3.0.4.10.0.0-147
-
StandardPGPPrivateKeyService 2.3.0.4.10.0.0-147
-
StandardPGPPublicKeyService 2.3.0.4.10.0.0-147
-
StandardPLC4XConnectionPool 2.3.0.4.10.0.0-147
-
StandardPrivateKeyService 2.3.0.4.10.0.0-147
-
StandardProxyConfigurationService 2.3.0.4.10.0.0-147
-
StandardRestrictedSSLContextService 2.3.0.4.10.0.0-147
-
StandardS3EncryptionService 2.3.0.4.10.0.0-147
-
StandardSnowflakeIngestManagerProviderService 2.3.0.4.10.0.0-147
-
StandardSSLContextService 2.3.0.4.10.0.0-147
-
StandardWebClientServiceProvider 2.3.0.4.10.0.0-147
-
Syslog5424Reader 2.3.0.4.10.0.0-147
-
SyslogReader 2.3.0.4.10.0.0-147
-
TinkerpopClientService 2.3.0.4.10.0.0-147
-
UDPEventRecordSink 2.3.0.4.10.0.0-147
-
VolatileSchemaCache 2.3.0.4.10.0.0-147
-
WindowsEventLogReader 2.3.0.4.10.0.0-147
-
XMLFileLookupService 2.3.0.4.10.0.0-147
-
XMLReader 2.3.0.4.10.0.0-147
-
XMLRecordSetWriter 2.3.0.4.10.0.0-147
-
YamlTreeReader 2.3.0.4.10.0.0-147
-
ZendeskRecordSink 2.3.0.4.10.0.0-147
-
-
Reporting Tasks
-
AzureLogAnalyticsProvenanceReportingTask 2.3.0.4.10.0.0-147
-
AzureLogAnalyticsReportingTask 2.3.0.4.10.0.0-147
-
ControllerStatusReportingTask 2.3.0.4.10.0.0-147
-
MonitorDiskUsage 2.3.0.4.10.0.0-147
-
MonitorMemory 2.3.0.4.10.0.0-147
-
QueryNiFiReportingTask 2.3.0.4.10.0.0-147
-
ReportLineageToAtlas 2.3.0.4.10.0.0-147
-
ScriptedReportingTask 2.3.0.4.10.0.0-147
-
SiteToSiteBulletinReportingTask 2.3.0.4.10.0.0-147
-
SiteToSiteMetricsReportingTask 2.3.0.4.10.0.0-147
-
SiteToSiteProvenanceReportingTask 2.3.0.4.10.0.0-147
-
SiteToSiteStatusReportingTask 2.3.0.4.10.0.0-147
-
-
Parameter Providers
-
AwsSecretsManagerParameterProvider 2.3.0.4.10.0.0-147
-
AzureKeyVaultSecretsParameterProvider 2.3.0.4.10.0.0-147
-
CyberArkConjurParameterProvider 2.3.0.4.10.0.0-147
-
DatabaseParameterProvider 2.3.0.4.10.0.0-147
-
EnvironmentVariableParameterProvider 2.3.0.4.10.0.0-147
-
GcpSecretManagerParameterProvider 2.3.0.4.10.0.0-147
-
HashiCorpVaultParameterProvider 2.3.0.4.10.0.0-147
-
KubernetesSecretParameterProvider 2.3.0.4.10.0.0-147
-
OnePasswordParameterProvider 2.3.0.4.10.0.0-147
-
PropertiesFileParameterProvider 2.3.0.4.10.0.0-147
-
-
Flow Analysis Rules
-
DisallowComponentType 2.3.0.4.10.0.0-147
-
DisallowConsecutiveConnectionsWithRoundRobinLB 2.3.0.4.10.0.0-147
-
DisallowDeadEnd 2.3.0.4.10.0.0-147
-
DisallowDeprecatedProcessor 2.3.0.4.10.0.0-147
-
DisallowExtractTextForFullContent 2.3.0.4.10.0.0-147
-
RecommendRecordProcessor 2.3.0.4.10.0.0-147
-
RequireHandleHttpResponseAfterHandleHttpRequest 2.3.0.4.10.0.0-147
-
RequireMergeBeforePutIceberg 2.3.0.4.10.0.0-147
-
RestrictBackpressureSettings 2.3.0.4.10.0.0-147
-
RestrictComponentNaming 2.3.0.4.10.0.0-147
-
RestrictConcurrentTasksVsThreadPoolSizeInProcessors 2.3.0.4.10.0.0-147
-
RestrictFlowFileExpiration 2.3.0.4.10.0.0-147
-
RestrictProcessorConcurrency 2.3.0.4.10.0.0-147
-
RestrictSchedulingForListProcessors 2.3.0.4.10.0.0-147
-
RestrictThreadPoolSize 2.3.0.4.10.0.0-147
-
RestrictYieldDurationForConsumeKafkaProcessors 2.3.0.4.10.0.0-147
-
ListHDFS 2.3.0.4.10.0.0-147
- Bundle
- org.apache.nifi | nifi-hadoop-nar
- Description
- Retrieves a listing of files from HDFS. For each file that is listed in HDFS, this processor creates a FlowFile that represents the HDFS file to be fetched in conjunction with FetchHDFS. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data. Unlike GetHDFS, this Processor does not delete any data from HDFS.
- Tags
- HCFS, HDFS, filesystem, get, hadoop, ingest, list, source
- Input Requirement
- FORBIDDEN
- Supports Sensitive Dynamic Properties
- false
-
Additional Details for ListHDFS 2.3.0.4.10.0.0-147
ListHDFS
ListHDFS Filter Modes
There are three filter modes available for ListHDFS that determine how the regular expression in the
File Filter
property will be applied to listings in HDFS.-
Directories and Files Filtering will be applied to the names of directories and files. If
Recurse Subdirectories
is set to true, only subdirectories with a matching name will be searched for files that match the regular expression defined inFile Filter
. -
Files Only Filtering will only be applied to the names of files. If
Recurse Subdirectories
is set to true, the entire subdirectory tree will be searched for files that match the regular expression defined inFile Filter
. -
Full Path Filtering will be applied to the full path of files. If
Recurse Subdirectories
is set to true, the entire subdirectory tree will be searched for files in which the full path of the file matches the regular expression defined inFile Filter
. Regardingscheme
andauthority
, if a given file has a full path ofhdfs://hdfscluster:8020/data/txt/1.txt
, the filter will evaluate the regular expression defined inFile Filter
against two cases, matching if either is true: -
the full path including the scheme (
hdfs
), authority (hdfscluster:8020
), and the remaining path components (/data/txt/1.txt
) -
only the path components (
/data/txt/1.txt
)
Examples:
For the given examples, the following directory structure is used:
data
├── readme.txt
├── bin
│ ├── readme.txt
│ ├── 1.bin
│ ├── 2.bin
│ └── 3.bin
├── csv
│ ├── readme.txt
│ ├── 1.csv
│ ├── 2.csv
│ └── 3.csv
└── txt ├── readme.txt ├── 1.txt ├── 2.txt └── 3.txtDirectories and Files
This mode is useful when the listing should match the names of directories and files with the regular expression defined in
File Filter
. WhenRecurse Subdirectories
is true, this mode allows the user to filter for files in subdirectories with names that match the regular expression defined inFile Filter
.ListHDFS configuration:
Property Value Directory
/data
Recurse Subdirectories
true File Filter
.*txt.*
Filter Mode
Directories and Files
ListHDFS results:
- /data/readme.txt
- /data/txt/readme.txt
- /data/txt/1.txt
- /data/txt/2.txt
- /data/txt/3.txt
Files Only
This mode is useful when the listing should match only the names of files with the regular expression defined in
File Filter
. Directory names will not be matched against the regular expression defined inFile Filter
. WhenRecurse Subdirectories
is true, this mode allows the user to filter for files in the entire subdirectory tree of the directory specified in theDirectory
property.ListHDFS configuration:
Property Value Directory
/data
Recurse Subdirectories
true File Filter
[^\.].*\.txt
Filter Mode
Files Only
ListHDFS results:
- /data/readme.txt
- /data/bin/readme.txt
- /data/csv/readme.txt
- /data/txt/readme.txt
- /data/txt/1.txt
- /data/txt/2.txt
- /data/txt/3.txt
Full Path
This mode is useful when the listing should match the entire path of a file with the regular expression defined in
File Filter
. WhenRecurse Subdirectories
is true, this mode allows the user to filter for files in the entire subdirectory tree of the directory specified in theDirectory
property while allowing filtering based on the full path of each file.ListHDFS configuration:
Property Value Directory
/data
Recurse Subdirectories
true File Filter
(/.*/)*csv/.*
Filter Mode
Full Path
ListHDFS results:
- /data/csv/readme.txt
- /data/csv/1.csv
- /data/csv/2.csv
- /data/csv/3.csv
Streaming Versus Batch Processing
ListHDFS performs a listing of all files that it encounters in the configured HDFS directory. There are two common, broadly defined use cases.
Streaming Use Case
By default, the Processor will create a separate FlowFile for each file in the directory and add attributes for filename, path, etc. A common use case is to connect ListHDFS to the FetchHDFS processor. These two processors used in conjunction with one another provide the ability to easily monitor a directory and fetch the contents of any new file as it lands in HDFS in an efficient streaming fashion.
Batch Use Case
Another common use case is the desire to process all newly arriving files in a given directory, and to then perform some action only when all files have completed their processing. The above approach of streaming the data makes this difficult, because NiFi is inherently a streaming platform in that there is no “job” that has a beginning and an end. Data is simply picked up as it becomes available.
To solve this, the ListHDFS Processor can optionally be configured with a Record Writer. When a Record Writer is configured, a single FlowFile will be created that will contain a Record for each file in the directory, instead of a separate FlowFile per file. See the documentation for ListFile for an example of how to build a dataflow that allows for processing all the files before proceeding with any other step.
One important difference between the data produced by ListFile and ListHDFS, though, is the structure of the Records that are emitted. The Records emitted by ListFile have a different schema than those emitted by ListHDFS. ListHDFS emits records that follow the following schema (in Avro format):
{ "type": "record", "name": "nifiRecord", "namespace": "org.apache.nifi", "fields": [ { "name": "filename", "type": "string" }, { "name": "path", "type": "string" }, { "name": "directory", "type": "boolean" }, { "name": "size", "type": "long" }, { "name": "lastModified", "type": { "type": "long", "logicalType": "timestamp-millis" } }, { "name": "permissions", "type": [ "null", "string" ] }, { "name": "owner", "type": [ "null", "string" ] }, { "name": "group", "type": [ "null", "string" ] }, { "name": "replication", "type": [ "null", "int" ] }, { "name": "symLink", "type": [ "null", "boolean" ] }, { "name": "encrypted", "type": [ "null", "boolean" ] }, { "name": "erasureCoded", "type": [ "null", "boolean" ] } ] }
-
-
Additional Classpath Resources
A comma-separated list of paths to files and/or directories that will be added to the classpath and used for loading native libraries. When specifying a directory, all files with in the directory will be added to the classpath, but further sub-directories will not be included.
- Display Name
- Additional Classpath Resources
- Description
- A comma-separated list of paths to files and/or directories that will be added to the classpath and used for loading native libraries. When specifying a directory, all files with in the directory will be added to the classpath, but further sub-directories will not be included.
- API Name
- Additional Classpath Resources
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- false
-
Directory
The HDFS directory from which files should be read
- Display Name
- Directory
- Description
- The HDFS directory from which files should be read
- API Name
- Directory
- Expression Language Scope
- Environment variables and FlowFile Attributes
- Sensitive
- false
- Required
- true
-
File Filter
Only files whose names match the given regular expression will be picked up
- Display Name
- File Filter
- Description
- Only files whose names match the given regular expression will be picked up
- API Name
- File Filter
- Default Value
- [^\.].*
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- true
-
File Filter Mode
Determines how the regular expression in File Filter will be used when retrieving listings.
- Display Name
- File Filter Mode
- Description
- Determines how the regular expression in File Filter will be used when retrieving listings.
- API Name
- file-filter-mode
- Default Value
- filter-mode-directories-and-files
- Allowable Values
-
- Directories and Files
- Files Only
- Full Path
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- true
-
Hadoop Configuration Resources
A file or comma separated list of files which contains the Hadoop file system configuration. Without this, Hadoop will search the classpath for a 'core-site.xml' and 'hdfs-site.xml' file or will revert to a default configuration. To use swebhdfs, see 'Additional Details' section of PutHDFS's documentation.
- Display Name
- Hadoop Configuration Resources
- Description
- A file or comma separated list of files which contains the Hadoop file system configuration. Without this, Hadoop will search the classpath for a 'core-site.xml' and 'hdfs-site.xml' file or will revert to a default configuration. To use swebhdfs, see 'Additional Details' section of PutHDFS's documentation.
- API Name
- Hadoop Configuration Resources
- Expression Language Scope
- Environment variables defined at JVM level and system properties
- Sensitive
- false
- Required
- false
-
Kerberos User Service
Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos
- Display Name
- Kerberos User Service
- Description
- Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos
- API Name
- kerberos-user-service
- Service Interface
- org.apache.nifi.kerberos.KerberosUserService
- Service Implementations
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- false
-
Maximum File Age
The maximum age that a file must be in order to be pulled; any file older than this amount of time (based on last modification date) will be ignored. Minimum value is 100ms.
- Display Name
- Maximum File Age
- Description
- The maximum age that a file must be in order to be pulled; any file older than this amount of time (based on last modification date) will be ignored. Minimum value is 100ms.
- API Name
- maximum-file-age
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- false
-
Minimum File Age
The minimum age that a file must be in order to be pulled; any file younger than this amount of time (based on last modification date) will be ignored
- Display Name
- Minimum File Age
- Description
- The minimum age that a file must be in order to be pulled; any file younger than this amount of time (based on last modification date) will be ignored
- API Name
- minimum-file-age
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- false
-
Record Writer
Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile.
- Display Name
- Record Writer
- Description
- Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile.
- API Name
- record-writer
- Service Interface
- org.apache.nifi.serialization.RecordSetWriterFactory
- Service Implementations
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- false
-
Recurse Subdirectories
Indicates whether to list files from subdirectories of the HDFS directory
- Display Name
- Recurse Subdirectories
- Description
- Indicates whether to list files from subdirectories of the HDFS directory
- API Name
- Recurse Subdirectories
- Default Value
- true
- Allowable Values
-
- true
- false
- Expression Language Scope
- Not Supported
- Sensitive
- false
- Required
- true
Scopes | Description |
---|---|
CLUSTER | After performing a listing of HDFS files, the latest timestamp of all the files listed is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run, without having to store all of the actual filenames/paths which could lead to performance problems. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |
Name | Description |
---|---|
success | All FlowFiles are transferred to this relationship |
Name | Description |
---|---|
filename | The name of the file that was read from HDFS. |
path | The path is set to the absolute path of the file's directory on HDFS. For example, if the Directory property is set to /tmp, then files picked up from /tmp will have the path attribute set to "./". If the Recurse Subdirectories property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to "/tmp/abc/1/2/3". |
hdfs.owner | The user that owns the file in HDFS |
hdfs.group | The group that owns the file in HDFS |
hdfs.lastModified | The timestamp of when the file in HDFS was last modified, as milliseconds since midnight Jan 1, 1970 UTC |
hdfs.length | The number of bytes in the file in HDFS |
hdfs.replication | The number of HDFS replicas for hte file |
hdfs.permissions | The permissions for the file in HDFS. This is formatted as 3 characters for the owner, 3 for the group, and 3 for other users. For example rw-rw-r-- |