Configure the processor for merging records
Learn how you can configure the MergeRecord processor for your ADLS ingest data flow. You can use it to merge together multiple record-oriented flow files into a large flow file that contains all records of your Kafka data input.
In this example the following settings and properties are used:
Scheduling | Description | Example value for ingest data flow |
---|---|---|
Automatically Terminate Relationships | original |
Property | Description | Example value for ingest data flow |
---|---|---|
RecordReader |
Specify the Controller Service to use for reading incoming data. |
CSVReader |
RecordWriter |
Specify the Controller Service to use for writing out the records. |
CSVRecordSetWriter |
Merge Strategy |
Specify the algorithm used to merge records. The Bin-Packing Algorithm generates a FlowFile populated by arbitrarily chosen FlowFiles. |
Bin-Packing Algorithm |
Minimum Number of Records |
Specify the minimum number of records to include in a bin. |
900 |
Maximum Number of Records |
Specify the maximum number of Records to include in a bin. |
1000 |