PutIceberg

Description

Iceberg is a high-performance format for huge analytic tables. The PutIceberg processor is capable of pushing data into Iceberg tables using different types of Iceberg catalog implementations.

Commit retry properties

Iceberg supports multiple concurrent writes using optimistic concurrency. The processor's commit retry implementation is using exponential backoff with jitter and scale factor 2, and provides the following properties to configure the behaviour according to its usage.

The NiFi side retry logic is built on top of the Iceberg commit retry logic which can be configured through table properties. See more: Table behavior properties

Snapshot summary properties

The processor provides an option to add additional properties to the snapshot summary using dynamic properties. The additional property must have the 'snapshot-property.' prefix in the dynamic property key but the actual entry will be inserted without it. Each snapshot automatically gets the FlowFile's uuid in the 'nifi-flowfile-uuid' summary property.