UpdateDeltaLakeTable

Description:

Updates a DeltaLake table with information of new added files.

Additional Details...

Tags:

deltalake, deltatable, cloud, storage, parquet

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Hadoop Configuration Resourceshadoop-config-resourcesA file, or a comma separated list of files, which contain the Hadoop configuration (core-site.xml, etc.) for the Hadoop filesystem where files are being added.

This property expects a comma-separated list of file resources.

Supports Expression Language: true (will be evaluated using Environment variables only)
Storage Locationstorage-locationUse this property to set the storage location in use. Example: 's3a://myBucket/myDirectory'. In case the property is not specified, the processor will use the value set in Hadoop Configuration Resources.
Supports Expression Language: true (will be evaluated using Environment variables only)
DeltaLake Table Pathdeltalake-table-pathPath (relative to the root of the target Hadoop file system) where the DeltaLake table is located.
Supports Expression Language: true (will be evaluated using Environment variables only)
Data File URLfile-url${hadoop.file.url}The URL of the file that was added and with which to update the DeltaLake table.
Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables)
Kerberos User Servicekerberos-user-serviceController Service API:
KerberosUserService
Implementations: KerberosTicketCacheUserService
KerberosKeytabUserService
KerberosPasswordUserService
Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos.
Number of Commit Retriesnumber-of-commit-retries10Number of times to retry a commit before failing.
Commit Wait Timecommit-wait-time100 msTime to wait before retrying a commit.

Relationships:

NameDescription
successDeltaLake table successfully updated
failureDeltaLake table update failed

Reads Attributes:

None specified.

Writes Attributes:

None specified.

State management:

This component does not store state.

Restricted:

This component is not restricted.

Input requirement:

This component requires an incoming relationship.

System Resource Considerations:

None specified.