RecordReaderFactory
Implementations: Syslog5424Reader
CEFReader
ReaderLookup
CiscoEmblemSyslogMessageReader
CSVReader
GrokReader
SyslogReader
JsonTreeReader
JsonPathReader
XMLReader
AvroReader
JASN1Reader
ExcelReader
ParquetReader
EBCDICRecordReader
WindowsEventLogReader
IPFIXReader
ScriptedReader
Specifies the Controller Service to use for parsing incoming data and determining the data's schema. |
Catalog Service | catalog-service | | Controller Service API: IcebergCatalogService Implementations: HiveCatalogService HadoopCatalogService | Specifies the Controller Service to use for handling references to the table’s metadata files. |
Catalog Namespace | catalog-namespace | | | The namespace of the catalog. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Table Name | table-name | | | The name of the Iceberg table to write to. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
File Format | file-format | | | File format to use when writing Iceberg data files. If not set, the 'write.format.default' table property is used; its default value is parquet. |
Maximum File Size | maximum-file-size | | | The maximum size that a file can be. If this size is exceeded, a new file is generated with the remaining data. If not set, the 'write.target-file-size-bytes' table property is used; its default value is 512 MB. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Kerberos User Service | kerberos-user-service | | Controller Service API: KerberosUserService Implementations: KerberosKeytabUserService KerberosPasswordUserService KerberosTicketCacheUserService | Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos. |
Number of Commit Retries | number-of-commit-retries | 10 | | Number of times to retry a commit before failing. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Minimum Commit Wait Time | minimum-commit-wait-time | 100 ms | | Minimum time to wait before retrying a commit. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Maximum Commit Wait Time | maximum-commit-wait-time | 2 sec | | Maximum time to wait before retrying a commit. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Maximum Commit Duration | maximum-commit-duration | 30 sec | | Total retry timeout period for a commit. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
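
The four commit retry properties above bound a retry loop: a retry count, a minimum and maximum wait between attempts, and a total time budget. The sketch below shows one way such limits could interact, using the default values from the table. It assumes the wait doubles after each attempt and ignores the time spent on the commit attempts themselves; neither the growth factor nor the exact stopping rule is stated in this section, and `backoff_schedule` is a hypothetical helper, not part of the processor.

```python
def backoff_schedule(retries=10, min_wait_ms=100, max_wait_ms=2000,
                     max_duration_ms=30000, factor=2.0):
    """Return the waits (in ms) before each retry under an assumed policy.

    Assumption: the wait starts at min_wait_ms, grows by `factor` per
    attempt, is capped at max_wait_ms, and retries stop once the summed
    waits would exceed max_duration_ms.
    """
    waits = []
    elapsed = 0
    for attempt in range(retries):
        # Grow the wait geometrically, then clamp it to the maximum wait.
        wait = min(int(min_wait_ms * factor ** attempt), max_wait_ms)
        # Stop early if this wait would push past the total time budget.
        if elapsed + wait > max_duration_ms:
            break
        waits.append(wait)
        elapsed += wait
    return waits


schedule = backoff_schedule()
print(schedule)
print(sum(schedule))
```

Under these assumptions, the default settings never reach the 30 sec budget from waiting alone, so the retry count is the limit that applies first. Lowering Maximum Commit Duration or raising Maximum Commit Wait Time changes which limit is reached first.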
Relationships:
Name | Description |
---|---|
success | A FlowFile is routed to this relationship after the data ingestion was successful. |
failure | A FlowFile is routed to this relationship if the operation failed and retrying the operation will also fail, for example because of invalid data or an invalid schema. |
Reads Attributes:
None specified.
Writes Attributes:
Name | Description |
---|---|
iceberg.record.count | The number of records in the FlowFile. |
State management:
This component does not store state.
Restricted:
This component is not restricted.
System Resource Considerations:
None specified.