This property expects a comma-separated list of file resources.
Supports Expression Language: true (will be evaluated using Environment variables only)Kerberos Credentials Service | kerberos-credentials-service | | Controller Service API: KerberosCredentialsService Implementation: KeytabCredentialsService | Specifies the Kerberos Credentials Controller Service that should be used for authenticating with Kerberos |
Kerberos User Service | kerberos-user-service | | Controller Service API: KerberosUserService Implementations: KerberosTicketCacheUserService KerberosKeytabUserService KerberosPasswordUserService | Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos |
Kerberos Principal | Kerberos Principal | | | Kerberos principal to authenticate as. Requires nifi.kerberos.krb5.file to be set in your nifi.properties Supports Expression Language: true (will be evaluated using Environment variables only) |
Kerberos Keytab | Kerberos Keytab | | | Kerberos keytab associated with the principal. Requires nifi.kerberos.krb5.file to be set in your nifi.properties
This property requires exactly one file to be provided..
Supports Expression Language: true (will be evaluated using Environment variables only) |
Kerberos Password | Kerberos Password | | | Kerberos password associated with the principal. Sensitive Property: true |
Kerberos Relogin Period | Kerberos Relogin Period | 4 hours | | Period of time which should pass before attempting a kerberos relogin.
This property has been deprecated, and has no effect on processing. Relogins now occur automatically. Supports Expression Language: true (will be evaluated using Environment variables only) |
Additional Classpath Resources | Additional Classpath Resources | | | A comma-separated list of paths to files and/or directories that will be added to the classpath and used for loading native libraries. When specifying a directory, all files with in the directory will be added to the classpath, but further sub-directories will not be included.
This property expects a comma-separated list of resources. Each of the resources may be of any of the following types: directory, file.
|
Record Reader | record-reader | | Controller Service API: RecordReaderFactory Implementations: JASN1Reader JsonTreeReader GrokReader Syslog5424Reader CiscoEmblemSyslogMessageReader AvroReader JsonPathReader CEFReader IPFIXReader WindowsEventLogReader XMLReader ScriptedReader ReaderLookup YamlTreeReader ParquetReader CSVReader EBCDICRecordReader ExcelReader SyslogReader | The service for reading records from incoming flow files. |
Directory | Directory | | | The parent directory to which files should be written. Will be created if it doesn't exist. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
Compression Type | compression-type | UNCOMPRESSED | - UNCOMPRESSED
- SNAPPY
- GZIP
- LZO
- BROTLI
- LZ4
- ZSTD
- LZ4_RAW
| The type of compression for the file being written. |
Overwrite Files | overwrite | false | | Whether or not to overwrite existing files in the same directory with the same name. When set to false, flow files will be routed to failure when a file exists in the same directory with the same name. |
Permissions umask | permissions-umask | | | A umask represented as an octal number which determines the permissions of files written to HDFS. This overrides the Hadoop Configuration dfs.umaskmode |
Remote Group | remote-group | | | Changes the group of the HDFS file to this value after it is written. This only works if NiFi is running as a user that has HDFS super user privilege to change group |
Remote Owner | remote-owner | | | Changes the owner of the HDFS file to this value after it is written. This only works if NiFi is running as a user that has HDFS super user privilege to change owner |
Row Group Size | row-group-size | | | The row group size used by the Parquet writer. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
Page Size | page-size | | | The page size used by the Parquet writer. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
Dictionary Page Size | dictionary-page-size | | | The dictionary page size used by the Parquet writer. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
Max Padding Size | max-padding-size | | | The maximum amount of padding that will be used to align row groups with blocks in the underlying filesystem. If the underlying filesystem is not a block filesystem like HDFS, this has no effect. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
Enable Dictionary Encoding | enable-dictionary-encoding | | | Specifies whether dictionary encoding should be enabled for the Parquet writer |
Enable Validation | enable-validation | | | Specifies whether validation should be enabled for the Parquet writer |
Writer Version | writer-version | | | Specifies the version used by Parquet writer |
Avro Write Old List Structure | avro-write-old-list-structure | true | | Specifies the value for 'parquet.avro.write-old-list-structure' in the underlying Parquet library |
Avro Add List Element Records | avro-add-list-element-records | true | | Specifies the value for 'parquet.avro.add-list-element-records' in the underlying Parquet library |
Remove CRC Files | remove-crc-files | false | | Specifies whether the corresponding CRC file should be deleted upon successfully writing a Parquet file |
Relationships:
Name | Description |
---|
retry | Flow Files that could not be processed due to issues that can be retried are transferred to this relationship |
success | Flow Files that have been successfully processed are transferred to this relationship |
failure | Flow Files that could not be processed due to issue that cannot be retried are transferred to this relationship |
Reads Attributes:
Name | Description |
---|
filename | The name of the file to write comes from the value of this attribute. |
Writes Attributes:
Name | Description |
---|
filename | The name of the file is stored in this attribute. |
absolute.hdfs.path | The absolute path to the file is stored in this attribute. |
hadoop.file.url | The hadoop url for the file is stored in this attribute. |
record.count | The number of records written to the Parquet file |
State management:
This component does not store state.Restricted:
Required Permission | Explanation |
---|
write distributed filesystem | Provides operator the ability to write any file that NiFi has access to in HDFS or the local filesystem. |
Input requirement:
This component requires an incoming relationship.System Resource Considerations:
None specified.