PutParquet 2.3.0.4.10.0.0-147

Bundle
org.apache.nifi | nifi-parquet-nar
Description
Reads records from an incoming FlowFile using the provided Record Reader, and writes those records to a Parquet file. The schema for the Parquet file must be provided in the processor properties. This processor will first write a temporary dot file and upon successfully writing every record to the dot file, it will rename the dot file to it's final name. If the dot file cannot be renamed, the rename operation will be attempted up to 10 times, and if still not successful, the dot file will be deleted and the flow file will be routed to failure. If any error occurs while reading records from the input, or writing records to the output, the entire dot file will be removed and the flow file will be routed to failure or retry, depending on the error.
Tags
HDFS, filesystem, hadoop, parquet, put, record
Input Requirement
REQUIRED
Supports Sensitive Dynamic Properties
false
Properties
Restrictions
Required Permission Explanation
write distributed filesystem Provides operator the ability to write any file that NiFi has access to in HDFS or the local filesystem.
Relationships
Name Description
failure Flow Files that could not be processed due to issue that cannot be retried are transferred to this relationship
retry Flow Files that could not be processed due to issues that can be retried are transferred to this relationship
success Flow Files that have been successfully processed are transferred to this relationship
Reads Attributes
Name Description
filename The name of the file to write comes from the value of this attribute.
Writes Attributes
Name Description
filename The name of the file is stored in this attribute.
absolute.hdfs.path The absolute path to the file is stored in this attribute.
hadoop.file.url The hadoop url for the file is stored in this attribute.
record.count The number of records written to the Parquet file