Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Page Size | page-size | | | The page size used by the Parquet writer. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Dictionary Page Size | dictionary-page-size | | | The dictionary page size used by the Parquet writer. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Max Padding Size | max-padding-size | | | The maximum amount of padding that will be used to align row groups with blocks in the underlying filesystem. If the underlying filesystem is not a block filesystem like HDFS, this has no effect. The value is specified in the format of <Data Size> <Data Unit> where Data Unit is one of B, KB, MB, GB, TB. Supports Expression Language: true (will be evaluated using flow file attributes and variable registry) |
Enable Dictionary Encoding | enable-dictionary-encoding | | | Specifies whether dictionary encoding should be enabled for the Parquet writer. |
Enable Validation | enable-validation | | | Specifies whether validation should be enabled for the Parquet writer. |
Writer Version | writer-version | | | Specifies the version used by the Parquet writer. |
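The `<Data Size> <Data Unit>` format used by the size properties above can be illustrated with a small parser. This is a minimal sketch, not the processor's own code: the function name `parse_data_size` is hypothetical, and the binary multiplier (1 KB = 1024 B), optional whitespace, and case-insensitive units are assumptions that the property descriptions do not state.

```python
import re

# Assumption: binary multiples (1 KB = 1024 B). The property descriptions
# list the units but do not state the multiplier.
_UNITS = {"B": 1, "KB": 1024, "MB": 1024**2, "GB": 1024**3, "TB": 1024**4}

# Assumption: whitespace between number and unit is optional and units are
# matched case-insensitively.
_PATTERN = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*(B|KB|MB|GB|TB)\s*$", re.IGNORECASE)


def parse_data_size(text: str) -> int:
    """Convert a string such as '8 KB' into a number of bytes (hypothetical helper)."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"not a valid data size: {text!r}")
    number, unit = match.groups()
    return int(float(number) * _UNITS[unit.upper()])
```

Under these assumptions, a value such as `1 MB` for `page-size` would correspond to 1,048,576 bytes.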
Relationships:
Name | Description |
---|---|
success | Parquet file that was converted successfully from Avro |
failure | Avro content that could not be processed |
Reads Attributes:
None specified.
Writes Attributes:
Name | Description |
---|---|
filename | Sets the filename to the existing filename with the extension replaced by .parquet, or with .parquet appended. |
record.count | Sets the number of records in the Parquet file. |
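The `filename` rewrite described above can be sketched as follows. This is one possible reading, not the processor's implementation: the helper `parquet_filename` is hypothetical, and the sketch assumes that any existing extension is replaced and that a name without an extension simply gets `.parquet` appended. The description does not say which extensions count as replaceable.

```python
import os


def parquet_filename(filename: str) -> str:
    """Return the name with its last extension swapped for .parquet (hypothetical helper)."""
    # os.path.splitext splits off only the final extension; when there is
    # none, the extension part is empty and .parquet is simply appended.
    stem, _extension = os.path.splitext(filename)
    return stem + ".parquet"
```

With this reading, `data.avro` would become `data.parquet`, and a name with no extension, such as `data`, would also become `data.parquet`.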
State management:
This component does not store state.
Restricted:
This component is not restricted.
Input requirement:
This component requires an incoming relationship.
System Resource Considerations:
None specified.