Know more about the behavioral changes in cumulative hotfix 13 for 7.1.9
SP1.
Hive
- CDPD-92002: New configuration properties added for Hive metatool
summary operation
- A new configuration property is available to manage the performance
and scope of the metatool summary operation, particularly when dealing with the task running
indefinitely.
- Previous behavior:
- The Hive metatool summary operation executed without specific
controls for resource usage, time limits.
- New behavior:
- A new configuration property is available to manage the performance
and scope of the metatool summary operation, particularly when dealing with the task running
indefinitely.
| Property |
Default Value |
Description |
hive.metatool.summary.timeout |
20 min |
The maximum time the summary operation will wait to complete before being
automatically canceled. |
Impala
- CDPD-89269: Batched Insert events for Impala
- Previous behavior:
- Impala initiated insert statements using a single API call for each
partition. This could become a performance bottleneck when inserting data into tables with a
large number of partitions.
- New behavior:
- This change speeds up computing file checksums during
INSERT operations, especially for tables with many files and partitions.
This is achieved by leveraging a new batch insert API in the Hive Metastore to process events
more efficiently.
Apache Jira: IMPALA-14051