What's New in Apache Hive
Learn about the new features of Hive in Cloudera Runtime 7.1.7.
Hive Warehouse Connector Support for Spark Data Source V2
This release introduces vectorization in the Hive Warehouse Connector through the support
for Data Source V2. To enable Data Source V2, you configure
spark.datasource.hive.warehouse.read.mode=DIRECT_READER_V2
. For details,
see Reading Data through HWC.
HSMM configuration for expediting an upgrade to CDP
Customers who upgrade Hive 2 to Hive 3 can expedite the upgrade process using a new Hive Strict Metastore Migration (HSMM) configuration capability. You can configure HSMM to prevent migration of your databases and tables. Postponing migration until after the upgrade to clean the data typically saves time when you have tables using SERDEs that have been abandoned, ACIDv1 tables, and partition location problems. For details, see Expediting the Hive upgrade.
Direct Reader support for Spark-consistent timestamps
When reading tables with Hive Warehouse Connector (HWC) Direct Reader, the behavior of the timestamp type is consistent with Spark. When querying a Hive table using Beeline, the behavior of the timestamp type is also consistent with Spark except when the table is in Parquet format. For details, see Reading Data through HWC.