What's New in Apache Hive

Learn about the new features of Hive in Cloudera Runtime 7.1.7.

Hive Warehouse Connector Support for Spark Data Source V2

This release introduces vectorization in the Hive Warehouse Connector (HWC) through support for Spark Data Source V2. To enable Data Source V2, set spark.datasource.hive.warehouse.read.mode=DIRECT_READER_V2. For details, see Reading Data through HWC.
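As a minimal sketch of how the property might be passed at launch time, the following Python snippet assembles a spark-submit command line that carries the read mode as a --conf flag. Only the property name and value come from this release note; the helper build_spark_submit and the script name my_hwc_job.py are hypothetical, and any other settings that HWC requires are omitted.

```python
# Property name and value are taken from the release note.
HWC_READ_MODE_KEY = "spark.datasource.hive.warehouse.read.mode"
HWC_READ_MODE_V2 = "DIRECT_READER_V2"


def build_spark_submit(app_path, extra_conf=None):
    """Return a spark-submit argument list that sets the HWC read mode.

    Hypothetical helper for illustration; extra_conf may carry any other
    Spark properties the job needs.
    """
    conf = {HWC_READ_MODE_KEY: HWC_READ_MODE_V2}
    conf.update(extra_conf or {})
    args = ["spark-submit"]
    # Sort keys so the generated command line is deterministic.
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    args.append(app_path)
    return args


# "my_hwc_job.py" is a placeholder application script.
command = build_spark_submit("my_hwc_job.py")
print(" ".join(command))
```

The same key and value could instead be set on the Spark session or in the cluster's Spark configuration; the command-line form is shown only because it keeps the example self-contained.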

HSMM configuration for expediting an upgrade to CDP

Customers who upgrade from Hive 2 to Hive 3 can expedite the upgrade process using a new Hive Strict Managed Migration (HSMM) configuration capability. You can configure HSMM to prevent migration of your databases and tables during the upgrade. Postponing migration until after the upgrade, so that you can clean up the data first, typically saves time when you have ACIDv1 tables, tables that use abandoned SerDes, or partition location problems. For details, see Expediting the Hive upgrade.

Direct Reader support for Spark-consistent timestamps

When reading tables with Hive Warehouse Connector (HWC) Direct Reader, the behavior of the timestamp type is consistent with Spark. When querying a Hive table using Beeline, the behavior of the timestamp type is also consistent with Spark except when the table is in Parquet format. For details, see Reading Data through HWC.