Hive changelog for Cloudera Data Warehouse on premises

Review the changes introduced in Hive for Cloudera Data Warehouse on premises.

2025.0.19.1-74

  • CDPD-77993: match parquet-hadoop version with downstream
  • DWX-20989: Apply a workaround to bypass the nmap DNS resolution problem in metastore health check script
  • CDPD-80655: HIVE-28878: Query with window function and redundant GROUP BY key fails during compilation
  • DWX-20657: HIVE-28886: CalcitePlanner throws NullPointerException when column name case in ColumnStatisticsObj differs from the table schema
  • DWX-20723: remove USER env from tez-entrypoint.sh script
  • CDPD-80424: Upgrade json-smart and resolve CVE-2015-5237 for Hive
  • DWX-20242:HIVE-28856: Remove Jetty-Runner
  • CDPD-80448: Upgrade com.fasterxml.jackson.core:jackson-core and resolve CVE-2020-36518 for Hive
  • CDPD-80428: HIVE-28040: Upgrade netty to 4.1.116.Final due to CVEs
  • CDPD-80450: HIVE-28859: Upgrade commons-logging to 1.3.5 to fix CVEs
  • CDPD-80433: Upgrade avro and resolve CVE-2024-47561 for Hive
  • CDPD-78222: HIVE-28775: HiveServer2: Implement HA Health Check endpoint on a separate port from WebUI
  • DWX-20242:Exclude mina-core to fix CVE-2024-52046
  • DWX-20179: Set default UTC timezone in chainguard based hive image
  • DWX-18878: Chainguard base image update hive-arm
  • DWX-20179: Set default UTC timezone in chainguard based hive image
  • DWX-18878 (addendum): Fix date command for APP_SUBMIT_TIME_ENV
  • DWX-18878: Chainguard base image update: hive
  • CDPD-76830: HIVE-28778: Support Secure Zookeeper Access in Hive SecretManager
  • DWX-17664 HIVE-28095: Hive Query History
  • CDPD-79468: Excluded javax.el to remove CVE-2021-28170
  • DWX-20248:Fix CVE-2024-23945 in hive image.
  • DWX-20087:Upgrade Spring to 5.3.39-cloudera-5.3.44
  • CDPD-78626 Upgrade velocity-engine-core in order to fix commons-io cve
  • CDPD-78632: HIVE-28610: Bump commons-io:commons-io from 2.12.0 to 2.14.0
  • CDPD-78238: HIVE-28572: Support Distribute by and Cluster by clauses in CBO
  • CDPD-71260: HIVE-27831: Set hive.cbo.fallback.strategy to NEVER by default
  • CDPD-77934: HIVE-28532: Map Join Reuse cache allows to share hashtables for different join types
  • CDPD-77867: HIVE-28589: Not null constraint does not enforced at invalid cast
  • CDPD-75066: HIVE-28512 Retain properties in DDL_CTL_PARAMETERS_WHITELIST
  • CDPD-74622:CDPD-73371:HIVE-28065:HIVE-27477: Upgrade Bouncy castle to bcprov-jdk18on
  • CDPD-77834 (Part 2): Update the golden file of ds-q54.q and re-enable it
  • CDPD-77905 HIVE-28700: MRCompactor may cause data loss when performing the major compaction
  • CDPD-72237: HIVE-28259: Common table expression detection and rewrites using CBO
  • CDPD-77705: HIVE-28408: Support ARRAY field access in CBO
  • CDPD-75656: HIVE-28582: OOM when compiling query with many GROUP BY columns aliased multiple times
  • HIVE-28018: Don't require HiveConf for JDBC Metadata calls
  • HIVE-27874: Support datatype conversion on fetch threads
  • HIVE-27872: Support multi-stream parallel fetch in JDBC driver
  • HIVE-27873: Fix getOperationStatus and optimize fetch
  • HIVE-27887: Provide reasonable defaults for ResultSetMetaData
  • HIVE-27886: Configure Hadoop security for Kerberos when required
  • HIVE-28315: Missing classes while using hive jdbc standalone jar
  • CDPD-77713 HIVE-28669: Deadlock found when TxnStoreMutex trying to acquireLock
  • DWX-19970: HIVE-28701: OTEL: Fix race condition due to unavailability of Query Id.
  • DWX-19969: Split hive into hive-arm on CDWH to support multi-arch image generation
  • DWX-19965: HIVE-28698: OTEL: Add Retry Policy in the Exporter.
  • DWX-19627 HIVE-28620: Query result is cached in case of IOWD if the subquery is not trivial
  • DWX-19928 HIVE-28623: Implement RuntimeContext to decouple runtime data from TezTask
  • DWX-19773: Make commons-configuration dependency of atlas plugin available for hive
  • CDPD-73977: Fix dependencies after Jersey upgrade
  • CDPD-73371: HIVE-28455: Disable createDependencyReducedPom in maven-shade-plugin to not replace the original with dependency-reduced-pom.xml.
  • CDPD-73227 : fixing hive build error
  • CDPD-72360 :CDPD-69448: CDPD-67281 manage Jersey update with shaded Atlas - V2
  • CDPD-72989: HIVE-27477: Upgrade maven-shade-plugin version from 3.2.4 to 3.5.0
  • CDPD-77550: HIVE-28675: Maximize the removal of redundant columns from GROUP BY clauses
  • CDPD-44551:HIVE-26952: Include property metastore.storage.schema.reader.impl by default
  • DWX-19908: HIVE-28661: OTEL: Latency in retrieving query end time leads to thread interruption
  • DWX-19907: HIVE-28660: OTEL: Modify span names to remove inconsistency in case of completed queries
  • DWX-19786: HIVE-28648: OTEL: Add Execution Engine as part of the Query Data.
  • DWX-19767: HIVE-28613: OTEL: Collect & expose JVM metrics for LLAP daemons.
  • DWX-19766: HIVE-28542: OTEL: Implement OTEL Exporter to expose JVM details of HiveServer2
  • DWX-19765: HIVE-28505: OTEL: Implement OTEL Exporter to expose query details from HiveServer2
  • DWX-19911: HIVE-26564: Separate query live operation log and historical operation log
  • DWX-19910: HIVE-24792: Potential thread leak in Operation
  • DWX-19909: HIVE-24802: Show operation log at webui
  • CDPD-69123: Upgrade Avro to latest 1.11.3
  • CDPD-76936:Hive-28245:Upgrade Spring to 5.3.39
  • DWX-19749 HIVE-28638: Refactor stats handling in StatsRecordingThreadPool
  • CDPD-76544: Upgrading library mysql-connector-j to version 8.2.0 for fixing CVE-2023-22102
  • CDPD-77047: HIVE-28594 Handle the issue on HS2 WebUI's LDAP authentication
  • CDPD-76855: HIVE-24657: Make Beeline Logging Dependencies Explicit
  • CDPD-76853: HIVE-26638: Replace in-house CBO reduce expressions rules with Calcite's built-in classes
  • CDPD-76854: HIVE-26626: Cut dependencies between HiveXxPullUpConstantsRule and HiveReduceExpressionsRule
  • CDPD-76733: Upgrading avro library to fix cve CVE-2024-47561
  • CDPD-75555: HIVE-28564: AssertionError for jdbc queries when JDBCExpandExpressionsRule produces non flat Filter conditions
  • CDPD-75355: HIVE-28560: AssertionError in SqlImplementor when query over JDBC table contains LIMIT
  • CDPD-75356: HIVE-24255: StorageHandler with select-limit query is returning 0 rows
  • CDPD-76140: HIVE-27399: Add support for lateral views with CBO
  • DWX-19668 HIVE-28616: OrcReader is not closed in OrcEncodedDataReader
  • CDPD-71682: Backport HIVE-25719: Disable replication_metrics_ingest.q
  • CDPD-71682: Backport HIVE-28354: Fix naming in negative LLAP CLI driver config
  • CDPD-71682: Backport HIVE-25036: Disable script_broken_pipe2
  • CDPD-71682: Backport HIVE-27992: Upgrade to tez 0.10.3
  • CDPD-71682: Backport HIVE-24673: Migrate NegativeCliDriver and NegativeMinimrCliDriver to llap
  • DWX-19635: HIVE-27357: Map-side SMB Join returns incorrect result when tables have different bucket size
  • CDPD-75377: HIVE-28604 Allow HMS to configure the DataNucleus level 1 cache
  • DWX-18611: Query-coordinator fails to start in WaitForLlap if custom jars are present
  • CDPD-74482: HIVE-26968: Wrong results when shared work optimizer merges TS operator with different DPP edges
  • CDPD-74481: HIVE-27375: SharedWorkOptimizer: Fix MapJoin cache reuse
  • CDPD-74531: Metadata Summary creation fails for 7.3.1.0 based PVC runs with Oracle database
  • CDPD-75809: HIVE-28571: Basic UNIONTYPE support in CBO
  • DWX-18611: Query-coordinator fails to start in WaitForLlap if custom jars are present
  • CDPD-74817 HIVE-28544: Ensure using UTF-8 encoding in some String/Char/Varchar related operations
  • CDPD-75665: HIVE-28580 Create table using import command gives ddl pointing to incorrect location
  • CDPD-75048 HIVE-13153: SessionID is appended to thread name twice
  • CDPD-75152 HIVE-28569: ReExecuteLostAMQueryPlugin doesn't kick in if the message contains line break and diagnostics
  • CDPD-75457: Backport HIVE-28249
  • CDPD-58931: HIVE-28573: Unable to execute HPLSQl queries when hive authorization is enabled.
  • CDPD-75556: Call createDmlFragment() when inserting into a partitioned table
  • DWX-19504: HIVE-22948: QueryCache: Treat query cache locations as temporary storage
  • CDPD-44060: HIVE-28416: Another process is rebuilding the materialized view is thrown for subsequent MV rebuilds when the current MV rebuild is aborted for the same MV.
  • CDPD-75398: hive build failure in docker build
  • CDPD-74292: HIVE-28540: Special characters in user DN escaped when querying LDAP
  • DWX-19377: HIVE-28533: Fix compaction with custom pools
  • CDPD-73714: HIVE-28483: CAST string to date should return null when format is invalid
  • CDPD-74764: HIVE-28545 (Addendum) Remove global lock on HMSHandler.getMSForConf which is prone to deadlock
  • CDPD-74764: HIVE-28545: Remove global lock on HMSHandler.getMSForConf which is prone to deadlock
  • CDPD-57243: HIVE-28511: add column access information to HivePrivilegeObject
  • CDPD-74640: HIVE-25907: IOW Directory queries fails to write data to final path when query result cache is enabled
  • CDPD-74640: HIVE-21386: Extend the fetch task enhancement done in HIVE-21279 to make it work with query result cache
  • CDPD-74539: Make MariaDB fall back to MySQL
  • CDPD-74640: HIVE-28530: Set files in thread safe manner in HiveSequenceFileInputFormat
  • CDPD-74479: HIVE-28435: Upgrade cron-utils to 9.2.1
  • CDPD-74225: HIVE-28480: Disable SMB on partition hash generator mismatch across join branches in previous RS
  • DWX-17905 HIVE-28165: HiveSplitGenerator: send splits through filesystem instead of RPC in case of big payload
  • DWX-19345: HIVE-28502: Refactor method names that start with capital letters in PasswdAuthenticationProvider class
  • CDPD-74278: HIVE-27080: Support project pushdown in JDBC storage handler even when filters are not pushed
  • CDPD-74234: HIVE-28264: OOM/slow compilation when query contains SELECT clauses with nested expressions
  • CDPD-71484 HIVE-27884: LLAP: Reuse FileSystem objects from cache across different tasks in the same LLAP daemon
  • CDPD-74337: HIVE-26580: SHOW COMPACTIONS should support ordering and limiting functionality
  • CDPD-72985: HIVE-27984: Addendum: Support backward compatibility of hms thrift struct about column stats
  • DWX-19301: HIVE-28457: HS2 WEBUI: LDAP authentication
  • CDPD-72985: HIVE-27984: Support backward compatibility of hms thrift struct about column stats
  • CDPD-74205: HIVE-28484: SharedWorkOptimizer leaves residual unused operator tree that send DPP events to unknown operators
  • HIVE-27405: Throw out the detail error Invalid partition name to the clients
  • CDPD-73727: HIVE-28321: Support select alias in the having clause for CBO
  • CDPD-23041: HIVE-24815: Remove IDXS Table from Metastore Schema
  • CDPD-67834:HIVE-28375:Upgrade Nimbus-JOSE-JWT to 9.37.3
  • CDPD-74120: HIVE-28333: Skip CBO for UNIQUEJOIN and TABLESAMPLE (#5307)
  • CDPD-72007: HIVE-28310: Disable hive.optimize.join.disjunctive.transitive.predicates.pushdown by default
  • CDPD-73669: HIVE-28456: ObjectStore updatePartitionColumnStatisticsInBatch can cause connection starvation
  • DWX-18831: Enable Multi-arch docker builds for Hive on cdwh stack
  • CDPD-73304: HIVE-27730: Bump org.xerial.snappy:snappy-java from 1.1.10.1 to 1.1.10.4
  • CDPD-73303: HIVE-28313: Support ambiguous column references when CBO fallback is disabled
  • CDPD-71596: HIVE-28409: Column lineage when creating view is missing if atlas HiveHook is set
  • CDPD-72968: Add ifExists field to dropCatalogRequest
  • CDPD-73269: HIVE-28431: Fix RexLiteral to ExprNode conversion if the literal is an empty string
  • CDPD-73046 HIVE-28028: Remove duplicated proto reader/writer classes introduced in HIVE-19288
  • CDPD-70951:Upgrade Aircompressor to 0.27 due to CVE-2024-36114
  • CDPD-66081:derby version to 10.14.3.0-cloudera1
  • CDPD-62654: HIVE-27746: Hive Metastore should send single AlterPartitionEvent with list of partitions
  • CDPD-72840: HIVE-28143: After HIVE-27492 fix, some HPLSQL built-in functions like trim, lower are not working when used in insert statement.
  • CDPD-72492: HIVE-28270: Fix missing partition paths bug on drop_database
  • DWX-18297: Moving hms-module build to be part of full-build profile
  • CDPD-72605: HIVE-28371: Optimize add partitions authorization in HiveMetaStore
  • CDPD-72491: HIVE-28338: HiveMetaStoreClient connection count is not correct
  • CDPD-72470: HIVE-28314: Support non-boolean WHERE conditions in CBO
  • CDPD-72495: HIVE-28302: Let SUM UDF return NULL when all rows have non-numeric texts
  • CDPD:13332: Backport HIVE-25532: Fixing authorization for Kill Query command.
  • CDPD:13332: Backport HIVE-24751: Kill trigger in workload manager fails with "No privilege" exception when authorization is disabled
  • CDPD-13332: HIVE-23084: Implement kill query in multiple HS2 environment
  • CDPD-71122: HIVE-28286: Add filtering support for get_table_metas
  • CDPD-71687 HIVE-27972: Set 'tez' as default value in hive.execution.engine
  • CDPD-71687 HIVE-24707: Apply Sane Default for Tez Containers as Last Resort
  • CDPD-67597:Upgrade postgresql version in packaging
  • CDPD-71394 HIVE-28324: HIVE_CLUSTER_ID in env: unified way to mark a cluster - addendum empty string default
  • CDPD-71394 HIVE-28324: HIVE_CLUSTER_ID in env: unified way to mark a cluster
  • CDPD-71387: HIVE-28325: Slow compilation without owner information with Ranger
  • CDPD-70956: HIVE-28285: Exception when querying JDBC tables with Hive/DB column types mismatch
  • CDPD-67649: HIVE-28098: Fails to copy empty column statistics of materialized CTE
  • CDPD-71104: HIVE-28082: HiveAggregateReduceFunctionsRule could generate an inconsistent result
  • CDPD-67648: HIVE-28080: Propagate statistics from a source table to the materialized CTE
  • CDPD-71103: HIVE-28254: CBO (Calcite Return Path): Multiple DISTINCT leads to wrong results
  • CDPD-71184 HIVE-28312: Save query error message on Driver level
  • CDPD-71188 HIVE-27023: Add setting to prevent tez session from being opened during startup
  • DWX-18549: ASYNC_PROFILE_HOME env var points to invalid location
  • DWX-18550 HIVE-28305: ProfileServlet: add html to output formats and prepare for profiler 3.0
  • CDPD-71107: Update some defaults in download_dep_tars.sh
  • CDPD-69970: HIVE-27498: Support custom delimiter in SkippingTextInputFormat
  • CDPD-64754: HIVE-28202: Incorrect projected column size after ORC upgrade to v1.6.7
  • DWX-18331: HIVE-28121: Use direct SQL for transactional altering table parameter
  • CDPD-69326: HIVE-28287: Attempt make the scratch directory writable before failing.
  • CDPD-69080: HIVE-28190: Fix MaterializationRebuild lock heartbeat
  • CDPD-66418: Backport HIVE-27481: Addendum: Fix post-refactor issues
  • CDPD-64851: Backport HIVE-27481: TxnHandler cleanup
  • CDPD-64654: Backport HIVE-27406: CompactionTxnHandler cleanup
  • CDPD-43503: HIVE-26566: Upgrade H2 database version to 2.1.214
  • CDPD-69885: HIVE-28253: Unable to set the value for hplsql.onerror in hplsql mode.
  • CDPD-69724: HIVE-28247: Execute immediate `select count(*) from tbl` throwing ClassCastException in hplsql mode.
  • CDPD-69648: HIVE-28215: Signalling CONDITION HANDLER is not working in HPLSQL.
  • CDPD-69647: HIVE-28214: HPLSQL not using the hive variables passed through beeline using --hivevar option
  • CDPD-62373: HIVE-27755: Quote identifiers in SQL emitted by SchemaTool for MySQL
  • CDPD-24384 Backport HIVE-28173: Fixed staging dir issues on materialized views on HDFS encrypted tables
  • DWX-17033: Fixed NPE for UA non BinaryPredicate join condition.
  • CDPD-69227 HIVE-28212: MiniHS2: use a base folder which is more likely writable on the local FS