Impala changelog for CDW Public Cloud

Review the changes introduced in Impala for Cloudera Data Warehouse Public Cloud.

2024.0.18.4 (December 5, 2024) GA

  • CDPD-75836: Selectively backport CDPD-75588 to CDWH-2024.0.18.4
  • CDPD-45946: IMPALA-13386: Fix test_metastore_service.py after HIVE-27984
  • CDPD-69857: FENG: Disable test of insert partition with special characters in partition name
  • CDPD-72961: IMPALA-13280: Speed up test_under_statement_expression_limit
  • CDPD-75562: IMPALA-13469: Deflake test_query_cpu_count_on_insert
  • IMPALA-13445: Ignore num partition for unpartitioned writes
  • IMPALA-13312: Use client address from X-Forwarded-For Header in Ranger Audit Logs
  • IMPALA-13310: Add the value of the HTTP 'X-Forwarded-For' header to the runtime profile
  • CDPD-73442: IMPALA-13313: Fix ExpireQueries deadlock

2024.0.18.2-4 (October 4, 2024)

No new features or fixes.

2024.0.18.1-1 (August 15, 2024)

  • IMPALA-13270: Addressing IllegalStateException in Complex Views post upgrade
  • IMPALA-13272: Stability Improvement for analytic functions on collections

2024.0.18.0-206 (July 26, 2024)

  • IMPALA-13137: Add additional client fetch metrics columns to the queries page
  • IMPALA-12800: Implement hashCode everywhere
  • IMPALA-13159: Fix query cancellation caused by statestore failover
  • IMPALA-13152: Avoid NaN, infinite, and negative ProcessingCost
  • IMPALA-13138: Never smallify existing StringValue objects, only new ones during DeepCopy
  • IMPALA-13131: Azure OpenAI API expects 'api-key' instead of 'Authorization' in the request header
  • IMPALA-12800: Add cache for isTrueWithNullSlots() evaluation
  • IMPALA-12800: Use HashMap for ExprSubstitutionMap lookups
  • IMPALA-13119: Fix cost_ initialization at CostingSegment.java
  • IMPALA-12616: Fix test_restart_services.py::TestRestart tests for S3
  • IMPALA-13129: Move runtime filter skipping at registerRuntimeFilter
  • IMPALA-8042: Assign BETWEEN selectivity for discrete-unique column
  • IMPALA-12800: Skip O(n^2) ExprSubstitutionMap::verify() for release builds
  • IMPALA-13040: Inject larger delay for sanitized build
  • IMPALA-13134: DDL hang with SYNC_DDL enabled when Catalogd is changed to standby status
  • IMPALA-11499: Refactor UrlEncode function to handle special characters
  • IMPALA-13040: Add waiting mechanism in UpdateFilterFromRemote
  • IMPALA-13018: Block push down of conjuncts with implicit casting on base columns for JDBC tables
  • IMPALA-13058: Init first_arrival_time_ and completion_time_ with -1
  • IMPALA-13054: Avoid revisiting children in QueryStateExpanded
  • IMPALA-13061: Create query live as external table
  • IMPALA-12910: Support running TPCH/TPCDS queries for JDBC tables
  • IMPALA-13049: Add dependency management for log4j2 to use 2.18.0
  • IMPALA-13045: Wait for impala_query_live to exist
  • IMPALA-12997: Use graceful shutdown for query log tests
  • IMPALA-13012: Lower default query_log_max_queued
  • IMPALA-13005: Create Query Live table in HMS
  • IMPALA-13024: Ignore slots if using default pool and empty group
  • IMPALA-12657: Improve ProcessingCost of ScanNode and NonGroupingAggregator
  • IMPALA-12988: Calculate an unbounded version of CpuAsk
  • IMPALA-12980: Translate CpuAsk into admission control slots
  • IMPALA-12965: Add debug query option RUNTIME_FILTER_IDS_TO_SKIP
  • IMPALA-13004: Fix heap-use-after-free error in ExprTest AiFunctionsTest
  • IMPALA-12874: Identify active and standby catalog and statestore in the web debug endpoint
  • IMPALA-12564: Prevent Hive loading libfesupport.so in the minicluster during TSAN runs
  • IMPALA-12998: Add SHOW_METADATA_TABLES to ignored DDL
  • IMPALA-12963: Return parent PID when children spawned
  • IMPALA-12920: Support ai_generate_text built-in function for OpenAI's chat completion API
  • IMPALA-12969: Release JNI array if DeserializeThriftMsg failed
  • IMPALA-12881: Use getFkPkJoinCardinality to reduce scan cardinality
  • IMPALA-11430: Support custom hash schema for Kudu range tables
  • IMPALA-12960: Fix Incorrect RowsPassedThrough Metric in Streaming Aggregation
  • IMPALA-12925: Fix decimal data type for external JDBC table
  • IMPALA-12899: Temporary workaround for BINARY in complex types
  • IMPALA-12602: Unregister queries on idle timeout
  • IMPALA-12782: Show info of the event processing in /events webUI
  • IMPALA-12852: Make Kudu service start and stop independent
  • IMPALA-12426: Switches the duration fields to be stored in decimal seconds
  • IMPALA-12944: Fixes Workload Management Test Flakiness
  • IMPALA-12426: Skip Inserting HS2 Operation Queries into the Completed Queries Table
  • IMPALA-12626: Add Tables Queried to profile/history
  • IMPALA-12264: Add limit on number of HS2 sessions per user
  • IMPALA-12540: Add EventSequence arg to load
  • IMPALA-12540: Query Live Table
  • IMPALA-12824: Removes the prettyprint_duration Built-in Function
  • IMPALA-12942: Deflake test_virtual_column_file_position_generic
  • IMPALA-12487: Skip reloading file metadata for ALTER_TABLE events with trivial changes in StorageDescriptor
  • IMPALA-12928: Mask JDBC table property dbcp.password for DESC FORMATTED and SHOW CREATE TABLE
  • IMPALA-12913: Refactor Workload Management Custom Cluster Tests
  • IMPALA-12699: Set timeout for catalog RPCs
  • IMPALA-12807: Add support for mold linker
  • IMPALA-12915: Use libgtest.so when built with shared libs
  • IMPALA-12903: Querying virtual column FILE__POSITION for TEXT and JSON tables crashes Impala
  • IMPALA-12896: JDBC table must be created as external table
  • IMPALA-4545: Simplify test dimension in test_decimal_casting.py
  • IMPALA-12898: Tidy up test dimensions of test_scanner.py
  • IMPALA-12929: Skip loading HDFS permissions in local-catalog mode
  • IMPALA-12930: Fix TestExtDataSources.test_jdbc_data_source failure
  • IMPALA-12923: Fix header alignment during horizontal scrolling in query timeline
  • IMPALA-12926: Refactor BINARY type handling in the backend
  • IMPALA-12819: Sync add-opens targets
  • IMPALA-12443: Add catalog timeline for all DDL profiles
  • IMPALA-11938: Raise an error if NUM_NODES is set to an invalid value
  • IMPALA-12802: Support ALTER TABLE for JDBC tables
  • IMPALA-12924: Upgrade xmlsec to 2.2.6
  • IMPALA-12426: Query History Table
  • IMPALA-12883: Support updating the charge on an entry in the cache
  • IMPALA-12904: test_type_conversions_hive3 silently passes because of wrongly defined test dimensions
  • IMPALA-12914: TestBannedLogMessages no longer requires running cluster
  • IMPALA-12818: Intermediate Result Caching plan node framework
  • IMPALA-12835: Fix event processing without hms_event_incremental_refresh_transactional_table
  • IMPALA-12917: Skip TestEventProcessingError tests requiring tez execution on hive
  • IMPALA-12916: Fix test_event_processor_error_global_invalidate test random failure
  • IMPALA-12896: Avoid JDBC table to be set as transactional table
  • IMPALA-12831: Fix HdfsTable.toMinimalTCatalogObject() failed by concurrent modification
  • IMPALA-12832: Implicit invalidate metadata on event failures
  • IMPALA-12798: Tolerate floating point precision for regr_intercept() TIMESTAMP test
  • IMPALA-12730: Don't use -Weverything for clang-tidy
  • IMPALA-12849: Standby catalogd should reject requests from coordinators
  • IMPALA-12855: Fix NPE in firing RELOAD events when the partition doesn't exist
  • IMPALA-12678: Deflake test_skipping_batching_events
  • IMPALA-12878: Report invalid query if finalized
  • IMPALA-12860: Invoke validateDataFilesExist for RowDelta operations
  • IMPALA-12864: Deflake test_query_log_size_in_bytes
  • IMPALA-12851: Fix AllocWriteIdEvent process issue to add txnId-tableWriteIds mapping
  • IMPALA-12815: Support timestamp for scan predicates for external data source table
  • IMPALA-12833: Enabled 'catalogd_ha_reset_metadata_on_failover' by default
  • IMPALA-12821: Fix test_create_table_profile_events depend on event-processor
  • IMPALA-12825: Install thrift into the impala-python virtualenv
  • IMPALA-12834: Add number of concurrent queries to profile
  • IMPALA-12859: Ensure query log estimated memory is initialized
  • IMPALA-12838: Adds exec_options parameter to single_node_perf_run.py
  • IMPALA-12717: Fix error message for missing table descriptor
  • IMPALA-12426: QueryStateRecord Refactor
  • IMPALA-12827: Fix failures in processing AbortTxnEvent due to aborted write id is cleaned up
  • IMPALA-12426: Workload Management Supporting Changes
  • IMPALA-12840: Exclude THdfsFileDesc in getJsonCatalogObject
  • IMPALA-12793: Create JDBC table without data source
  • IMPALA-12823: Fix repeated query not found messages in impalad.INFO logs
  • IMPALA-12801: Increase query_log_ default size and bound its memory
  • IMPALA-12426: Remove Usage of Unnecessary this->
  • IMPALA-12433: Share buffers among channels in KrpcDataStreamSender
  • IMPALA-12824: Adds built-in functions prettyprint_duration and prettyprint_bytes
  • IMPALA-12842: Make node.js installation aware of CPU architecture
  • IMPALA-12830: test_webserver_hide_logs_link() could fail in the exhaustive build
  • IMPALA-12847: Expose computeScanRangeLocations and computeStats
  • IMPALA-11123: Reimplement ORC optimized count star
  • IMPALA-9086: Show Hive configurations in /hadoop-varz page
  • IMPALA-12796: Add is_footer_only in TFileSplitGeneratorSpec
  • IMPALA-12631: Improve count star performance for parquet scans
  • IMPALA-12811: Exception during re-analyze can be lost
  • IMPALA-12814: Allow use of row_regex for VERIFY_IS_NOT_IN
  • IMPALA-11568: Fix SamplingTimeSeriesCounter not correctly cleared
  • IMPALA-12816: Remove exclusion of Netty to allow SSL connections to HBase
  • IMPALA-12790: Fix overestimation in ScanNode.getInputCardinality
  • IMPALA-12803: Fix missing exchange lines in query timeline
  • IMPALA-12795: Deflake test_catalog_operation_fields
  • IMPALA-12688: Support JSON profile imports in webUI
  • IMPALA-12744: Support for regr_count() aggregate/analytic function
  • IMPALA-12788: Fix HBaseTable still get loaded even if HBase is down
  • IMPALA-12378: Add commons-dbcp2.version replacement
  • IMPALA-12578: Pass owner user of database and table to Ranger in GRANT/REVOKE
  • IMPALA-12378: Auto Ship JDBC Data Source
  • IMPALA-12780: Only show non-default options in the catalog operations page
  • IMPALA-12784: Fix bug in ScanNode.getFilteredInputCardinality
  • IMPALA-12503: Support date data type for predicates for external data source table
  • IMPALA-12783: Nested struct with varlen data crashes
  • IMPALA-12726: Simulate large-scale query in TpcdsCpuCostPlannerTest
  • IMPALA-12655: Stabilize compute-table-stats.sh
  • IMPALA-12770: Fix infinite loop for nested Case expressions
  • IMPALA-12781: ARRAY<STRUCT<s: STRING>> crashes in top-n
  • IMPALA-12718: Provides UTF-8 support for the trim functions
  • IMPALA-12778: Fix time strings in catalog operations page
  • IMPALA-12642: Fixed unit-test to verify query options for JDBC external table
  • IMPALA-12695: Crash with UNION with complex types
  • IMPALA-12448: Avoid getting stuck when refreshing a non-existent partition
  • IMPALA-12471: Skip MySQL ext JDBC tests if setup environment fails
  • IMPALA-12763: Union with string struct crashes in ASAN
  • IMPALA-12762: Fix cmake error in package building
  • IMPALA-12767: Upgrade Guava to 32.0.1 due to CVE-2023-2976
  • IMPALA-12463: Batch non-consecutive table events in the event processor
  • IMPALA-12745: Skip parallel symbol dumping with RPM/DEB packages
  • IMPALA-12125: Support for dumping symbols from RPMs without separate symbols
  • IMPALA-12746: Bump jackson.databind to 2.15.3
  • IMPALA-12727: Reduce IO threads for non-TARGET_FILESYSTEM filesystems
  • IMPALA-12740: Fix TestHdfsJsonScanNodeErrors fails in exhaustive mode
  • IMPALA-12711: Fix DDL errors are not shown in impalad logs
  • IMPALA-12590: Fix dmesg call during precommit for Ubuntu 20.04
  • IMPALA-12584: Enable strict data file access by default
  • IMPALA-12038: Switch report_benchmark_results.py to python 3
  • IMPALA-12716: Fix timeout thresholds in test_catalog_operations_with_rpc_retry
  • IMPALA-12431: Support reading compressed JSON file
  • IMPALA-12690: Add TpcdsCpuCostPlannerTest
  • IMPALA-12642: Support query options for Impala external JDBC table
  • IMPALA-12054: Lazily check Kudu flags in tests
  • IMPALA-12582: Fix crash when enabling MIN_MAX RuntimeFilter in Nested Loop Join
  • IMPALA-12665: Adjust complete_micro_batch_ length to new scratch_batch_->capacity after ScratchTupleBatch::Reset
  • IMPALA-12687: Fix key conflicts in tracking in-flight catalog operations
  • IMPALA-12356: Fix first ALTER_PARTITION event from Hive could be treated as self event
  • IMPALA-12747: Atomic update of execution state
  • IMPALA-12719: Reload filemetadata for AlterTable event of type truncate
  • IMPALA-12566: Fix RpcMgrKerberizedTest on RedHat 8
  • IMPALA-12743: Fix incremental stats are filtered out by HMS due to HIVE-27114