Iceberg changelog for Cloudera Data Warehouse on premises

Review the changes introduced in Iceberg for Cloudera Data Warehouse on premises.

2025.0.19.1-74

Hive - Iceberg changes
  • CDPD-72164: HIVE-28276: Iceberg: Make Iceberg split threads configurable when table scanning
  • CDPD-72045: HIVE-28368: Iceberg: Unable to read PARTITIONS Metadata table.
  • CDPD-71812: HIVE-28353: Iceberg: Reading *Files Metadata table fails if the column is of TIMESTAMP type.
  • CDPD-70374: HIVE-28275: Iceberg: Add support for 'IF NOT EXISTS' and 'OR REPLACE' for Create Tag.
  • CDPD-72046: HIVE-28299: Iceberg: Optimize show partitions through column projection
  • DWX-18658: HIVE-28256: Iceberg: Major QB Compaction on partition level with evolution
  • CDPD-71472: HIVE-28323: Iceberg: Allow reading tables irrespective of whether they were created with Hive engine enabled or not.
  • DWX-18477: HIVE-28282: Merging into Iceberg table fails with copy-on-write when the VALUES clause has a function call
  • CDPD-70373: HIVE-28274: Iceberg: Add support for 'IF NOT EXISTS' and 'OR REPLACE' for Create Branch.
  • CDPD-70435: HIVE-27880: Iceberg: Support creating a branch on an empty table
  • CDPD-70188: HIVE-28278: Iceberg: Stats: IllegalStateException Invalid file: file length 0
  • CDPD-69309: HIVE-28132: Iceberg: Add support for Replace Tag.
  • CDPD-69704: HIVE-28266: Iceberg: select count(*) from data_files metadata tables gives wrong result
  • CDPD-69311: HIVE-28225: Iceberg: Delete on entire table fails on COW mode.
  • DWX-17603: HIVE-28077: Iceberg: Major QB Compaction on partition level
  • CDPD-68139: HIVE-28131: Iceberg: Add support for Replace Branch.

Impala - Iceberg changes
    • IMPALA-13932: Add file path and position-based duplicate check for IcebergMergeNode
    • IMPALA-13825: Extend Docker container build to custom base images
    • IMPALA-13854: IcebergPositionDeleteChannel uses incorrect capacity
    • IMPALA-13853: Don't adjust Iceberg field IDs for data files that don't have complex types
    • IMPALA-13737: Directly load file metadata via IcebergFileMetadataLoader
    • IMPALA-13789: Defer creating Path objects in loading file metadata
    • IMPALA-13772: Fix Workload Management DMLs Timeouts
    • IMPALA-13768: Redundant Iceberg delete records are shuffled around, which causes the error "Invalid file path arrived at builder"
    • IMPALA-13594: Read Puffin stats also from older snapshots
    • CDPD-78207: Disable ESTIMATE_DUPLICATE_IN_PREAGG in downstream
    • IMPALA-13205: Do not include Iceberg position fields for MERGE statements with INSERT merge clauses
    • IMPALA-13656: MERGE redundantly accumulates memory in HDFS WRITER
    • IMPALA-13324: Enable statement rewrite for merge queries for IcebergMergeImpl
    • IMPALA-13655: UPDATE redundantly accumulates memory in HDFS WRITER
    • IMPALA-13501: Clean up uncommitted Iceberg files after validation check failure
    • IMPALA-13086: Lower AggregationNode estimate using stats predicate
    • IMPALA-13305: Better thrift compatibility checks based on pyparsing
    • IMPALA-13589: SELECT INPUT__FILE__NAME can crash Impala
    • IMPALA-11265: Part2: Store Iceberg file descriptors in encoded format
    • IMPALA-13370: Read Puffin stats from metadata.json property if available
    • IMPALA-13495: Make exceptions from the Calcite planner easier to classify
    • IMPALA-13484: Don't call alter_table() on HMS when loading Iceberg table
    • IMPALA-13325: Use RowBatch::CopyRows in IcebergDeleteNode
    • IMPALA-13467: Fix partition list size calculation for empty Iceberg scan nodes
    • IMPALA-13247: Support Reading Puffin files for the current snapshot
    • IMPALA-12861: Fix mixed file format listing for Iceberg tables
    • IMPALA-13463: Impala should ignore case of Iceberg schema elements
    • IMPALA-13425: Iceberg tables crash server with Calcite planner
    • IMPALA-13364: Schema resolution doesn't work for migrated partitioned Iceberg tables that have complex types
    • IMPALA-13220: Docs for Iceberg DROP PARTITION
    • IMPALA-11265: Part1: Clear GroupContentFiles once used
    • IMPALA-12732: Add support for MERGE statements for Iceberg tables
    • IMPALA-13254: Optimize REFRESH for Iceberg tables
    • IMPALA-12867: Filter files to OPTIMIZE based on file size
    • IMPALA-13274: Filter out illegal output for certain join nodes
    • IMPALA-13296: Check column compatibility earlier for table migration
    • IMPALA-12850: Add better error message for REFRESH iceberg_tbl PARTITION(...)
    • IMPALA-12857: Add flag to enable merge-on-read even if tables are configured with copy-on-write
    • IMPALA-13088, IMPALA-13109: Use RoaringBitmap instead of sorted vector of int64s
    • IMPALA-13085: Add warning and NULL out DECIMAL values in Iceberg metadata tables
    • IMPALA-13079: Add support for FLOAT/DOUBLE in Iceberg metadata tables
    • IMPALA-11499: Refactor UrlEncode function to handle special characters
    • IMPALA-13035: Querying metadata tables from non-Iceberg tables throws IllegalArgumentException
    • IMPALA-12973, IMPALA-11491, IMPALA-12651: Support BINARY nested in complex types in select list
    • IMPALA-13002: Iceberg V2 tables with Avro delete files aren't read properly
    • IMPALA-12543: Detect self-events before finishing DDL
    • IMPALA-12990: Fix impala-shell handling of unset rows_deleted
    • IMPALA-13003: Handle Iceberg AlreadyExistsException
    • IMPALA-13006: Restrict Iceberg tables to Parquet
    • IMPALA-12996: Add support for DATE in Iceberg metadata tables
    • IMPALA-12810: Simplify IcebergDeleteNode and IcebergDeleteBuilder
    • IMPALA-12991: Eliminate unnecessary SORT for Iceberg DELETEs
    • IMPALA-12970: Fix ConcurrentModificationException for Iceberg table scans