Iceberg changelog for Cloudera Data Warehouse on premises
Review the changes introduced in Iceberg for Cloudera Data Warehouse on premises.
2025.0.19.1-74
- Hive - Iceberg changes
-
- CDPD-72164: HIVE-28276: Iceberg: Make Iceberg split threads configurable when table scanning
- CDPD-72045: HIVE-28368: Iceberg: Unable to read PARTITIONS Metadata table.
- CDPD-71812: HIVE-28353: Iceberg: Reading *Files Metadata table files if the column is of TIMESTAMP type.
- CDPD-70374: HIVE-28275: Iceberg: Add support for 'If Not Exists' and 'or Replace' for Create Tag.
- CDPD-72046: HIVE-28299: Iceberg: Optimize show partitions through column projection
- DWX-18658: HIVE-28256: Iceberg: Major QB Compaction on partition level with evolution
- CDPD-71472: HIVE-28323: Iceberg: Allow reading tables irrespective whether they were created with hive engined enabled or not.
- DWX-18477: HIVE-28282: Merging into iceberg table fails with copy on write when values clause has a function call
- CDPD-70373: HIVE-28274: Iceberg: Add support for 'If Not Exists' and 'or Replace' for Create Branch.
- CDPD-70435: HIVE-27880: Iceberg: Support creating a branch on an empty table
- CDPD-70188: HIVE-28278: Iceberg: Stats: IllegalStateException Invalid file: file length 0
- CDPD-69309: HIVE-28132: Iceberg: Add support for Replace Tag.
- CDPD-69704: HIVE-28266: Iceberg: select count(*) from data_files metadata tables gives wrong result
- CDPD-69311: HIVE-28225: Iceberg: Delete on entire table fails on COW mode.
- DWX-17603: HIVE-28077: Iceberg: Major QB Compaction on partition level
- CDPD-68139: HIVE-28131: Iceberg: Add support for Replace Branch.
- Impala - Iceberg changes
-
-
- IMPALA-13932: Add file path and position-based duplicate check for IcebergMergeNode
- IMPALA-13825: Extend Docker container build to custom base images
- IMPALA-13854: IcebergPositionDeleteChannel uses incorrect capacity
- IMPALA-13853: Don't adjust Iceberg field IDs for data files that don't have complex types
- IMPALA-13737: Directly load file metadata via IcebergFileMetadataLoader
- IMPALA-13789: Defer creating Path objects in loading file metadata
- IMPALA-13772: Fix Workload Management DMLs Timeouts
- IMPALA-13768: Redundant Iceberg delete records are shuffled around which cause error "Invalid file path arrived at builder"
- IMPALA-13594: Read Puffin stats also from older snapshots
- CDPD-78207: Disable ESTIMATE_DUPLICATE_IN_PREAGG in downstream
- IMPALA-13205: Do not include Iceberg position fields for MERGE statements with INSERT merge clauses
- IMPALA-13656: MERGE redundantly accumulates memory in HDFS WRITER
- IMPALA-13324: Enable statement rewrite for merge queries for IcebergMergeImpl
- IMPALA-13655: UPDATE redundantly accumulates memory in HDFS WRITER
- IMPALA-13501: Clean up uncommitted Iceberg files after validation check failure
- IMPALA-13086: Lower AggregationNode estimate using stats predicate
- IMPALA-13305: Better thrift compatibility checks based on pyparsing
- IMPALA-13589: SELECT INPUT__FILE__NAME can crash Impala
- IMPALA-11265: Part2: Store Iceberg file descriptors in encoded format
- IMPALA-13370: Read Puffin stats from metadata.json property if available
- IMPALA-13495: Make exceptions from the Calcite planner easier to classify
- IMPALA-13484: Don't call alter_table() on HMS when loading Iceberg table
- IMPALA-13325: Use RowBatch::CopyRows in IcebergDeleteNode
- IMPALA-13467: Fix partition list size calculation for empty Iceberg scan nodes
- IMPALA-13247: Support Reading Puffin files for the current snapshot
- IMPALA-12861: Fix mixed file format listing for Iceberg tables
- IMPALA-13463: Impala should ignore case of Iceberg schema elements
- IMPALA-13425: Iceberg tables crash server with Calcite planner
- IMPALA-13364: Schema resolution doesn't work for migrated partitioned Iceberg tables that have complex types
- IMPALA-13220: Docs for Iceberg DROP PARTITION
- IMPALA-11265: Part1: Clear GroupContentFiles once used
- IMPALA-12732: Add support for MERGE statements for Iceberg tables
- IMPALA-13254: Optimize REFRESH for Iceberg tables
- IMPALA-12867: Filter files to OPTIMIZE based on file size
- IMPALA-13274: Filter out illegal output for certain join nodes
- IMPALA-13296: Check column compatibility earlier for table migration
- IMPALA-12850: Add better error message for REFRESH iceberg_tbl PARTITION(...)
- IMPALA-12857: Add flag to enable merge-on-read even if tables are configured with copy-on-write
- IMPALA-13088, IMPALA-13109: Use RoaringBitmap instead of sorted vector of int64s
- IMPALA-13085: Add warning and NULL out DECIMAL values in Iceberg metadata tables
- IMPALA-13079: Add support for FLOAT/DOUBLE in Iceberg metadata tables
- IMPALA-11499: Refactor UrlEncode function to handle special characters
- IMPALA-13035: Querying metadata tables from non-Iceberg tables throws IllegalArgumentException
- IMPALA-12973,IMPALA-11491,IMPALA-12651: Support BINARY nested in complex types in select list
- IMPALA-13002: Iceberg V2 tables with Avro delete files aren't read properly
- IMPALA-12543: Detect self-events before finishing DDL
- IMPALA-12990: Fix impala-shell handling of unset rows_deleted
- IMPALA-13003: Handle Iceberg AlreadyExistsException
- IMPALA-13006: Restrict Iceberg tables to Parquet
- IMPALA-12996: Add support for DATE in Iceberg metadata tables
- IMPALA-12810: Simplify IcebergDeleteNode and IcebergDeleteBuilder
- IMPALA-12991: Eliminate unnecessary SORT for Iceberg DELETEs
- IMPALA-12970: Fix ConcurrentModificationException for Iceberg table scans
-