Apache Iceberg Feature Support Matrix

Apache Iceberg is a cloud-native, high-performance open table format for organizing petabyte-scale analytic datasets on a file system or object store. With Iceberg on Cloudera, you can build an Open Data Lakehouse architecture for multi-function analytics and deploy large-scale end-to-end pipelines. Cloudera Iceberg supports the Hive, Impala, and Spark engines. The Iceberg feature support matrix lists each Iceberg feature and the engines that support it.

Open Data Lakehouse on Cloudera simplifies advanced analytics on all data with a unified platform for structured and unstructured data and integrated data services to enable any analytics use case, from ML and BI to stream analytics and real-time analytics. Apache Iceberg is the foundational technology of the Open Lakehouse architecture.

The Apache Iceberg format specification describes the following versions of tables:
  • v1: Defines large analytic data tables using open format files.
  • v2: Specifies ACID-compliant tables, including row-level deletes and updates.
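
As a rough illustration of how a format version is selected when a table is created, the following Python sketch assembles engine-specific CREATE TABLE statements. It assumes the Iceberg `format-version` table property and the commonly documented DDL forms (STORED BY ICEBERG for Hive, STORED AS ICEBERG for Impala, USING iceberg for Spark). The helper name `create_table_ddl` and the table and column names are hypothetical and do not come from this page.

```python
# Sketch only: builds DDL text; it does not connect to any engine.
ENGINE_CLAUSES = {
    "hive": "STORED BY ICEBERG",
    "impala": "STORED AS ICEBERG",
    "spark": "USING iceberg",
}


def create_table_ddl(engine: str, table: str, columns: str, format_version: int = 2) -> str:
    """Return a CREATE TABLE statement for an Iceberg table (hypothetical helper)."""
    if format_version not in (1, 2):
        raise ValueError("this sketch only covers format versions 1 and 2")
    try:
        clause = ENGINE_CLAUSES[engine.lower()]
    except KeyError:
        raise ValueError(f"unsupported engine: {engine}") from None
    # 'format-version' = '2' requests a v2 table (row-level deletes and updates).
    return (
        f"CREATE TABLE {table} ({columns}) {clause} "
        f"TBLPROPERTIES ('format-version'='{format_version}')"
    )


if __name__ == "__main__":
    for name in ("hive", "impala", "spark"):
        print(create_table_ddl(name, "db.events", "id BIGINT, ts TIMESTAMP"))
```

The generated text can be pasted into the SQL client of the corresponding engine; check the engine's own reference for additional clauses such as partitioning.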

The following tables list the latest Iceberg features, the engines that support them, and the earliest version of Cloudera and Cloudera Data Services in which each feature is supported:

Table 1. Iceberg feature support matrix
Iceberg feature Hive Impala Spark
Create table
Select
Insert
Update
Delete
Merge
Schema evolution
Partition evolution (Hidden partition) (Hidden partition)
Drop partition
Transaction support
Parquet support
Compaction
Equality deletes (read-only) (read-only) (read-only)
Positional deletes (read and write)
Merge-on-read
Copy-on-write
Optimistic concurrency control
Time travel
Rollback
Optimized Row Columnar (ORC) (read-only)
Avro support (read-only)
Snapshot isolation level
Serializable isolation level
Branching
Tagging
Fine-grained access control (Ranger) (HWC)
Table statistics
Partition statistics (no Number of Distinct Values)
Geospatial types
Partition pruning
Z-Ordering
Bucket map join (Storage partitioned join)
Migration from external table
Load data inpath
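
The merge-on-read and copy-on-write rows describe how row-level changes are written. As a hedged sketch, Apache Iceberg exposes the table properties `write.delete.mode`, `write.update.mode`, and `write.merge.mode` for choosing between the two. The Python helper below only assembles an ALTER TABLE statement. The function name `set_write_mode_ddl` and the table name are hypothetical, and whether a given engine and release honors each mode depends on the support listed on this page.

```python
# Sketch only: builds DDL text; it does not connect to any engine.
VALID_MODES = {"copy-on-write", "merge-on-read"}
VALID_OPERATIONS = {"delete", "update", "merge"}


def set_write_mode_ddl(table: str, mode: str, operations=("delete", "update", "merge")) -> str:
    """Return an ALTER TABLE statement that sets Iceberg write modes (hypothetical helper)."""
    if mode not in VALID_MODES:
        raise ValueError(f"unknown write mode: {mode}")
    unknown = [op for op in operations if op not in VALID_OPERATIONS]
    if unknown:
        raise ValueError(f"unknown operations: {unknown}")
    # One 'write.<operation>.mode' property per requested operation.
    props = ", ".join(f"'write.{op}.mode'='{mode}'" for op in operations)
    return f"ALTER TABLE {table} SET TBLPROPERTIES ({props})"


if __name__ == "__main__":
    print(set_write_mode_ddl("db.events", "merge-on-read"))
```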
Table 2. Hive feature availability
Iceberg feature Form factor Version introduced in
Create Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Select Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Insert Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Update Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Delete Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Merge Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Schema evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Partition evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Drop partition Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.8.5
Cloudera Data Warehouse on premises 1.5.4
Transaction support Cloudera on cloud 7.2.18
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Parquet support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Compaction Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4 SP1
Equality deletes Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4 SP1
Positional deletes Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Merge-on-read Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Copy-on-write Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Optimistic concurrency control Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Time travel Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Rollback Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Optimized row columnar support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Avro support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4
Snapshot isolation level Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Branching Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4
Tagging Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4
Fine-grained access control (Ranger) Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Table statistics Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Geospatial types Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Partition pruning Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Bucket map join Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Migration from external table Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Load data inpath Cloudera on cloud 7.2.17
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.9.3
Cloudera Data Warehouse on premises 1.5.4
Table 3. Impala feature availability
Iceberg feature Form factor Version introduced in
Create Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Select Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Insert Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Update Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.8.5
Cloudera Data Warehouse on premises 1.5.4
Delete Cloudera on cloud 7.2.18
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.8.1
Cloudera Data Warehouse on premises 1.5.4
Merge Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.10.1
Cloudera Data Warehouse on premises 1.5.5
Schema evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Partition evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Drop partition Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.8.5
Cloudera Data Warehouse on premises 1.5.4
Single statement transaction support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Parquet support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Major compaction Cloudera on cloud 7.2.18
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.8.5
Cloudera Data Warehouse on premises 1.5.4
Sophisticated compaction Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.10.1
Cloudera Data Warehouse on premises 1.5.5
Equality deletes Cloudera on cloud 7.3.1.500
Cloudera Base on premises 7.3.1.500
Cloudera Data Warehouse on cloud 1.9.1
Cloudera Data Warehouse on premises 1.5.4
Positional deletes Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Merge-on-read Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Optimistic concurrency control Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Time travel Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Rollback Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Optimized row columnar support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Avro support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Snapshot isolation level Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Fine-grained access control (Ranger) Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Table statistics Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Partition statistics Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Partition pruning Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Z-Ordering Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Migration from external table Cloudera on cloud 7.2.18
Cloudera Base on premises 7.3.1
Cloudera Data Warehouse on cloud 1.7.1
Cloudera Data Warehouse on premises 1.5.2
Load data inpath Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Warehouse on cloud 1.6.1
Cloudera Data Warehouse on premises 1.5.2
Table 4. Spark feature availability
Iceberg feature Form factor Version introduced in
Create Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Select Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Insert Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Update Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Delete Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Merge Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Schema evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Partition evolution Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Transaction support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Parquet support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Compaction Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Equality deletes (read-only) Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Positional deletes Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Merge-on-read Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Copy-on-write Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Optimistic concurrency control Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Time travel Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Rollback Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Optimized row columnar support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Avro support Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Snapshot isolation level Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Engineering on cloud 1.22
Cloudera Data Engineering on premises 1.5.4-h5
Serializable isolation level Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Engineering on cloud 1.22
Cloudera Data Engineering on premises 1.5.4-h5
Branching Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Engineering on cloud 1.22
Cloudera Data Engineering on premises 1.5.4-h5
Tagging Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Engineering on cloud 1.22
Cloudera Data Engineering on premises 1.5.4-h5
Table statistics Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Partition pruning Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
Z-Ordering Cloudera on cloud 7.3.1
Cloudera Base on premises 7.3.1
Cloudera Data Engineering on cloud 1.22
Cloudera Data Engineering on premises 1.5.4-h5
Bucket map join Cloudera on cloud 7.2.18
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.20
Cloudera Data Engineering on premises 1.5.2
Migration from external table Cloudera on cloud 7.2.17
Cloudera Base on premises 7.1.9
Cloudera Data Engineering on cloud 1.19
Cloudera Data Engineering on premises 1.5.2
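
To show how the "Version introduced in" column might be used in practice, the following Python sketch checks whether a deployed release is at least the earliest supported version for a feature. The three sample entries are copied from the Impala and Spark tables above. The names `MIN_VERSION`, `parse_version`, and `is_supported` are hypothetical, and the sketch assumes purely dotted numeric versions, so it deliberately rejects values such as "1.5.4 SP1" or "1.5.4-h5" rather than guessing how they order.

```python
import re

# (engine, feature, form factor) -> earliest version, taken from the tables above.
MIN_VERSION = {
    ("impala", "merge", "Cloudera on cloud"): "7.3.1.500",
    ("impala", "delete", "Cloudera on cloud"): "7.2.18",
    ("spark", "branching", "Cloudera on cloud"): "7.3.1",
}


def parse_version(text: str) -> tuple:
    """Parse a dotted numeric version such as '7.3.1.500' into a tuple of ints."""
    if not re.fullmatch(r"\d+(\.\d+)*", text):
        raise ValueError(f"unsupported version format: {text!r}")
    return tuple(int(part) for part in text.split("."))


def is_supported(engine: str, feature: str, form_factor: str, deployed: str) -> bool:
    """Return True if the deployed version is at or above the listed minimum."""
    minimum = parse_version(MIN_VERSION[(engine, feature, form_factor)])
    current = parse_version(deployed)
    # Pad with zeros so that '7.3.1' and '7.3.1.0' compare as equal.
    width = max(len(minimum), len(current))
    minimum += (0,) * (width - len(minimum))
    current += (0,) * (width - len(current))
    return current >= minimum


if __name__ == "__main__":
    print(is_supported("impala", "merge", "Cloudera on cloud", "7.3.1"))
    print(is_supported("spark", "branching", "Cloudera on cloud", "7.3.1"))
```

Comparing integer tuples avoids the string-comparison pitfall in which "7.2.9" would sort after "7.2.18".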