Data Warehousing

The Cloudera Data Platform Runtime data warehousing concepts introduce Apache Hive, Apache Iceberg, and Apache Impala. You use familiar SQL statements to query data with Hive or Impala, including data stored in Iceberg tables.

Apache Hive

Apache Hive Metastore Overview

Describes Hive Metastore (HMS), the service that stores metadata for several other services, including Hive, Spark, and Impala.

Apache Hive Overview

Presents the key features of Hive, the Hive architecture, unsupported interfaces, and how to install Hive on Tez.

Apache Iceberg

Apache Iceberg Overview

Introduces Apache Iceberg, a table format for huge analytics datasets. You can efficiently query large Iceberg tables on your object stores or file system.

Apache Impala

Apache Impala Overview

Provides an overview of Apache Impala and describes its key components.