Storage

Cloudera Runtime provides several storage components that you can choose from depending on your data requirements. Apache Hadoop HDFS is a distributed file system for storing large volumes of data. Apache Hadoop Ozone is a scalable, redundant, and distributed object store optimized for big data workloads. Apache Kudu completes Apache Hadoop’s storage layer, enabling fast analytics on rapidly changing data.

Apache Hadoop HDFS

Apache Hadoop HDFS Overview

Provides an overview of Apache Hadoop HDFS, its benefits, and its key components.

Apache Hadoop Ozone

Apache Hadoop Ozone Overview

Provides an overview of Apache Hadoop Ozone and its key components.

Apache Kudu

Apache Kudu Overview

Introduces Apache Kudu, with information on using Apache Impala with Kudu, Kudu concepts, architecture, and usage limitations.

Apache Kudu Design

Outlines effective schema design philosophies for Apache Kudu and explains how they differ from approaches used for traditional relational database schemas.