Storage

Cloudera Runtime provides several storage components; which one you use depends on your data requirements. Apache Hadoop HDFS is a distributed file system for storing large volumes of data. Apache Ozone is a scalable, redundant, distributed object store optimized for big data workloads. Apache HBase is a distributed, scalable NoSQL database that runs on top of HDFS. Apache Kudu is a columnar storage engine that completes Apache Hadoop’s storage layer, enabling fast analytics on rapidly changing data.

Provides an overview of Apache Hadoop HDFS, its benefits, and its key components.

Provides an overview of Apache Ozone and its key components.

Provides an overview of the Apache HBase database and its benefits.

Introduces Apache Kudu, with information on using Apache Impala with Kudu, Kudu concepts, architecture, and usage limitations.

Provides information about the background tasks that run to support many important maintenance activities.