Storage
Cloudera Runtime provides different types of storage components that you can use depending on your data requirements. Apache Hadoop HDFS is a distributed file system for storing large volumes of data. Apache Kudu completes Apache Hadoop’s storage layer, enabling fast analytics on fast data.
- HDFS Overview
- Provides an overview of Apache Hadoop HDFS, its benefits, and the key components.
- Apache Kudu Overview
- Introduces Apache Kudu, with information on using Apache Impala with Kudu, Kudu concepts, architecture, and usage limitations.
- Apache Kudu Design
- Outlines effective schema design philosophies for Apache Kudu, and how they differ from approaches used for traditional relational database schemas.