Apache Hadoop HDFS

Hadoop Distributed File System (HDFS) is a Java-based file system that provides scalable and reliable data storage. An HDFS cluster contains a NameNode to manage the cluster namespace and DataNodes to store data.

To understand the concepts of HDFS, you can refer to the HDFS overview and introduction documentation.

For managing, configuring, and accessing HDFS, refer to the HDFS How to documentation.