HDFS OverviewPDF version

Introduction

Hadoop Distributed File System (HDFS) is a Java-based file system that provides scalable and reliable data storage. An HDFS cluster contains a NameNode to manage the cluster namespace and DataNodes to store data.

We want your opinion

How can we improve this page?

What kind of feedback do you have?