Scaling Namespaces and Optimizing Data Storage
Also available as:
PDF
loading table of contents...

Optimizing data storage

You can consider the following options to optimize data storage in HDFS clusters: balancing data across disks of a DataNode, balancing data across the DataNodes of a cluster, increasing storage space through erasure coding, applying storage policies for archiving cold data, and using codecs for compressing data.