Scaling Namespaces and Optimizing Data Storage
Also available as:
PDF
loading table of contents...

Cluster balancing algorithm

The HDFS Balancer runs in iterations. Each iteration contains the following four steps: storage group classification, storage group pairing, block move scheduling, and block move execution.