4. Server Node Hardware Recommendations

The following recommendations provide insights into the best practices for selecting the number of nodes, storage options per node (number of disks, size of disks, MTBF, and the replication cost of disk failures), compute power per node (sockets, cores, clock speed), RAM per node, and network capability (number, speed of ports).

While the hardware considerations in this section are generally applicable to all the servers in the Hadoop and HBase cluster, the focus here is on the slave nodes (DataNodes, TaskTrackers, and RegionServers). Slave nodes represent the majority of the infrastructure. (This section provides a general guideline for bal­anced workloads on slave nodes in production environments.)

[Note]Note

Hadoop cluster nodes do not require many features typically found in an enterprise data center server.

In this section: