4. System Stack Recommendations

Achieving optimal results from your Hadoop implementation begins with choosing appropriate hardware and software. The effort involved in the planning stages can pay off dramatically in terms of the performance and the total cost of ownership (TCO) associated with the environment.

The following system stack recommendations can help during planning stages:

Machine Type

Workload Pattern/ Cluster Type

Storage

Processor (# of Cores)

Memory (GB)

Network

Slave Nodes

Balanced workload

Twelve 2-3 TB disks

8128-256

1 GB onboard, 2x10 GBE mezzanine/external

Slave Nodes

Compute-intensive workload

Twelve 1-2 TB disks

10

128-256

1 GB onboard, 2x10 GBE mezzanine/external

Slave Nodes

Storage-heavy workload

Twelve 4+ TB disks

8

128-256

1 GB onboard, 2x10 GBE mezzanine/external

NameNode

Balanced workload

Four or more 2-3 TB RAID 10 with spares

8

128-256

1 GB onboard, 2x10 GBE mezzanine/external

ResourceManager

Balanced workload

Four or more 2-3 TB RAID 10 with spares

8

128-256

1 GB onboard, 2x10 GBE mezzanine/external