Achieving optimal results from your Hadoop implementation begins with choosing appropriate hardware and software. The effort involved in the planning stages can pay off dramatically in terms of the performance and the total cost of ownership (TCO) associated with the environment.
The following system stack recommendations can help during planning stages:
Machine Type |
Workload Pattern/ Cluster Type |
Storage |
Processor (# of Cores) |
Memory (GB) |
Network |
---|---|---|---|---|---|
Slave Nodes |
Balanced workload |
Twelve 2-3 TB disks | 8 | 128-256 |
1 GB onboard, 2x10 GBE mezzanine/external |
Slave Nodes |
Compute-intensive workload |
Twelve 1-2 TB disks |
10 |
128-256 |
1 GB onboard, 2x10 GBE mezzanine/external |
Slave Nodes |
Storage-heavy workload |
Twelve 4+ TB disks |
8 |
128-256 |
1 GB onboard, 2x10 GBE mezzanine/external |
NameNode |
Balanced workload |
Four or more 2-3 TB RAID 10 with spares |
8 |
128-256 |
1 GB onboard, 2x10 GBE mezzanine/external |
ResourceManager |
Balanced workload |
Four or more 2-3 TB RAID 10 with spares |
8 |
128-256 |
1 GB onboard, 2x10 GBE mezzanine/external |