2. Typical HDP HA Cluster

When you configure an HDP HA solution using RHEL, your Hadoop cluster comprises the following two components:

  • RHEL HA cluster

  • Overall Hadoop cluster

Important

The RHEL HA cluster is a subset of the overall Hadoop cluster.

The overall Hadoop cluster typically includes the following machines and infrastructure:

  • The RHEL HA cluster machines. These machines must host only those master services that require HA (in this case, the NameNode and the JobTracker).

  • Master machines that run other master services, such as HiveServer2 and the HBase Master.

  • Slave machines that run slave daemons, such as DataNodes, TaskTrackers, and RegionServers.

  • Client machines.

  • A network that connects all the machines.
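The layout described above can be sketched as a small inventory with a check for the placement rule. This is a minimal illustration, not part of HDP or RHEL tooling: the host names, group labels, and helper functions are all hypothetical, and only the rule that RHEL HA machines host nothing but the NameNode and the JobTracker comes from the text.

```python
# Master services that require HA, as stated in the text.
HA_SERVICES = frozenset({"NameNode", "JobTracker"})

# Hypothetical inventory: host names and group labels are illustrative only.
CLUSTER = {
    "ha-node-1": {"group": "rhel_ha", "services": {"NameNode", "JobTracker"}},
    "ha-node-2": {"group": "rhel_ha", "services": {"NameNode", "JobTracker"}},
    "master-1": {"group": "master", "services": {"HiveServer2", "HBase Master"}},
    "slave-1": {"group": "slave", "services": {"DataNode", "TaskTracker", "RegionServer"}},
    "slave-2": {"group": "slave", "services": {"DataNode", "TaskTracker", "RegionServer"}},
    "client-1": {"group": "client", "services": set()},
}


def rhel_ha_hosts(cluster):
    """Return the hosts that belong to the RHEL HA cluster."""
    return {host for host, info in cluster.items() if info["group"] == "rhel_ha"}


def misplaced_services(cluster):
    """Map each RHEL HA host to the services on it that do not require HA."""
    result = {}
    for host in rhel_ha_hosts(cluster):
        extra = cluster[host]["services"] - HA_SERVICES
        if extra:
            result[host] = extra
    return result


if __name__ == "__main__":
    print("RHEL HA hosts:", sorted(rhel_ha_hosts(CLUSTER)))
    print("Misplaced services:", misplaced_services(CLUSTER))
```

In this sketch, the RHEL HA hosts form a proper subset of all hosts, and an empty result from `misplaced_services` means that no HA machine carries a service outside the HA set.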

The following illustrates the topology of a sample Hadoop cluster when configured for HDP HA using RHEL:

