Spark QuickStart Guide

Chapter 3. Installing Spark

To install Spark manually, see "Installing and Configuring Apache Spark" in the Manual Install Guide.

To install Spark on a Kerberized cluster, first read Installing Spark with Kerberos (the next topic in this Quick Start Guide).

The remainder of this section describes how to install Spark using Ambari. (For general information about installing HDP components using Ambari, see Adding a Service in the Ambari Documentation Suite.)

The following diagram shows the Spark installation process using Ambari.

To install Spark using Ambari, complete the following steps:

  1. Choose the Ambari "Services" tab.

    In the Ambari "Actions" pulldown menu, choose "Add Service." This starts the Add Service Wizard and displays the Choose Services screen.

    Select "Spark", and click "Next" to continue.

  2. On the Assign Masters screen, choose a node for the Spark History Server.

    Click "Next" to continue.

  3. On the Assign Slaves and Clients screen, specify the machine(s) that will run Spark clients.

    Click "Next" to continue.

  4. On the Customize Services screen, there are no properties that must be specified. We recommend that you use default values for your initial configuration.

    Click "Next" to continue.

  5. Ambari will display the Review screen.

    Important

    On the Review screen, make sure all HDP components are version 2.2.6 or later.

    Click "Deploy" to continue.

  6. Ambari will display the Install, Start and Test screen. The status bar and messages will indicate progress.

  7. When finished, Ambari will present a summary of results. Click "Complete" to finish installing Spark.
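The version check in step 5 is easy to get wrong by eye, because version strings do not sort correctly as plain text (for example, "2.10.0" sorts before "2.2.6" alphabetically). The following is a minimal sketch, not part of Ambari, that compares versions numerically. It assumes you have copied each component's HDP version string from the Review screen by hand; the helper names and the sample component versions are hypothetical and used only for illustration.

```python
# Minimum HDP version required by the Review screen check in step 5.
MINIMUM_HDP = (2, 2, 6)


def parse_version(text):
    """Turn '2.2.6' or '2.2.6.0-2800' into a tuple of ints.

    Anything after the first '-' is treated as a build suffix and ignored.
    """
    core = text.split("-", 1)[0]
    return tuple(int(part) for part in core.split("."))


def meets_minimum(version, minimum=MINIMUM_HDP):
    """Return True if the version string is at or above the minimum."""
    return parse_version(version) >= minimum


def outdated_components(versions, minimum=MINIMUM_HDP):
    """Return the sorted names of components below the minimum version."""
    return sorted(
        name for name, version in versions.items()
        if not meets_minimum(version, minimum)
    )


# Hypothetical values, as they might be copied from the Review screen.
review_screen = {"HDFS": "2.2.6.0-2800", "YARN": "2.2.4.2-2"}
print(outdated_components(review_screen))
```

Any component name that this prints should be upgraded before you click "Deploy".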

Caution

Ambari will create and edit several configuration files. Do not edit these files directly if you configure and manage your cluster using Ambari.
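After the wizard finishes, you can confirm the result without touching any Ambari-managed files by asking Ambari itself. The sketch below is an illustration under stated assumptions, not an official procedure: it assumes that the Ambari REST API exposes a service at `/api/v1/clusters/<cluster>/services/SPARK`, that the response contains a `ServiceInfo.state` field, and that HTTP basic authentication is enabled. The host name, cluster name, and function names are hypothetical. Verify the endpoint against the documentation for your Ambari version before relying on it.

```python
import base64
import json
import urllib.request


def service_url(base_url, cluster, service="SPARK"):
    """Build the (assumed) Ambari REST URL for one service in a cluster."""
    return f"{base_url.rstrip('/')}/api/v1/clusters/{cluster}/services/{service}"


def service_state(payload):
    """Extract the service state from a JSON response body.

    Assumes the body has the shape {"ServiceInfo": {"state": "..."}}.
    """
    return json.loads(payload)["ServiceInfo"]["state"]


def fetch_service_state(base_url, cluster, user, password, service="SPARK"):
    """Send a read-only GET request to Ambari and return the service state."""
    request = urllib.request.Request(service_url(base_url, cluster, service))
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    request.add_header("Authorization", f"Basic {token}")
    with urllib.request.urlopen(request) as response:
        return service_state(response.read().decode("utf-8"))


# Offline demonstration with a hand-written sample response (hypothetical).
sample = '{"ServiceInfo": {"service_name": "SPARK", "state": "STARTED"}}'
print(service_url("http://ambari.example.com:8080", "mycluster"))
print(service_state(sample))
```

To query a live server, call `fetch_service_state` with your own Ambari URL, cluster name, and credentials. The request only reads status; it does not change any configuration.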