Installing Spark

Apache Spark is included with CDH 5. To use Apache Spark with CDH 4, you must install both CDH and Spark on the hosts that will run Spark.

Installing Spark after Upgrading Cloudera Manager

If you have just upgraded Cloudera Manager from a version that did not support Spark, the Spark software is not installed automatically. (Upgrading Cloudera Manager does not automatically upgrade CDH or other managed services).

You can add Spark using parcels; go to the Hosts tab, and select the Parcels tab. You should see at least one Spark parcel available for download. See Parcels for detailed instructions on using parcels to install or upgrade Spark. If you do not see any Spark parcels available, click the Edit Settings button on the Parcels page to go to the Parcel configuration settings and verify that the Spark parcel repo URL (https://archive.cloudera.com/spark/parcels/latest/) has been configured in the Parcels configuration page. See Parcel Configuration Settings for more details.

Post Installation Configuration

See Managing Spark Using Cloudera Manager for instructions on adding the Spark service.