Optional: Spark Manual Upgrade Procedure
(Optional) Upgrade Spark from 1.3.1 to 1.4.1. As root:
Stop Spark 1.3.1:
su - spark -c "/usr/hdp/current/spark-client/sbin/stop-history-server.sh"
.Remove Spark 1.3.1:
yum erase "spark*"
.Add the node where you want Spark 1.4.1 History Server to run:
su - root
wget -nv http://s3.amazonaws.com/dev.hortonworks.com/HDP/centos6/2.x/BUILDS/2.3.2.0-2950/hdpbn.repo -O /etc/yum.repos.d/Spark141TP.repo
yum install spark_2_3_2_0_2950-master -y
To use Python:
yum install spark_2_3_2_0_2950-python
conf-select create-conf-dir --package spark --stack-version 2.3.2.0-2950 --conf-version 0
cp /etc/spark/2.3.0.0-2950/0/* /etc/spark/2.3.2.0-2950/0/
conf-select set-conf-dir --package spark --stack-version 2.3.2.0-2950 --conf-version 0
hdp-select set spark-client 2.3.2.0-2950
hdp-select set spark-historyserver 2.3.2.0-2950
Validate the Spark installation. As user spark, run SparkPI example:
su - spark -c "cd /usr/hdp/current/spark-client"
./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client --num-executors 3 --driver-memory 512m --executor-memory 512m --executor-cores 1 lib/spark-examples*.jar 10
Restart Spark on YARN in either yarn-cluster mode or yarn-client mode:
yarn-cluster mode:
./usr/hdp/current/spark-client/bin/spark-submit --class path.to.your.Class --master yarn-cluster [options] <app jar> [app options]
yarn-client mode:
./usr/hdp/current/spark-client/bin/spark-shell --master yarn-client