Apache Spark Post Upgrade Migration Steps
After upgrading to CDH 6, you might have multiple Spark services configured, each with their own set of configurations, including event log locations. Decide which service to keep and
then manually merge the two services.
Manually merge your Spark services by performing the following steps:
- Copy all relevant configurations from the service you are removing to the service you are keeping. To view and edit the configurations:
- In the Cloudera Manager Admin Console, go to the Spark service you are removing.
- Click the Configuration tab.
- Note the configurations.
- Go to the Spark service you are keeping and replicate the configuration.
- Click Save Changes.
- To preserve historic event logs:
- Identify the location of the event logs for the service you are removing:
- In the Cloudera Manager Admin Console, go to the Spark service you are removing.
- Click the Configuration tab.
- Search for: spark.eventLog.dir
- Note the path.
- Log into a cluster host and run the following command:
hadoop fs -mv <old_Spark_Event_Log_dir>/* <new_location>/.
- Identify the location of the event logs for the service you are removing:
- Using Cloudera Manager, stop and delete the Spark Service you have selected for removal:
- In the Cloudera Manager Admin Console, click the drop-down arrow next to the Spark service you are removing, and then select Stop.
- Click the drop-down arrow next to the Spark service you are removing, and then select Delete.
- Restart the remaining Spark service: Click the drop-down arrow next to the Spark service, and then select Restart.