Migrating Apache Spark Before Upgrading to CDH 6
If you are upgrading to CDH 6 from CDH 5 and have a Spark service installed, there are several pre-upgrade steps you might need to take:
Remove Spark (Standalone) Service
- Log in to the Cloudera Manager Admin Console.
- Click the drop-down arrow next to the Spark (Standalone) service and select Stop.
- Click the drop-down arrow next to the Spark (Standalone) service and select Delete.
Set Alternatives Priorities for Built-in CDH 5 Spark (1.6) and CDS 2
If you are using both the built-in Spark 1.6 service in CDH 5 and a CDS 2 parcel, and both services have Gateway roles on the same hosts, increase the alternatives priority of the service that you want to use as the default service after the upgrade.
- Log in to the Cloudera Manager Admin Console.
- Select the Cluster where the Spark services are running.
- Select the Spark service that you want to use as the default service after upgrading.
- Click the Configuration tab.
- Search for the Alternatives Priority property.
- Set the value higher than the Alternatives Priority for any other Spark service.
- Click Save Changes.
Remove CDS 2 Version Higher than the CDH 6 Spark 2 Version
If you are using a CDS 2 minor version higher than the version of Spark 2 included in the CDH 6 release you are upgrading to (2.2 in CDH 6.0.0), you must remove your Spark 2 services from Cloudera Manager. For the purpose of this evaluation, you can ignore maintenance versions. For example, if the Spark 2 version in the CDH 6 version you are upgrading to is 2.2, and you are using any maintenance version of CDS 2.2.0, you do not need to remove your Spark 2 services from Cloudera Manager. They will be automatically converted to use the built-in CDH 6 Spark version, and the CDS parcel will be disabled.
Deleting a Spark service in Cloudera Manager does not delete the associated event logs from HDFS. The CDH 6 upgrade wizard installs Spark 2.2.
- Log in to the Cloudera Manager Admin Console.
- Select the Cluster where the Spark 2 service is running.
- Click the drop-down arrow next to the Spark service and select Stop.
- Click the drop-down arrow next to the Spark service and select Delete.
- After upgrading to CDH 6, add the Spark service. For instructions, see Adding a Service.