In Cloudera Data Science Workbench clusters that use an HTTP proxy, follow these steps
to support web-related actions in Spark. You must set the Spark configuration parameter
spark.driver.extraJavaOptions on your gateway hosts so that the Spark driver JVM is
started with the proxy host and port settings.
- Log in to Cloudera Manager.
- Go to .
- Filter the properties with and .
- Scroll down to Spark 2 Client Advanced Configuration Snippet (Safety Valve) for spark2-conf/spark-defaults.conf.
- Enter the following configuration code, substituting your proxy host and port values:
spark.driver.extraJavaOptions= \
-Dhttp.proxyHost=<YOUR HTTP PROXY HOST> \
-Dhttp.proxyPort=<HTTP PORT> \
-Dhttps.proxyHost=<YOUR HTTPS PROXY HOST> \
-Dhttps.proxyPort=<HTTPS PORT>
- Click Save Changes.
- Choose .
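
The safety-valve entry in the steps above is easy to mistype, so it can help to generate it rather than edit it by hand. The following is a minimal sketch, not part of Cloudera Manager or Spark: the function names build_proxy_java_options and render_safety_valve_line are hypothetical, and proxy.example.com and port 3128 are placeholder values. It assembles the four JVM flags into the single-line form of the same property and rejects ports outside the valid range.

```python
def build_proxy_java_options(http_host, http_port, https_host, https_port):
    """Return the four proxy JVM flags as one space-separated string.

    Hypothetical helper for illustration; hosts and ports are supplied by the caller.
    """
    # Reject ports outside the valid TCP range before building the flags.
    for name, port in (("http", http_port), ("https", https_port)):
        if not 1 <= int(port) <= 65535:
            raise ValueError(f"{name} proxy port out of range: {port}")
    flags = [
        f"-Dhttp.proxyHost={http_host}",
        f"-Dhttp.proxyPort={int(http_port)}",
        f"-Dhttps.proxyHost={https_host}",
        f"-Dhttps.proxyPort={int(https_port)}",
    ]
    return " ".join(flags)


def render_safety_valve_line(java_options):
    """Return the spark-defaults.conf entry in single-line form."""
    return f"spark.driver.extraJavaOptions={java_options}"


if __name__ == "__main__":
    # Placeholder values; substitute your own proxy host and ports.
    opts = build_proxy_java_options("proxy.example.com", 3128, "proxy.example.com", 3128)
    print(render_safety_valve_line(opts))
```

The printed line can be pasted into the safety valve in place of the multi-line form. After the client configuration has been deployed, a new PySpark session should report the same string from spark.sparkContext.getConf().get("spark.driver.extraJavaOptions"), assuming nothing overrides the property at the session level.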