Using Cloudera's Distribution of Apache Spark 2
For an architectural overview of how Cloudera's Distribution of Apache Spark 2 works with Cloudera Data Science Workbench, see Overview: Cloudera Distribution of Apache Spark 2. The rest of this guide describes how to set Spark 2 environment variables, manage package dependencies, and how to configure logging. It also consists of instructions and sample code for running R, Scala, and Python projects from Spark 2.