(Optional) Configure the Sync Between Cloudera Data Science Workbench and PyCharm
Configuring what files PyCharm ignores can help you adhere to IT policies.
Before you configure syncing behavior between the remote
editor and Cloudera Data Science Workbench, ensure that you understand the policies set forth
by IT and the Site Administrator. For example, a policy might require that data remains within
the Cloudera Data Science Workbench deployment but allow you to download and edit code.
In your project, go to Preferences.
Depending on your operating system, Preferences may be called
Settings.
Go to Build, Execution, Deployment and select
Deployment.
On the Connection tab, add the following path to the
Root path field:
/home/cdsw
On the Excluded Paths tab, add any paths you want to exclude.
Cloudera recommends excluding the following paths at a minimum:
/home/cdsw/.local
/home/cdsw/.cache
/home/cdsw/.ipython
/home/cdsw/.ipython
/home/cdsw/.oracle_jre_usage
/home/cdsw/.pip
/home/cdsw/.pycharm_helpers
Optionally, add a Deployment path on the
Mappings tab if the code for your Cloudera Data Science Workbench
project lives in a subdirectory of the root path.
Expand Deployment in the left navigation and go to Options > Upload changed files automatically to the default server and set the behavior to adhere to the policies set forth by IT and the Site
Administrator.
Cloudera recommends setting the behavior to Automatic upload
because the data remains on the cluster while your changes get uploaded.
Sync for the project file(s) to your machine and begin editing.
This site uses cookies and related technologies, as described in our privacy policy, for purposes that may include site operation, analytics, enhanced user experience, or advertising. You may choose to consent to our use of these technologies, or