Configuring what files PyCharm ignores can help you adhere to IT policies.
Before you configure syncing behavior between the remote
editor and Cloudera Data Science Workbench, ensure that you understand the policies set forth
by IT and the Site Administrator. For example, a policy might require that data remains within
the Cloudera Data Science Workbench deployment but allow you to download and edit code.
-
In your project, go to Preferences.
Depending on your operating system, Preferences may be called
Settings.
-
Go to Build, Execution, Deployment and select
Deployment.
-
On the Connection tab, add the following path to the
Root path field:
-
On the Excluded Paths tab, add any paths you want to exclude.
Cloudera recommends excluding the following paths at a minimum:
- /home/cdsw/.local
- /home/cdsw/.cache
- /home/cdsw/.ipython
- /home/cdsw/.ipython
- /home/cdsw/.oracle_jre_usage
- /home/cdsw/.pip
- /home/cdsw/.pycharm_helpers
-
Optionally, add a Deployment path on the
Mappings tab if the code for your Cloudera Data Science Workbench
project lives in a subdirectory of the root path.
-
Expand Deployment in the left navigation and go to and set the behavior to adhere to the policies set forth by IT and the Site
Administrator.
Cloudera recommends setting the behavior to Automatic upload
because the data remains on the cluster while your changes get uploaded.
-
Sync for the project file(s) to your machine and begin editing.