Creating a Project with Legacy Engine Variants
To create a Cloudera Data Science Workbench project:
- Go to Cloudera Data Science Workbench and on the left sidebar, click Projects.
- Click New Project.
- If you are a member of a team, from the drop-down menu, select the Account under which you want to create this project. If there is only one account on the deployment, you will not see this option.
- Enter a Project Name.
- Select Project Visibility from one of the following options:
- Private - Only project collaborators can view or edit the project.
- Team - If the project is created under a team account, all members of the team can view the project. Only explicitly-added collaborators can edit the project.
- Public - All authenticated users of Cloudera Data Science Workbench will be able to view the project. Collaborators will be able to edit the project.
- Under Initial Setup, you can either create a blank project or select one of the following sources for your project files:
- Built-in Templates - Template projects contain example code that can help you get started with Cloudera Data Science Workbench. They are available in R, Python, PySpark, and Scala. Using a template project is not required, but it helps you start using Cloudera Data Science Workbench right away.
- Custom Templates - Starting with version 1.3, site administrators can add template projects that are customized for their organization's use cases. For details, see Custom Template Projects.
- Local - If you have an existing project on your local disk, use this option to upload compressed files or folders to Cloudera Data Science Workbench.
- Git - If you already use Git for version control and collaboration, you can continue to do so with Cloudera Data Science Workbench. Specifying a Git URL will clone the project into Cloudera Data Science Workbench. If you use a Git SSH URL, your personal private SSH key will be used to clone the repository. This is the recommended approach. However, you must add the public SSH key from your personal Cloudera Data Science Workbench account to the remote Git hosting service before you can clone the project. Specify your username and password in the URL as
- Click Create Project.
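- After the project is created, you can see your project files and the list of jobs defined in your project. As part of the project filesystem, Cloudera Data Science Workbench also creates a .gitignore file with the following entries:
  R
  node_modules
  *.pyc
  .*
  !.gitignore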
- (Optional) To work with team members on a project, use the instructions in the following section to add them as collaborators to the project.
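
The Project Visibility options described in the steps above can be summarized as a small access-check sketch. This is an illustrative model only: the `Project` class and the `can_view` and `can_edit` functions are hypothetical names and are not part of any Cloudera Data Science Workbench API. It also assumes, for simplicity, that the project owner is listed among the collaborators and that every collaborator can edit.

```python
from dataclasses import dataclass, field
from typing import Set

# Hypothetical model for illustration; not a Cloudera Data Science Workbench API.
VISIBILITIES = {"private", "team", "public"}


@dataclass
class Project:
    name: str
    visibility: str  # one of "private", "team", "public"
    # Assumption: the owner is included in this set.
    collaborators: Set[str] = field(default_factory=set)
    # Empty when the project is not created under a team account.
    team_members: Set[str] = field(default_factory=set)

    def __post_init__(self) -> None:
        if self.visibility not in VISIBILITIES:
            raise ValueError(f"unknown visibility: {self.visibility!r}")


def can_edit(project: Project, user: str) -> bool:
    """Only explicitly added collaborators can edit, whatever the visibility."""
    return user in project.collaborators


def can_view(project: Project, user: str, authenticated: bool = True) -> bool:
    """Apply the Private / Team / Public viewing rules described above."""
    if not authenticated:
        return False
    if user in project.collaborators:
        return True
    if project.visibility == "public":
        return True  # any authenticated user can view
    if project.visibility == "team":
        return user in project.team_members
    return False  # private: collaborators only
```

For example, under these assumptions a team member who has not been added as a collaborator can view a Team project but cannot edit it, while a user outside the team can do neither.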