Associating a Python virtual environment with a Cloudera Data Engineering job
You can associate the Python virtual environment with a Cloudera Data Engineering job at the time of creation, or you can update an existing job.
Before you begin
- Download and configure the CDE CLI.
- Create a Python virtual environment Cloudera Data Engineering resource.
- Create a Cloudera Data Engineering job.
Steps
- Using the CDE CLI, run the
cde job update
command to associate a Python virtual environment with the job.cde job update --name pyspark-example --python-env-resource-name cde-python-env-resource
Before you begin
- Create a Python virtual environment Cloudera Data Engineering resource.
- Create a Cloudera Data Engineering job.
Steps
- In the Cloudera console, click the Data Engineering tile. The Cloudera Data Engineering Home page displays.
- Click Job Runs on the left navigation menu. The Job Runs page displays.
- Using the dropdown menu, select the virtual cluster containing the application you
want to manage.
- Click on the name of the job you want to modify.
- Go to the Configuration tab.
- Click Edit.
- In the Python Environment section, click Select Python Environment.
- Select the Python virtual environment resource you want to use, and then click Select Resource.
- At the bottom of the page, click Update and Run to run the job immediately, or click the drop-down arrow on the button and select Update to update the job without running it.