Cloudera AI Inference service: A Cloudera AI Inference service
instance must be created. For instructions on creating a Cloudera AI Inference service instance, see Create a Cloudera AI Inference service
instance.
In the Cloudera console, click
the Cloudera AI
tile.
Click Applications under Deployments on the left navigation menu.
The Applications home page displays.
Click the Deploy Application button.
The Applications / Create Application page displays.
From the Select Environment & Inference Service drop-down list,
select your Cloudera environment and the Cloudera AI Inference service instance within which you want to create the
application.
In the Name textbox, enter a name for the application.
In the Subdomain textbox, enter a subdomain name. The application
will be deployed at https://<subdomain>.<ai_inference_domain>.
In the Description textbox, add a description of the application.
For the Select Source field, choose Git if your application is from a Git repository or Docker if it is a container image.
For Git repositories: Enter the Git URL. For a private repository, enter the username and SSH key.
For Docker images: Enter the Docker Image URL. For a private image, enter the username and access token.
For the Authentication Type field, select SSO for a web application or JWT for an API-based application.
For the Resource Profile field, configure the following resource requirements for each application replica:
In the Instance Type field, select the instance type from the drop-down list.
In the GPU field, specify the number of GPUs each application replica requires.
In the CPU field, specify the number of CPUs each application replica requires.
In the Memory field, specify the amount of CPU memory each application replica requires.
Using the Autoscale Range slider bar, specify the minimum and
maximum number of replicas for the application. The autoscaler scales the replicas to manage
the incoming workload.
In the Environment Variables field, add any custom key and value pairs your application needs.
In the Tags field, add any custom key and value pairs your application needs.
Click the Create Application button to deploy the application.