Creating a Model Endpoint using API

You can select a specific Cloudera AI Inference service instance and a model version from Cloudera AI Registry to create a new model endpoint.

Retrieve the model ID and model version of the registered model.

This information is available on the Registered Models page in the Cloudera AI control plane UI.

Create the model specification for the selected model.

# cat ./examples/mlflow/model-spec-cml-registry.json

The following is a sample model specification:

{
  "namespace": "serving-default",
  "name": "mlflow-wine-test-from-registry-onnx",
  "source": {
    "registry_source": {
      "version": 2,
      "model_id": "3azn-tmqe-wsze-5u4s"
    }
  }
}
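
If no specification file exists yet, one way to produce it is with a shell here-document. The following is a minimal sketch, not an official procedure: the variable names (`MODEL_ID`, `MODEL_VERSION`, `ENDPOINT_NAME`, `SPEC_FILE`) and the output path are assumptions made for illustration, and the values are copied from the sample above.

```shell
# Hypothetical variables; replace the values with the model ID and version
# shown on the Registered Models page.
MODEL_ID="3azn-tmqe-wsze-5u4s"
MODEL_VERSION=2
ENDPOINT_NAME="mlflow-wine-test-from-registry-onnx"
SPEC_FILE="./model-spec.json"   # assumed output path

# Write the specification with the same fields as the sample above.
cat > "${SPEC_FILE}" <<EOF
{
  "namespace": "serving-default",
  "name": "${ENDPOINT_NAME}",
  "source": {
    "registry_source": {
      "version": ${MODEL_VERSION},
      "model_id": "${MODEL_ID}"
    }
  }
}
EOF

# Print the file to confirm its contents.
cat "${SPEC_FILE}"
```

Keeping the ID and version in variables makes it easier to reuse the same template for another registered model.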

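Set the DOMAIN environment variable to the domain name of the Cloudera AI Inference service instance:

export DOMAIN=$(cdp ml describe-ml-serving-app --app-crn [***app-crn***] | jq -r '.app.cluster.domainName')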

Create the model endpoint by calling the Cloudera AI serving deployEndpoint API:
```
curl -v \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer ${CDP_TOKEN}" \
  "https://${DOMAIN}/api/v1alpha1/deployEndpoint" \
  -d @./examples/mlflow/model-spec-cml-registry.json
```
The DOMAIN value looks similar to ml-67814ad5-b79.eng-ml-d.xcu2-8y8x.dev.cldr.work.

To obtain the CDP_TOKEN, follow the steps in Preparing to interact with the Cloudera AI.
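
Because the request depends on DOMAIN, CDP_TOKEN, and the specification file all being in place, it can help to check them before calling the API. The following is a minimal sketch under that assumption. The `deploy_endpoint` function is a hypothetical helper, not part of the Cloudera tooling, and it uses curl's standard `--fail` option so that an HTTP error response produces a non-zero exit code.

```shell
# Hypothetical helper: verifies prerequisites, then posts the specification
# to the deployEndpoint API shown above.
deploy_endpoint() {
  spec_file="$1"

  # Stop early if either required variable is unset or empty.
  if [ -z "${DOMAIN:-}" ] || [ -z "${CDP_TOKEN:-}" ]; then
    echo "DOMAIN and CDP_TOKEN must be set" >&2
    return 1
  fi

  # Stop early if the specification file does not exist.
  if [ ! -f "${spec_file}" ]; then
    echo "Specification file not found: ${spec_file}" >&2
    return 1
  fi

  # --fail makes curl exit with a non-zero status on HTTP 4xx/5xx responses.
  curl --fail \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer ${CDP_TOKEN}" \
    "https://${DOMAIN}/api/v1alpha1/deployEndpoint" \
    -d @"${spec_file}"
}
```

A typical call would be `deploy_endpoint ./examples/mlflow/model-spec-cml-registry.json`, run after DOMAIN and CDP_TOKEN have been exported.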

Note: serving-default is the only namespace into which a model endpoint can be deployed.
