Cloudera AI Inference service

Making an inference call to a Model Endpoint with an OpenAI API

Language models for text generation are deployed using NVIDIA’s NIM microservices. These model endpoints are compliant with the OpenAI Protocol. See NVIDIA NIM documentation for supported OpenAI APIs and NVIDIA NIM specific extensions such as Function Calling and Structured Generation.

We want your opinion

How can we improve this page?

What kind of feedback do you have?