Supported model artifact formats for Cloudera AI Inference service
Cloudera AI Inference service supports the following model artifact formats:
- Text-generating and embedding Large Language Models (LLMs) packaged as NVIDIA NIM.
- Predictive models in the ONNX representation, registered to Cloudera AI Registry from a Cloudera AI Workbench. See Register an ONNX model to Cloudera AI Registry for an example showing how to convert a scikit-learn (sklearn) model to ONNX and then register it to the Cloudera AI Registry. For models built with PyTorch or TensorFlow, refer to the Export a PyTorch model to ONNX or Getting Started Converting TensorFlow to ONNX documentation to learn how to convert them to the ONNX representation.
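
As a rough illustration of the conversion step mentioned in the ONNX item above, the following Python sketch trains a small scikit-learn classifier and serializes it to an ONNX file. It assumes the scikit-learn, numpy, and skl2onnx packages are installed. The function name export_sklearn_to_onnx, the Iris dataset, and the output file name are hypothetical choices made for illustration and are not taken from the Cloudera documentation. Registering the resulting file to Cloudera AI Registry is not shown here; follow the referenced registration topic for that step.

```python
from pathlib import Path


def export_sklearn_to_onnx(output_path="iris_logreg.onnx"):
    """Train a small classifier and write it to disk in ONNX format.

    Hypothetical helper for illustration only; not part of any Cloudera API.
    """
    # Imported inside the function so the module loads even when the
    # optional packages are missing (assumption: they are installed
    # in the environment where the conversion actually runs).
    import numpy as np
    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from skl2onnx import to_onnx

    # ONNX converters generally expect float32 inputs, so cast up front.
    X, y = load_iris(return_X_y=True)
    X = X.astype(np.float32)

    model = LogisticRegression(max_iter=500).fit(X, y)

    # A single sample row lets skl2onnx infer the input type and shape.
    onnx_model = to_onnx(model, X[:1])

    path = Path(output_path)
    path.write_bytes(onnx_model.SerializeToString())
    return path


if __name__ == "__main__":
    print(f"Wrote {export_sklearn_to_onnx()}")
```

The resulting .onnx file is the kind of artifact that the ONNX item above describes. Models built with PyTorch or TensorFlow would go through their own exporters instead, as covered in the documentation referenced above.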
