Cloudera AI Inference service using OpenAI Python SDK client on a local machine
Consider the following guidelines when using the OpenAI Python SDK client with Cloudera AI Inference service.
Specify the CDP_TOKEN as the API_KEY. A common way to do this is to export CDP_TOKEN as an environment variable and read it from your Python code:
from openai import OpenAI
import os

# Read the CDP token that was exported as an environment variable.
API_KEY = os.environ['CDP_TOKEN']
# Replace the [***...***] placeholders with your model name and endpoint base URL.
MODEL_NAME = "[***MODEL_NAME***]"

client = OpenAI(
    base_url="[***BASE_URL***]",
    api_key=API_KEY,
)

# Request a streamed chat completion.
completion = client.chat.completions.create(
    model=MODEL_NAME,
    messages=[{"role": "user", "content": "Write a one-sentence definition of GenAI."}],
    temperature=0.2,
    top_p=0.7,
    max_tokens=1024,
    stream=True,
)

# Print each piece of generated text as it arrives.
for chunk in completion:
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")