Required parameters
When deploying the S3 to Chroma DB [Technical Preview] ReadyFlow, you must provide the following parameters. Use the information you collected in Prerequisites.
Parameter Name | Description |
---|---|
CDP Workload User | Specify the Cloudera machine user or workload username that you want to use to authenticate to the object stores (via IDBroker). Ensure this user has the appropriate access rights to the object store locations. |
CDP Workload User Password | Specify the password of the Cloudera machine user or workload user you are using to authenticate against the object stores (via IDBroker). |
CDPEnvironment | The Cloudera Environment configuration resources. |
Chroma Collection Name | Specify the name of the Chroma collection to write to. |
Chroma Server Authentication Token | Specify the authentication token used to authenticate to Chroma. |
Chroma Server Hostname | Specify the Chroma server host name. |
Chroma Server Port | Specify the Chroma server port. The default value is "443". |
OpenAI API Key | Specify the API key used to authenticate to OpenAI. |
OpenAI Model Name | Specify the OpenAI model name to use for embedding the data. The default value is "text-embedding-ada-002". |
Source S3 Bucket | Specify the name of the source S3 bucket you want to read from. |
Source S3 Path | Specify the path within the source bucket where you want to read files from. |
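
Before starting the deployment wizard, it can help to confirm that you have a value for every parameter in the table. The following is a minimal sketch of such a check, not part of the ReadyFlow or of any Cloudera tooling. The function `resolve_parameters` and the dictionary layout are hypothetical; only the parameter names and the two default values come from the table above.

```python
# Hypothetical pre-deployment checklist. The parameter names and the two
# defaults are taken from the table above; everything else is illustrative.

# Defaults documented in the table.
DEFAULTS = {
    "Chroma Server Port": "443",
    "OpenAI Model Name": "text-embedding-ada-002",
}

# Every parameter listed in the table, in the same order.
REQUIRED = [
    "CDP Workload User",
    "CDP Workload User Password",
    "CDPEnvironment",
    "Chroma Collection Name",
    "Chroma Server Authentication Token",
    "Chroma Server Hostname",
    "Chroma Server Port",
    "OpenAI API Key",
    "OpenAI Model Name",
    "Source S3 Bucket",
    "Source S3 Path",
]


def resolve_parameters(supplied):
    """Fill in documented defaults and list parameters that are still empty.

    Returns a (resolved, missing) tuple: `resolved` maps every parameter
    name to a string value, `missing` lists the names with no value.
    """
    resolved = {}
    for name in REQUIRED:
        raw = supplied.get(name)
        value = "" if raw is None else str(raw).strip()
        if not value:
            # Fall back to the documented default, if the table gives one.
            value = DEFAULTS.get(name, "")
        resolved[name] = value

    missing = [name for name in REQUIRED if not resolved[name]]

    # A port must be an integer in the valid TCP range.
    port = resolved["Chroma Server Port"]
    if not (port.isdigit() and 1 <= int(port) <= 65535):
        raise ValueError(f"Chroma Server Port is not a valid port: {port!r}")

    return resolved, missing


# Example with placeholder values; replace them with what you collected
# in Prerequisites.
resolved, missing = resolve_parameters({
    "Source S3 Bucket": "example-bucket",
    "Source S3 Path": "example/path",
})
print("Still to collect:", ", ".join(missing))
```

Leaving out "Chroma Server Port" and "OpenAI Model Name" is acceptable in this sketch because the table documents defaults for them. Every other parameter is reported as missing until you supply a value.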