List of required configuration parameters for the ADLS to Pinecone [Technical Preview] ReadyFlow

When deploying the ADLS to Pinecone [Technical Preview] ReadyFlow, you have to provide the following parameters. Use the information you collected in Prerequisites.

Table 1. ADLS to Pinecone [Technical Preview] ReadyFlow configuration parameters
Parameter name Description
CDP Workload User Specify the Cloudera machine user or workload username that you want to use to authenticate to the object stores. Ensure this user has the appropriate access rights to the object store locations in Ranger or IDBroker.
CDP Workload User Password Specify the password of the Cloudera machine user or workload user you are using to authenticate against the object stores.
CDPEnvironment The Cloudera Environment configuration resources.
OpenAI API Key Specify the API key used to authenticate to OpenAI.
OpenAI Model

Specify the OpenAI model name to use for embedding the data.

The default model is 'text-embedding-ada-002'.

Pinecone API Key Specify the API key used to authenticate to Pinecone.
Pinecone Environment Specify the name of the Pinecone environment.
Pinecone Index Specify the name of the Pinecone index. To ensure correct functionality of this ReadyFlow, make sure your Pinecone index has been created with the same dimension as the 'text-embedding-ada-002' model (1536).
Source ADLS File System Specify the name of the ADLS data container you want to read from. The full path will be constructed from: abfs://#{Source ADLS File System}@#{Source ADLS Storage Account}.dfs.core.windows.net/#{Source ADLS Path}
Source ADLS Path Specify the path within the ADLS data container where you want to read from without any leading characters. The full path will be constructed from: abfs://#{Source ADLS File System}@#{Source ADLS Storage Account}.dfs.core.windows.net/#{Source ADLS Path}
Source ADLS Storage Account Specify the source ADLS storage account name.