QueryPinecone

Description:

Queries Pinecone in order to gather a specified number of documents that are most closely related to the given query.

Tags:

pinecone, vector, vectordb, vectorstore, embeddings, ai, artificial intelligence, ml, machine learning, text, LLM

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Description |
|---|---|---|---|
| Embedding Model | Embedding Model | OpenAI Model | Specifies which embedding model to use to create embeddings from incoming Documents. The default model is OpenAI. |
| Pinecone API Key | Pinecone API Key | | The API Key used to authenticate with Pinecone. Sensitive Property: true |
| OpenAI API Key | OpenAI API Key | | The API Key for OpenAI, used to create embeddings. Sensitive Property: true |
| HuggingFace API Key | HuggingFace API Key | | The API Key for interacting with HuggingFace. Sensitive Property: true |
| Pinecone Environment | Pinecone Environment | | The name of the Pinecone Environment. This can be found in the Pinecone console next to the API Key. |
| Index Name | Index Name | | The name of the Pinecone index. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| Query | Query | | The text of the query to send to Pinecone. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| Number of Results | Number of Results | 10 | The number of results to return from Pinecone. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| Text Key | Text Key | text | The key in the document that contains the text to create embeddings for. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| Namespace | Namespace | | The name of the Pinecone Namespace to query. Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| Metadata Filter | Metadata Filter | | Optional metadata filter to apply with the query. For example: { "author": {"$eq": "john.doe"} } Supports Expression Language: true (will be evaluated using flow file attributes and Environment variables) |
| OpenAI Model | OpenAI Model | text-embedding-ada-002 | The name of the OpenAI model to use. |
| HuggingFace Model | HuggingFace Model | sentence-transformers/all-MiniLM-L6-v2 | The name of the HuggingFace model to use. |
| Output Strategy | Output Strategy | Row-Oriented | Specifies the format of the output. The output can contain only the text of the documents (each document separated by \n\n); it can be a single column-oriented JSON object consisting of the keys 'ids', 'embeddings', 'documents', 'distances', and 'metadatas'; or it can be row-oriented, with one JSON object per line, each consisting of a single id, document, metadata, embedding, and distance. |
| Results Field | Results Field | | If the input FlowFile is JSON-formatted, this is the name of the field into which the results are inserted. This allows the results to be added to an existing input in order to enrich it. If this property is unset, the results are written to the FlowFile contents, overwriting any pre-existing content. |
| Include Metadata | Include Metadata | true | Whether or not to include the Documents' Metadata in the response. |
| Include Distances | Include Distances | true | Whether or not to include the Documents' Distances (i.e., how far the Document was from the query) in the response. |
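
Example:

The column-oriented and row-oriented output strategies described above carry the same information in two different shapes. The following Python sketch illustrates the relationship by converting a column-oriented object into one JSON object per line. It is an illustration only, not the processor's implementation: the column keys come from the Output Strategy description, while the singular row keys (id, document, metadata, embedding, distance), the helper names, and the sample values are assumptions made for this example.

```python
import json

# Keys named in the Output Strategy description for the column-oriented form.
COLUMN_KEYS = ("ids", "documents", "metadatas", "embeddings", "distances")

# Assumed key names for each row; the description lists these fields only informally.
ROW_KEYS = ("id", "document", "metadata", "embedding", "distance")


def columns_to_rows(columns):
    """Turn one column-oriented result object into a list of per-document dicts."""
    count = len(columns.get("ids", []))
    rows = []
    for i in range(count):
        row = {}
        for column_key, row_key in zip(COLUMN_KEYS, ROW_KEYS):
            values = columns.get(column_key)
            # A column may be missing, for example when metadata or
            # distances are not included in the response.
            if values is not None:
                row[row_key] = values[i]
        rows.append(row)
    return rows


def rows_to_json_lines(rows):
    """Serialize the rows as one JSON object per line."""
    return "\n".join(json.dumps(row) for row in rows)


# Made-up sample data, for illustration only.
sample = {
    "ids": ["doc-1", "doc-2"],
    "documents": ["first document text", "second document text"],
    "metadatas": [{"author": "john.doe"}, {"author": "jane.doe"}],
    "distances": [0.12, 0.31],
}

rows = columns_to_rows(sample)
print(rows_to_json_lines(rows))
```

Each printed line can be parsed independently with json.loads, which is what makes the row-oriented form convenient for downstream processing that handles one document at a time.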