ReadyFlow overview: ADLS to Milvus [Technical Preview]

You can use the ADLS to Milvus [Technical Preview] ReadyFlow to consume PDF documents from ADLS, vectorize them using a HuggingFace model and write the results to Milvus.

This ReadyFlow consumes PDF documents from a source ADLS location, partitions the PDFs, chunks the data, vectorizes the data using a HuggingFace embedding model, and stores the results in Milvus vector DB. The default HuggingFace model is 'all-MiniLM-L12-v2'. A Milvus access token is required to run this flow. Define a KPI on the failure_WriteToMilvus connection to monitor failed write operations.

ADLS to Milvus [Technical Preview] ReadyFlow details
Source Cloudera managed ADLS
Source Format PDF
Destination Milvus
Destination Format Vector DB