Vector Database as a Big Data Analysis Tool for AI Agents
Workflows from the webinar "Build production-ready AI Agents with Qdrant and n8n".
This series of workflows shows how to build big data analysis tools for production-ready AI agents with the help of vector databases. These pipelines are adaptable to any dataset of images, hence, many production use cases.
- Uploading (image) datasets to Qdrant
- Set up meta-variables for anomaly detection in Qdrant
- Anomaly detection tool
- KNN classifier tool
For anomaly detection
- The first pipeline to upload an image dataset to Qdrant.
- The second pipeline is to set up cluster (class) centres & cluster (class) threshold scores needed for anomaly detection.
- The third is the anomaly detection tool, which takes any image as input and uses all preparatory work done with Qdrant to detect if it's an anomaly to the uploaded dataset.
For KNN (k nearest neighbours) classification
- The first pipeline to upload an image dataset to Qdrant.
- This pipeline is the KNN classifier tool, which takes any image as input and classifies it on the uploaded to Qdrant dataset.
To recreate both
You'll have to upload crops and lands datasets from Kaggle to your own Google Storage bucket, and re-create APIs/connections to Qdrant Cloud (you can use Free Tier cluster), Voyage AI API & Google Cloud Storage.
[This workflow] KNN classification tool
This tool takes any image URL, and as output, it returns a class of the object on the image based on the image uploaded to the Qdrant dataset (lands).
- An image URL is received via the Execute Workflow Trigger, which is then sent to the Voyage AI Multimodal Embeddings API to fetch its embedding.
- The image's embedding vector is then used to query Qdrant, returning a set of X similar images with pre-labeled classes.
- Majority voting is done for classes of neighbouring images.
- A loop is used to resolve scenarios where there is a tie in Majority Voting, and we increase the number of neighbours to retrieve.
- When the loop finally resolves, the identified class is returned to the calling workflow.