Vector database as a big data analysis tool for AI agents [3/3 - anomaly]

Created by

Jenny

Last update

Last update a month ago

Vector Database as a Big Data Analysis Tool for AI Agents

Workflows from the webinar "Build production-ready AI Agents with Qdrant and n8n".

This series of workflows shows how to build big data analysis tools for production-ready AI agents with the help of vector databases. These pipelines are adaptable to any dataset of images, hence, many production use cases.

For anomaly detection

The first pipeline to upload an image dataset to Qdrant.
The second pipeline is to set up cluster (class) centres & cluster (class) threshold scores needed for anomaly detection.
3. This is the third pipeline --- the anomaly detection tool, which takes any image as input and uses all preparatory work done with Qdrant to detect if it's an anomaly to the uploaded dataset.

For KNN (k nearest neighbours) classification

The first pipeline to upload an image dataset to Qdrant.
The second is the KNN classifier tool, which takes any image as input and classifies it on the uploaded to Qdrant dataset.

To recreate both

You'll have to upload crops and lands datasets from Kaggle to your own Google Storage bucket, and re-create APIs/connections to Qdrant Cloud (you can use Free Tier cluster), Voyage AI API & Google Cloud Storage.

[This workflow] Anomaly Detection Tool

This is the tool that can be used directly for anomalous images (crops) detection.
It takes as input (any) image URL and returns a text message telling if whatever this image depicts is anomalous to the crop dataset stored in Qdrant.

An Image URL is received via the Execute Workflow Trigger, which is used to generate embedding vectors using the Voyage AI Embeddings API.
The returned vectors are used to query the Qdrant collection to determine if the given crop is known by comparing it to threshold scores of each image class (crop type).
If the image scores lower than all thresholds, then the image is considered an anomaly for the dataset.