Loading JSON via FTP to Qdrant Vector Database Embedding Pipeline

Created by

Ghaith Alsirawan

Last update 4 months ago

JSON structure format for blog articles

{
  "id": "article_001",
  "title": "reseguider",
  "language": "sv",
  "tags": ["london", "resa", "info"],
  "source": "alltomlondon.se",
  "url": "https://...",
  "embedded_at": "2025-04-08T15:27:00Z",
  "chunks": [
    {
      "chunk_id": "article_001_01",
      "section_title": "Introduktion",
      "text": "Välkommen till London..."
    },
    ...
  ]
}
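
As a rough illustration of how a file in this format might be prepared for Qdrant, the sketch below flattens one article into one record per chunk and copies the article-level metadata into each payload. The helper name `article_to_points` and the payload layout are assumptions made for this example, not part of the template. Qdrant point IDs must be unsigned integers or UUIDs, so the string `chunk_id` is mapped to a deterministic UUID here and also kept in the payload. Vectors are left out; they would come from whichever embedding model the workflow uses.

```python
import json
import uuid


def article_to_points(article: dict) -> list[dict]:
    """Flatten one article into one record per chunk (hypothetical helper)."""
    # Article-level metadata that is copied into every chunk payload.
    shared = {
        "article_id": article["id"],
        "title": article["title"],
        "language": article["language"],
        "tags": article["tags"],
        "source": article["source"],
        "url": article["url"],
    }
    points = []
    for chunk in article["chunks"]:
        points.append({
            # Deterministic UUID: loading the same chunk again overwrites it
            # instead of creating a duplicate point.
            "id": str(uuid.uuid5(uuid.NAMESPACE_URL, chunk["chunk_id"])),
            "payload": {
                **shared,
                "chunk_id": chunk["chunk_id"],
                "section_title": chunk["section_title"],
                "text": chunk["text"],
            },
        })
    return points


# Sample input following the structure above, reduced to a single chunk.
sample = json.loads("""
{
  "id": "article_001",
  "title": "reseguider",
  "language": "sv",
  "tags": ["london", "resa", "info"],
  "source": "alltomlondon.se",
  "url": "https://...",
  "embedded_at": "2025-04-08T15:27:00Z",
  "chunks": [
    {"chunk_id": "article_001_01", "section_title": "Introduktion",
     "text": "Välkommen till London..."}
  ]
}
""")

points = article_to_points(sample)
print(points[0]["payload"]["chunk_id"])
```

Deriving the ID from `chunk_id` is one possible design choice: it keeps repeated loads of the same file idempotent, at the cost of tying point identity to the chunk naming scheme.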

🧰 Benefits

✅ Automated Vector Loading
Handles FTP → JSON → Qdrant in a hands-free pipeline.

✅ Clean Embedding Input
Supports pre-validated chunks with metadata: titles, tags, language, and article ID.

✅ AI-Ready Format
Perfect for Retrieval-Augmented Generation (RAG), semantic search, or assistant memory.

✅ Flexible Architecture
Modular and swappable: FTP can be replaced with Google Drive, Notion, or S3, and the embedding step can switch to local models served through Ollama.

✅ Community Friendly
This template helps others adopt best practices for loading data into a vector database and integrating it with LLMs.
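
The FTP → JSON → Qdrant flow and the swappable stages listed above can be sketched in plain Python. This is an illustrative outline, not the n8n workflow itself: `fetch_json`, `run_pipeline`, the file name `article_001.json`, and the `embed` and `store` callables are hypothetical names chosen for the example. `fetch_json` relies only on `retrbinary`, which `ftplib.FTP` in the standard library provides, so any other source that yields the same JSON could take its place. `embed` stands for whichever embedding model is used and `store` for the write to Qdrant.

```python
import io
import json
from typing import Callable


def fetch_json(ftp, remote_path: str) -> dict:
    """Download one JSON file through an ftplib.FTP-like connection."""
    buffer = io.BytesIO()
    # RETR streams the file in binary blocks; each block is appended to buffer.
    ftp.retrbinary(f"RETR {remote_path}", buffer.write)
    return json.loads(buffer.getvalue().decode("utf-8"))


def run_pipeline(
    article: dict,
    embed: Callable[[list[str]], list[list[float]]],
    store: Callable[[list[dict]], None],
) -> int:
    """Embed every chunk of one article and hand the records to `store`."""
    chunks = article["chunks"]
    vectors = embed([chunk["text"] for chunk in chunks])
    records = [
        {
            "vector": vector,
            "payload": {
                "article_id": article["id"],
                "language": article["language"],
                "tags": article["tags"],
                "chunk_id": chunk["chunk_id"],
                "text": chunk["text"],
            },
        }
        for chunk, vector in zip(chunks, vectors)
    ]
    store(records)
    return len(records)


# Stand-ins so the sketch runs without an FTP server, a model, or Qdrant.
class FakeFTP:
    def retrbinary(self, cmd, callback):
        doc = {
            "id": "article_001",
            "language": "sv",
            "tags": ["london"],
            "chunks": [
                {"chunk_id": "article_001_01", "text": "Välkommen till London..."}
            ],
        }
        callback(json.dumps(doc).encode("utf-8"))


def toy_embed(texts):
    # Not a real embedding: one number per text, only to keep shapes visible.
    return [[float(len(text))] for text in texts]


stored: list[dict] = []
article = fetch_json(FakeFTP(), "article_001.json")
count = run_pipeline(article, toy_embed, stored.extend)
print(count, stored[0]["payload"]["chunk_id"])
```

With a real server, an `ftplib.FTP` connection would replace `FakeFTP`, and `store` could wrap an upsert call from a Qdrant client. Because the source, the embedder, and the sink are passed in separately, each one can be exchanged without touching the other two.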
