Turn unstructured pitch decks and investment memos into polished Due Diligence PDF reports automatically. This n8n workflow handles everything from document ingestion to final delivery, combining internal document analysis with live web research to produce analyst-grade output in minutes.
Reviewing a single deal manually reading the deck, cross-checking claims online, formatting the summary easily takes half a day. Multiply that by 10–20 inbound deals per week, and your team is buried in low-leverage work before any real analysis begins.
This workflow compresses that cycle into a single automated pipeline.
Each deal gets a unique namespace in Pinecone, so documents are isolated and repeat uploads skip redundant parsing.
| Service | Role |
|---|---|
| n8n | Workflow orchestration |
| LlamaIndex Cloud | Document parsing (LlamaParse) |
| Pinecone | Vector storage & retrieval |
| OpenAI API | Embeddings (text-embedding-3-small) & LLM analysis (GPT-5.4) |
| Decodo API | Web search & page scraping |
| Cloudflare R2 | Report file storage (S3-compatible) |
| Symptom | Likely Fix |
|---|---|
| Parsing times out | Increase the Wait node duration; check file size against LlamaParse limits |
| Thin or generic analysis | Verify the source PDF is text-based, not a scanned image, enable OCR if needed |
| Broken PDF layout | Simplify CSS in the HTML render node; older Puppeteer builds handle basic layouts better |
Created by: Khmuhtadin
Category: Business Intelligence | Tags: AI, RAG, Due Diligence, Decodo