Back to Templates

Multi-modal expense tracking with GPT-4, Gemini OCR, and voice via Telegram

Created by

Created by: Oussama || oussama
Oussama

Last update

Last update 2 months ago

Categories

Share


This n8n template creates an intelligent expense tracking system ๐Ÿค– that processes text, voice, and receipt images through Telegram. The assistant automatically categorizes expenses, handles currency conversions ๐ŸŒ, and maintains financial records in Google Sheets while providing smart spending insights ๐Ÿ’ก.

Use Cases:

  • ๐Ÿ—ฃ๏ธ Personal expense tracking via Telegram chat
  • ๐Ÿงพ Receipt scanning and data extraction
  • ๐Ÿ’ฑ Multi-currency expense management
  • ๐Ÿ“‚ Automated financial categorization
  • ๐ŸŽ™๏ธ Voice-to-expense logging
  • ๐Ÿ“Š Daily/weekly/monthly spending analysis

How it works:

  1. Multi-Input Processing: Telegram trigger captures text messages, voice notes, and receipt images.
  2. Content Analysis: A Switch node routes different input types (text, audio, images) to appropriate processors.
  3. Voice Processing: ElevenLabs converts voice messages to text for expense extraction.
  4. Receipt OCR: Google Gemini analyzes receipt images to extract amounts and descriptions.
  5. Expense Classification: An LLM determines if the input is an expense or a general query.
  6. Expense Parsing: For multiple expenses, the AI splits and normalizes each item.
  7. Currency Conversion: An exchange rate API converts foreign currencies to USD.
  8. Smart Categorization: The AI agent assigns expenses to predefined categories with emojis.
  9. Data Storage: Google Sheets stores all expense records with automatic totals.
  10. Intelligent Responses: The agent provides spending summaries, alerts, and financial insights.

Requirements:

  • ๐ŸŒ Telegram Bot API access
  • ๐Ÿค– OpenAI, Gemini, or any other AI model
  • ๐Ÿ—ฃ๏ธ ElevenLabs API for voice processing
  • ๐Ÿ“ Google Sheets API access
  • ๐Ÿ’น Exchange rate API access

Good to know:

  • โš ๏ธ Daily spending alerts trigger when expenses exceed 100 USD.
  • ๐Ÿท๏ธ Supports 12 predefined expense categories with emoji indicators.
  • ๐Ÿ”„ Automatic currency detection and conversion to USD.
  • ๐ŸŽค Voice messages are processed through speech-to-text.
  • ๐Ÿ“ธ Receipt images are analyzed using computer vision.

Customizing this workflow:

  • โœ๏ธ Modify expense categories in the system prompt.
  • ๐Ÿ“ˆ Adjust spending alert thresholds.
  • ๐Ÿ’ต Change the base currency from USD to your preferred currency.
  • โœ… Add additional expense validation rules.
  • ๐Ÿ”— Integrate with other financial platforms.