This workflow creates a conversational bridge between Home Assistant and two chat front ends: Telegram and the n8n chat interface.
It allows users to control smart home devices or request information using natural language (text or voice).
⸻
🔑 Key Features
Multi-channel input: Works with both Telegram and n8n’s chat interface.
Voice support: Telegram voice messages are transcribed to text using OpenAI Whisper.
AI-driven assistant: Google Gemini processes queries in natural language.
Home Assistant integration: Uses MCP (Model Context Protocol) client tools to execute actions such as turning devices on or off, adjusting lights, or broadcasting messages.
Memory management: Short-term memory keeps context within conversations.
Smart reply routing: Responses are automatically sent back to the correct channel (Telegram or chat).
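The smart reply routing described above can be pictured as a small dispatch on the message's origin. The following is only a sketch, not the workflow's actual node configuration: the `Reply` class, the `route_reply` function, and the `"telegram"` / `"chat"` source labels are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    source: str      # hypothetical label: "telegram" or "chat"
    session_id: str  # Telegram chat ID or n8n chat session ID
    text: str        # the assistant's answer


def route_reply(reply: Reply) -> dict:
    """Describe where a reply should be delivered, based on its source."""
    if reply.source == "telegram":
        # Telegram replies are addressed by chat ID.
        return {"channel": "telegram", "chat_id": reply.session_id, "text": reply.text}
    if reply.source == "chat":
        # n8n chat replies go back to the originating session.
        return {"channel": "chat", "session_id": reply.session_id, "output": reply.text}
    raise ValueError(f"unknown source: {reply.source!r}")
```

Raising on an unknown source, rather than falling back to a default channel, keeps a misrouted reply from being silently delivered to the wrong user.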
⸻
⚙️ Workflow Nodes
Telegram Trigger: Captures incoming Telegram messages (text or voice).
Bot Is Typing: Sends a “typing…” action to indicate the bot is working.
Voice or Text: Checks whether the incoming Telegram message is a voice note or plain text and routes it down the matching branch.
Get Voice File → Speech to Text → Transcription to ChatInput: Handles Telegram voice notes by downloading the file, transcribing it, and preparing it for the chat pipeline.
When Chat Message Received: Captures messages from n8n’s built-in chat interface.
Process Messages: Normalizes incoming data (input text, source, session ID, voice flag).
Home Agent: Main AI agent that interprets each query and decides which Home Assistant tools to call.
Google Gemini Chat Model: Language model for intent understanding and conversation.
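To make the Process Messages step more concrete, here is a minimal sketch of the kind of normalization it performs, under stated assumptions. The function name `normalize` and the output keys `chatInput`, `source`, `sessionId`, and `isVoice` are illustrative and may not match the workflow's real field names. The Telegram payload is assumed to follow the Bot API's `message.chat.id`, `message.text`, and `message.voice` layout, and the chat payload is assumed to carry `chatInput` and `sessionId`.

```python
from typing import Any, Optional


def normalize(payload: dict[str, Any], transcript: Optional[str] = None) -> dict[str, Any]:
    """Map a Telegram or n8n chat payload onto one common shape."""
    message = payload.get("message")
    if message is not None:
        # Telegram branch: a voice note has a "voice" object and no text,
        # so its text comes from the transcription step instead.
        is_voice = "voice" in message
        text = transcript if is_voice else message.get("text", "")
        return {
            "chatInput": text or "",
            "source": "telegram",
            "sessionId": str(message["chat"]["id"]),
            "isVoice": is_voice,
        }
    # n8n chat branch: text and session ID arrive as top-level fields.
    return {
        "chatInput": payload.get("chatInput", ""),
        "source": "chat",
        "sessionId": str(payload.get("sessionId", "")),
        "isVoice": False,
    }
```

With both inputs reduced to the same four fields, the agent and the reply routing only ever need to handle one message shape.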