📱🤖 Create Stunning AI Images Directly from WhatsApp with Gemini
This workflow transforms your WhatsApp into a personal AI image generation studio. Simply send a text message with your idea, and this bot will use the advanced prompt engineering capabilities of Gemini 2.5 Pro to craft a detailed, artistic prompt. It then uses Gemini 2.0 Flash to generate a high-quality image from that prompt and sends it right back to your chat.
It's a powerful yet simple way to bring your creative ideas to life, all from the convenience of your favorite messaging app!
What this workflow does
- Listens for WhatsApp Messages: The workflow starts automatically when you send a message to your connected WhatsApp number.
- Enhances Your Idea with AI: It takes your basic text (e.g., "a knight on a horse") and uses Gemini 2.5 Pro to expand it into a rich, detailed prompt perfect for image generation (e.g., "A cinematic, full-body shot of a stoic knight in gleaming, ornate silver armor, riding a powerful black warhorse through a misty, ancient forest. The scene is lit by ethereal morning sunbeams piercing the canopy, creating dramatic volumetric lighting and long shadows. Photorealistic, 8K, ultra-detailed, award-winning fantasy concept art.").
- Generates a Unique Image: It sends this enhanced prompt to the Google Gemini 2.0 Flash image generation API.
- Prepares the Image: The API returns the image in Base64 format, and the workflow instantly converts it into a binary file.
- Sends it Back to You: The final, high-quality image is sent directly back to you in the same WhatsApp chat.
Nodes Used
- 🟢 WhatsApp Trigger: The entry point that listens for incoming messages.
- 🧠 LangChain Chain (LLM): Uses Gemini 2.5 Pro for advanced prompt engineering.
- ➡️ HTTP Request: Calls the Google Gemini 2.0 Flash API to generate the image.
- 🔄 Convert to File: Converts the Base64 image data into a sendable file format.
- 💬 WhatsApp: Sends the final image back to the user.
Prerequisites
To use this workflow, you will need:
- An n8n instance.
- A WhatsApp Business Account connected to n8n. You can find instructions on how to set this up in the n8n docs.
- A Google Gemini API Key. You can get one for free from Google AI Studio.
How to use this workflow
- Get your Google Gemini API Key: Visit the Google AI Studio and create a new API key.
- Configure the Gemini 2.5 Pro Node:
- In the n8n workflow, select the Gemini 2.5 Pro node.
- Under 'Connect your account', click 'Create New' to add a new credential.
- Paste your Gemini API key from the previous step and save.
- Configure the Generate Image (HTTP Request) Node:
- Select the Generate Image node.
- In the properties panel on the right, find the Query Parameters section.
- In the 'Value' field for the
key
parameter, replace "Your API Key"
with your actual Google Gemini API Key.
- Connect WhatsApp:
- Select the WhatsApp Trigger node.
- Follow the instructions to connect your WhatsApp Business Account credential. If you haven't created one, the node will guide you through the process.
- Activate and Test:
- Save the workflow using the button at the top right.
- Activate the workflow using the toggle switch.
- Send a message to your connected WhatsApp number (e.g., "A futuristic city in the style of Van Gogh"). The bot will process your request and send a stunning AI-generated image right back to you!