This n8n template demonstrates how to use AI to compose, or "stitch", separate images together into a new image that retains the source assets and keeps a consistent style.
Use cases are many: try producing storyboard scenes with consistent characters, marketing material built from existing product assets, or virtual try-ons of different fashion items!
Good to know
At the time of writing, each generated image costs $0.039 USD. See Gemini Pricing for updated info.
The model used in this workflow is geo-restricted! If it says model not found, it may not be available in your country or region.
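As a rough budgeting aid, the per-image price above can be turned into a quick estimate. This is a minimal sketch that assumes the $0.039 figure still holds and counts only generated images; the function name `estimate_cost` is hypothetical and not part of the template.

```python
from decimal import Decimal

# Price per generated image, taken from the note above (may be out of date).
PRICE_PER_IMAGE_USD = Decimal("0.039")

def estimate_cost(image_count: int) -> Decimal:
    """Return the estimated USD cost of generating `image_count` images."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    # Decimal avoids binary floating-point rounding in currency arithmetic.
    return PRICE_PER_IMAGE_USD * image_count

# Example: estimate the cost of a 10-scene storyboard.
print(estimate_cost(10))
```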
How it works
We import our required assets from cloud storage using the HTTP Request node.
The images are then converted to base64 strings and aggregated so we can pass them to our AI model.
Gemini's image generation model takes all 3 images and a prompt that we define. Our prompt instructs the model on how to compose the final image.
Gemini generates a new image using the original 3 assets. Consistency with the source images is very high and the output shows few signs of hallucination!
Gemini's output is base64 so we use a "Convert to file" node to convert the data to binary.
The final binary image is then uploaded to Google Drive to complete the demonstration.
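For readers who want to see the data flow outside n8n, the steps above can be sketched in Python. This is an illustration under assumptions, not the template's actual node configuration: the helper names are hypothetical, and the request and response field names (`contents`, `parts`, `inline_data`, `candidates`, `inlineData`) reflect the Gemini REST API as I understand it, so check them against the current Gemini documentation. No network call is made; the sketch only builds the request body and decodes a response.

```python
import base64

def to_inline_part(image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Encode one image as a base64 inline part (the base64 conversion step)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {"inline_data": {"mime_type": mime_type, "data": encoded}}

def build_request(prompt: str, images: list[bytes]) -> dict:
    """Aggregate the prompt and all images into a single request body."""
    parts = [{"text": prompt}] + [to_inline_part(img) for img in images]
    return {"contents": [{"parts": parts}]}

def extract_image(response: dict) -> bytes:
    """Decode the first base64 image part of a response back to binary
    (the role played by the "Convert to file" node)."""
    for part in response["candidates"][0]["content"]["parts"]:
        # Assumption: the key may appear in camelCase or snake_case.
        inline = part.get("inlineData") or part.get("inline_data")
        if inline:
            return base64.b64decode(inline["data"])
    raise ValueError("no image part found in response")

# Example with placeholder bytes standing in for the 3 downloaded assets.
assets = [b"asset-1", b"asset-2", b"asset-3"]
body = build_request("Compose these assets into one scene.", assets)
print(len(body["contents"][0]["parts"]))
```

The prompt is placed first and the images follow, so adding more source images only means appending more parts to the same request.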
How to use
The manual trigger node is used as an example but feel free to replace this with other triggers such as webhook or even a form.
Technically, you should be able to compose even more images but of course, the generation will take longer and cost more.
Requirements
Gemini account for LLM and image generation
Google Drive for upload
Customising this workflow
AI image editing can be used for many use cases. Try a popular one such as virtual try-on for fashion or applying branding to existing image assets.