Who is this for?
This workflow template is designed for AI enthusiasts, developers, and privacy-conscious users who want to leverage the power of local large language models (LLMs) without sending data to external services. It's particularly valuable for those running Ollama locally who want intelligent routing between different specialized models.
What problem is this workflow solving?
When working with multiple local LLMs, each with different strengths and capabilities, it can be challenging to manually select the right model for each specific task. This workflow automatically analyzes user prompts and routes them to the most appropriate specialized Ollama model, ensuring optimal performance without requiring technical knowledge from the end user.
What this workflow does
This intelligent router:
- Analyzes incoming user prompts to determine the nature of the request
- Automatically selects the optimal Ollama model from your local collection based on task requirements
- Routes requests between specialized models for different tasks:
- Text-only models (qwq, llama3.2, phi4) for various reasoning and conversation tasks
- Code-specific models (qwen2.5-coder) for programming assistance
- Vision-capable models (granite3.2-vision, llama3.2-vision) for image analysis
- Maintains conversation memory for consistent interactions
- Processes everything locally for complete privacy and data security
Setup
- Ensure you have Ollama installed and running locally
- Pull the required models mentioned in the workflow using Ollama CLI (e.g.,
ollama pull phi4
)
- Configure the Ollama API credentials in n8n (default: http://127.0.0.1:11434)
- Activate the workflow and start interacting through the chat interface
How to customize this workflow to your needs
- Add or remove models from the router's decision framework based on your specific Ollama collection
- Adjust the system prompts in the LLM Router to prioritize different model selection criteria
- Modify the decision tree logic to better suit your specific use cases
- Add additional preprocessing steps for specialized inputs
This workflow demonstrates how n8n can be used to create sophisticated AI orchestration systems that respect user privacy by keeping everything local while still providing intelligent model selection capabilities.