Search Google, Bing, Yandex & Extract Structured Results with Bright Data MCP & Google Gemini

Created by

Ranjan Dailata

Last update

Last update 23 days ago

Notice

Community nodes can only be installed on self-hosted instances of n8n.

Who this is for?

The Search Engine Intelligence Extractor is a powerful n8n automation that leverages Bright Data’s MCP based AI Agents to simulate human-like searches across Google, Bing, and Yandex, and then distills clean, structured insights using Google Gemini.

This workflow is tailored for:

SEO analysts researching competitors or market trends
Market researchers needing real-time search visibility
Journalists & content writers gathering contextual insights
AI developers creating intelligent assistants
Digital marketers tracking brand mentions or news

What problem is this workflow solving?

Traditional scraping of search engines is often blocked, cluttered, or filled with irrelevant information. Manually analyzing and cleaning this data for insight is time-consuming.

This workflow solves the problem by:

Simulating real user search behavior via Bright Data MCP based AI Agent
Performing multi-platform search (Google, Bing, Yandex) in one unified flow
Extracting clean, human-readable results (stripping ads, navigation, etc.)
Structuring the content using Google Gemini LLM
Automating delivery via Webhook or saving to disk

What this workflow does

Input Fields Node:

Accepts the search query
Accepts action for example - Perform a google search. Replace the action with bing, yandex etc. for other search providers
Accepts Webhook notification URL

Bright Data MCP Agent Execution:

Triggers Bright Data’s intelligent search agent
Handles search navigation, result loading, pagination

Human Readable Data Extractor:

Cleanses HTML, removes ads, footers, irrelevant links
Produces a readable narrative of results

Final Output Handling:

Saves the processed response to disk
Sends the structured data to a Webhook for real-time use

Pre-conditions

Knowledge of Model Context Protocol (MCP) is highly essential. Please read this blog post - model-context-protocol
You need to have the Bright Data account and do the necessary setup as mentioned in the Setup section below.
You need to have the Google Gemini API Key. Visit Google AI Studio
You need to install the Bright Data MCP Server @brightdata/mcp
You need to install the n8n-nodes-mcp

Setup

Please make sure to setup n8n locally with MCP Servers by navigating to n8n-nodes-mcp
Please make sure to install the Bright Data MCP Server @brightdata/mcp on your local machine.
Sign up at Bright Data.
Create a Web Unlocker proxy zone called mcp_unlocker on Bright Data control panel.
Navigate to Proxies & Scraping and create a new Web Unlocker zone by selecting Web Unlocker API under Scraping Solutions.
In n8n, configure the Google Gemini(PaLM) Api account with the Google Gemini API key (or access through Vertex AI or proxy).
In n8n, configure the credentials to connect with MCP Client (STDIO) account with the Bright Data MCP Server as shown below.

Make sure to copy the Bright Data API_TOKEN within the Environments textbox above as API_TOKEN=<your-token>

How to customize this workflow to your needs

Add Scheduled Execution

Add a Cron trigger to run this workflow on a set schedule (e.g., daily/weekly keyword tracking).

Push Results to Custom Destinations

Connect output to:

Google Sheets (for analytics or dashboards)
PostgreSQL or MySQL DBs (for structured storage)
Notion or Airtable (for content pipelines)
Slack or Email (for alerting teams)

Customize Webhook Notifications

Update the Webhook URL in the notification node to push processed results to external APIs, CRMs, or real-time dashboards.