Back to Templates

Search Google, Bing, Yandex & Extract Structured Results with Bright Data MCP & Google Gemini

Created by

Created by: Ranjan Dailata || ranjancse

Ranjan Dailata

Last update

Last update 23 days ago

Share


Perform Google, Bing, Yandex Search with Bright Data MCP Agent  Google Gemini.png

Notice

Community nodes can only be installed on self-hosted instances of n8n.

Who this is for?

The Search Engine Intelligence Extractor is a powerful n8n automation that leverages Bright Data’s MCP based AI Agents to simulate human-like searches across Google, Bing, and Yandex, and then distills clean, structured insights using Google Gemini.

This workflow is tailored for:

  • SEO analysts researching competitors or market trends

  • Market researchers needing real-time search visibility

  • Journalists & content writers gathering contextual insights

  • AI developers creating intelligent assistants

  • Digital marketers tracking brand mentions or news

What problem is this workflow solving?

Traditional scraping of search engines is often blocked, cluttered, or filled with irrelevant information. Manually analyzing and cleaning this data for insight is time-consuming.

This workflow solves the problem by:

  • Simulating real user search behavior via Bright Data MCP based AI Agent

  • Performing multi-platform search (Google, Bing, Yandex) in one unified flow

  • Extracting clean, human-readable results (stripping ads, navigation, etc.)

  • Structuring the content using Google Gemini LLM

  • Automating delivery via Webhook or saving to disk

What this workflow does

Input Fields Node:

  • Accepts the search query

  • Accepts action for example - Perform a google search. Replace the action with bing, yandex etc. for other search providers

  • Accepts Webhook notification URL

Bright Data MCP Agent Execution:

  • Triggers Bright Data’s intelligent search agent

  • Handles search navigation, result loading, pagination

Human Readable Data Extractor:

  • Cleanses HTML, removes ads, footers, irrelevant links

  • Produces a readable narrative of results

Final Output Handling:

  • Saves the processed response to disk

  • Sends the structured data to a Webhook for real-time use

Pre-conditions

  1. Knowledge of Model Context Protocol (MCP) is highly essential. Please read this blog post - model-context-protocol
  2. You need to have the Bright Data account and do the necessary setup as mentioned in the Setup section below.
  3. You need to have the Google Gemini API Key. Visit Google AI Studio
  4. You need to install the Bright Data MCP Server @brightdata/mcp
  5. You need to install the n8n-nodes-mcp

Setup

  1. Please make sure to setup n8n locally with MCP Servers by navigating to n8n-nodes-mcp
  2. Please make sure to install the Bright Data MCP Server @brightdata/mcp on your local machine.
  3. Sign up at Bright Data.
  4. Create a Web Unlocker proxy zone called mcp_unlocker on Bright Data control panel.
  5. Navigate to Proxies & Scraping and create a new Web Unlocker zone by selecting Web Unlocker API under Scraping Solutions.
  6. In n8n, configure the Google Gemini(PaLM) Api account with the Google Gemini API key (or access through Vertex AI or proxy).
  7. In n8n, configure the credentials to connect with MCP Client (STDIO) account with the Bright Data MCP Server as shown below.

MCPClientAccount.png

Make sure to copy the Bright Data API_TOKEN within the Environments textbox above as API_TOKEN=<your-token>

How to customize this workflow to your needs

Add Scheduled Execution

  • Add a Cron trigger to run this workflow on a set schedule (e.g., daily/weekly keyword tracking).

Push Results to Custom Destinations

Connect output to:

  • Google Sheets (for analytics or dashboards)

  • PostgreSQL or MySQL DBs (for structured storage)

  • Notion or Airtable (for content pipelines)

  • Slack or Email (for alerting teams)

Customize Webhook Notifications

  • Update the Webhook URL in the notification node to push processed results to external APIs, CRMs, or real-time dashboards.