Back to Templates
  • +8

Extract, Transform LinkedIn Data with Bright Data MCP Server & Google Gemini

Created by

Ranjan Dailata

Last update

Last update 22 days ago

Share

Disclaimer

This template is only available on n8n self-hosted as it's making use of the community node for MCP Client.

Extract, Transform LinkedIn Data with Bright Data.png

Who this is for?

The Extract, Transform LinkedIn Data with Bright Data MCP Server & Google Gemini workflow is an automated solution that scrapes LinkedIn content via Bright Data MCP Server then transforms the response using a Gemini LLM. The final output is sent via webhook notification and also persisted on disk.

This workflow is tailored for:​

  1. Data Analysts : Who require structured LinkedIn datasets for analytics and reporting.

  2. Marketing and Sales Teams : Looking to enrich lead databases, track company updates, and identify market trends.

  3. Recruiters and Talent Acquisition Specialists : Who want to automate candidate sourcing and company research.

  4. AI Developers : Integrating real-time professional data into intelligent applications.

  5. Business Intelligence Teams : Needing current and comprehensive LinkedIn data to drive strategic decisions.

What problem is this workflow solving?

Gathering structured and meaningful information from the web is traditionally slow, manual, and error-prone.

This workflow solves:

  1. Reliable web scraping using Bright Data MCP Server LinkedIn tools.

  2. LinkedIn person and company web scrapping with AI Agents setup with the Bright Data MCP Server tools.

  3. Data extraction and transformation with Google Gemini LLM.

  4. Persists the LinkedIn person and company info to disk.

  5. Performs a Webhook notification with the LinkedIn person and company info.

What this workflow does?

This n8n workflow performs the following steps:

  1. Trigger: Start manually.

  2. Input URL(s): Specify the LinkedIn person and company URL.

  3. Web Scraping (Bright Data): Use Bright Data's MCP Server, LinkedIn tools for the person and company data extract.

  4. Data Transformation & Aggregation: Uses the Google LLM for handling the data transformation.

  5. Store / Output: Save results into disk and also performs a Webhook notification.

Pre-conditions

  1. Knowledge of Model Context Protocol (MCP) is highly essential. Please read this blog post - model-context-protocol
  2. You need to have the Bright Data account and do the necessary setup as mentioned in the Setup section below.
  3. You need to have the Google Gemini API Key. Visit Google AI Studio
  4. You need to install the Bright Data MCP Server @brightdata/mcp
  5. You need to install the n8n-nodes-mcp

Setup

  1. Please make sure to setup n8n locally with MCP Servers by navigating to n8n-nodes-mcp
  2. Please make sure to install the Bright Data MCP Server @brightdata/mcp on your local machine.
  3. Sign up at Bright Data.
  4. Navigate to Proxies & Scraping and create a new Web Unlocker zone by selecting Web Unlocker API under Scraping Solutions.
  5. Create a Web Unlocker proxy zone called mcp_unlocker on Bright Data control panel.
  6. In n8n, configure the Google Gemini(PaLM) Api account with the Google Gemini API key (or access through Vertex AI or proxy).
  7. In n8n, configure the credentials to connect with MCP Client (STDIO) account with the Bright Data MCP Server as shown below.
    MCPClientAccount.png
    Make sure to copy the Bright Data API_TOKEN within the Environments textbox above as API_TOKEN=<your-token>.
  8. Update the LinkedIn URL person and company workflow.
  9. Update the Webhook HTTP Request node with the Webhook endpoint of your choice.
  10. Update the file name and path to persist on disk.

How to customize this workflow to your needs

  1. Different Inputs: Instead of static URLs, accept URLs dynamically via webhook or form submissions.

  2. Data Extraction: Modify the LinkedIn Data Extractor node with the suitable prompt to format the data as you wish.

  3. Outputs: Update the Webhook endpoints to send the response to Slack channels, Airtable, Notion, CRM systems, etc.