Back to Templates

Find High-Intent Sales Leads by Scraping Glassdoor with Bright Data & GPT

Created by

Created by: Yaron Been || yaron-nofluff

Yaron Been

Last update

Last update 2 months ago

Categories

Share


🔍 Scrape Glassdoor with Bright Data

Designed for sales teams, recruiters, and marketers aiming to automate job discovery and prospecting.

This workflow scrapes Glassdoor job listings using Bright Data and automatically generates targeted pitches using AI, streamlining lead identification and outreach.


🧩 How It Works

This automation leverages n8n, Bright Data, Google Sheets, and OpenAI:

1. Trigger

  • Starts with a custom form input (Location, Keyword, Country).

2. Bright Data Job Scrape

  • Triggers a Bright Data dataset snapshot via HTTP Request.
  • Polls snapshot progress using a Wait node, ensuring data readiness.
  • Retrieves full job listings dataset once ready.

3. Google Sheets Integration

  • Writes detailed job data (company, role, location, overview, metrics) into a Google Sheet.
  • Uses a pre-built template for organized data storage.

4. Automated Pitch Generation (AI)

  • Splits listings into actionable parts: company name, title, and description.
  • Sends data to OpenAI (via LangChain) to generate relevant pitches or icebreakers.
  • Saves generated content back into the same sheet for easy access.

✅ Requirements

Ensure you have the following:

Google Sheets

  • Google account
  • Template Sheet with columns for job details and AI-generated pitches

Bright Data

  • Active account with Dataset API access
  • API key and dataset ID

OpenAI

  • Valid OpenAI API key for GPT models

n8n Environment

  • Nodes: HTTP Request, Wait, If, Google Sheets, Split Out, LangChain (OpenAI)
  • Credentials:
    • Google Sheets OAuth2
    • Bright Data API credentials
    • OpenAI API key

⚙️ Setup Instructions

Step 1: Prepare Google Sheets

Step 2: Import & Configure Workflow in n8n

  • Import the workflow JSON file
  • Set Google Sheets node:
    • Link to your copied sheet
    • Confirm correct tab name

Step 3: Configure Bright Data

  • Replace <YOUR_BRIGHT_DATA_API_KEY> with your real key
  • Set your dataset ID in all HTTP Request nodes

Step 4: Configure OpenAI (LangChain)

  • Connect OpenAI API key to the LangChain node
  • Customize prompt to match tone and outreach style

Step 5: Testing & Scheduling

  • Test via manual form trigger
  • Schedule runs or leave form enabled for on-demand use

🧠 Tips & Best Practices

  • Use specific keywords and locations for better results
  • Adjust polling intervals based on dataset size
  • Refine AI prompts regularly to improve pitch quality
  • Clean unused columns from your sheet to boost performance

💬 Support & Feedback

For help or customization:

📧 Email: [email protected]
📺 YouTube: @YaronBeen
🔗 LinkedIn: linkedin.com/in/yaronbeen

📚 Bright Data Docs: docs.brightdata.com/introduction