Community nodes can only be installed on self-hosted instances of n8n.
This workflow extracts job descriptions and salary information from job listing pages in real time using Bright Data MCP, then analyzes the extracted content with OpenAI GPT-4o mini.
This workflow is ideal for:
Recruiters & HR Tech Startups: Automate job data collection from public listings
Market Intelligence Teams: Analyze compensation trends across companies or geographies
Job Boards & Aggregators: Power search results with structured, enriched listings
AI Workflow Builders: Extend to other career platforms or automate resume-job match analysis
Analysts & Researchers: Track hiring signals and salary benchmarks in real time
Traditional scraping of job portals can be challenging due to cluttered content, anti-scraping measures, and inconsistent formatting. Manually analyzing salary ranges and job descriptions is tedious and error-prone.
This workflow solves the problem by:
Simulating user behavior using Bright Data MCP Client to bypass anti-scraping systems
Extracting structured, clean job data in Markdown format
Using OpenAI GPT-4o mini to analyze and extract precise salary details and refined job descriptions
Merging and formatting the result for easy consumption
Delivering final output via webhook, Google Sheets, or file system
Components & Flow
Input Nodes
job_search_url: The job listing or search result URL
job_role: The title or role being searched for (used in logging/formatting)
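The two input fields can be checked before any scraping starts. The following is a minimal sketch in Python, not part of the workflow itself; the class name JobSearchInput and the function validate_input are hypothetical, and only the field names job_search_url and job_role come from the workflow.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class JobSearchInput:
    """The two input fields named in the workflow."""
    job_search_url: str
    job_role: str


def validate_input(raw: dict) -> JobSearchInput:
    """Hypothetical helper: reject obviously bad input before scraping."""
    url = str(raw.get("job_search_url", "")).strip()
    role = str(raw.get("job_role", "")).strip()

    # The URL must be absolute so the scraper knows which host to visit.
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"job_search_url must be an absolute http(s) URL, got {url!r}")
    if not role:
        raise ValueError("job_role must not be empty")

    return JobSearchInput(job_search_url=url, job_role=role)
```

A check like this could live in a Code node placed directly after the input node, so that a malformed URL fails fast instead of consuming a scraping request.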
MCP Client Operations
MCP Salary Data Extractor
Simulates browser behavior and scrapes salary-related content (if available)
MCP Job Description Extractor
Extracts full job description as structured Markdown content
OpenAI GPT-4o mini Nodes
Salary Information Extractor
Job Description Refiner
Company Information Extractor
Merge Node
Aggregate Node
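Conceptually, the Merge and Aggregate steps combine the outputs of the three GPT-4o mini branches into one result. The sketch below illustrates that idea in Python under stated assumptions: the key names salary_info, job_description, company_info, and data are hypothetical, and the real nodes are configured in the n8n editor rather than written as code.

```python
from typing import Any, Dict, List


def merge_branches(
    salary: Dict[str, Any],
    description: Dict[str, Any],
    company: Dict[str, Any],
) -> Dict[str, Any]:
    """Keep each branch under its own key so fields cannot overwrite each other."""
    return {
        "salary_info": salary,
        "job_description": description,
        "company_info": company,
    }


def aggregate(items: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Collapse a list of items into a single item holding that list."""
    return {"data": list(items)}
```

Nesting each branch under a separate key is a design choice for this sketch: it avoids collisions if two branches return a field with the same name.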
Final Output Handling
The output is delivered to three destinations, so you can use whichever fits your downstream needs:
Save to Disk
Google Sheet Update
Webhook Notification
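Two of these destinations can be sketched outside n8n with the Python standard library. This is an illustration only: the function names, file layout, and webhook URL are assumptions, and the request is built but deliberately not sent.

```python
import json
import urllib.request
from pathlib import Path
from typing import Any, Dict


def save_to_disk(result: Dict[str, Any], path: Path) -> Path:
    """Write the merged result as pretty-printed JSON, creating parent folders."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(result, indent=2, ensure_ascii=False), encoding="utf-8")
    return path


def build_webhook_request(result: Dict[str, Any], webhook_url: str) -> urllib.request.Request:
    """Prepare a JSON POST for a webhook; the caller decides when to send it."""
    body = json.dumps(result).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Sending the prepared request would be a call to urllib.request.urlopen(request) against a webhook URL you control.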
Make sure to enter your Bright Data API token in the Environments textbox of the MCP Client credentials, in the form API_TOKEN=<your-token>
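The Environments value uses the KEY=VALUE form. The sketch below shows how such a block can be read into a dictionary; it is an assumption about the format only, not a description of how the MCP Client node parses it, and parse_env_block is a hypothetical helper.

```python
from typing import Dict


def parse_env_block(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    env: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first "=" only, so values may themselves contain "=".
        key, sep, value = line.partition("=")
        if not sep or not key.strip():
            raise ValueError(f"expected KEY=VALUE, got {line!r}")
        env[key.strip()] = value.strip()
    return env
```

A missing "=" is the most common mistake when pasting a token, which is why the sketch raises an error instead of silently ignoring the line.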
Modify Input Source
Change the job_search_url to point to any job board or aggregator
Customize job_role to reflect the type of jobs being analyzed
Tweak LLM Prompts (Optional)
Change Output Format
Customize the merged object to output JSON, CSV, or Markdown based on downstream needs
Add additional destinations (e.g., Slack, Airtable, Notion) via n8n nodes
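The three output formats mentioned above can be produced from one flat record. The following is a minimal sketch in Python, assuming the merged object has already been flattened to simple key/value pairs; the field names in the record and the function names are hypothetical.

```python
import csv
import io
import json
from typing import Any, Dict


def to_json(record: Dict[str, Any]) -> str:
    """Serialize the record as indented JSON."""
    return json.dumps(record, indent=2, ensure_ascii=False)


def to_csv(record: Dict[str, Any]) -> str:
    """Serialize the record as a header row plus one data row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(record.keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerow(record)
    return buffer.getvalue()


def to_markdown(record: Dict[str, Any]) -> str:
    """Render the record as a heading followed by a bullet per field."""
    lines = [f"# {record.get('job_role', 'Job listing')}", ""]
    for key, value in record.items():
        if key == "job_role":
            continue  # already used as the heading
        lines.append(f"- **{key}**: {value}")
    return "\n".join(lines)
```

Nested values such as lists of benefits would need to be flattened or joined into strings before the CSV step, since a CSV cell holds a single value.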