Convert HTML to PDF & Extract Text from PDFs with CustomJS API

Created by: CustomJS

Last update 2 months ago

This n8n workflow illustrates how to convert HTML to a PDF and then extract the text from that PDF with the PDF Toolkit from www.customjs.space.

@custom-js/n8n-nodes-pdf-toolkit

Notice

Community nodes can only be installed on self-hosted instances of n8n.

What this workflow does

  • Converts the requested HTML to a PDF.
  • Uses a Code node to handle URLs that point to PDF files.
  • Extracts the text from the PDF.

Requirements

  • A self-hosted n8n instance.
  • A CustomJS API key for converting PDF to text.
  • HTML data to convert into a PDF.
  • A Code node for handling URLs that point to PDF files.

Workflow Steps:

  1. Manual Trigger:
    • Runs when the user starts the workflow manually.
  2. HTML to PDF:
    • Requests the HTML data.
    • Converts the HTML to a PDF.
  3. Convert PDF to Text:
    • Extracts the text from the generated PDF.
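
The data flow through steps 2 and 3 can be sketched in plain Python. This is only an illustration of the order of operations, not the CustomJS API: `html_to_text_pipeline`, `fake_html_to_pdf`, and `fake_pdf_to_text` are hypothetical names, and the two converters are offline stand-ins for the HTML to PDF and Convert PDF to Text nodes.

```python
from typing import Callable


def html_to_text_pipeline(
    html: str,
    html_to_pdf: Callable[[str], bytes],
    pdf_to_text: Callable[[bytes], str],
) -> str:
    """Mirror steps 2 and 3: render HTML to a PDF, then extract its text."""
    if not html.strip():
        raise ValueError("HTML input is empty")
    pdf_bytes = html_to_pdf(html)
    # A well-formed PDF begins with the "%PDF-" header, so this is a cheap sanity check.
    if not pdf_bytes.startswith(b"%PDF-"):
        raise ValueError("converter did not return a PDF")
    return pdf_to_text(pdf_bytes)


# Hypothetical stand-ins so the sketch runs offline; they are NOT the CustomJS API.
def fake_html_to_pdf(html: str) -> bytes:
    return b"%PDF-1.4\n" + html.encode("utf-8")


def fake_pdf_to_text(pdf: bytes) -> str:
    # Drop the header line added by fake_html_to_pdf and return the rest.
    return pdf.split(b"\n", 1)[1].decode("utf-8")
```

Passing the converters in as arguments keeps the sketch independent of any particular service; in the workflow itself, the two PDF Toolkit nodes play these roles.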

Usage

Get an API key from CustomJS

  • Sign up for the CustomJS platform.
  • Navigate to your profile page.
  • Press the "Show" button to reveal your API key.

Set credentials for the CustomJS API in n8n

Paste the API key you obtained from CustomJS into the CustomJS credentials in n8n.

Design workflow

  • A Manual Trigger to start the workflow.
  • HTTP Request nodes for downloading PDF files.
  • A Code node for handling URLs that point to PDF files.
  • A Convert PDF to Text node.
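
The template does not spell out what the Code node does with the URLs, so the following is a minimal sketch of one plausible check, written as plain Python. It assumes each item carries a `url` field and that a PDF link is an HTTP(S) URL whose path ends in `.pdf`; the names `is_pdf_url` and `filter_pdf_urls` are hypothetical.

```python
from urllib.parse import urlparse


def is_pdf_url(url: str) -> bool:
    """Return True if the URL is HTTP(S) and its path ends in .pdf."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        return False
    # Compare against the path only, so query strings such as ?download=1 are ignored.
    return parsed.path.lower().endswith(".pdf")


def filter_pdf_urls(items: list[dict]) -> list[dict]:
    """Keep only items whose assumed 'url' field points to a PDF file."""
    return [item for item in items if is_pdf_url(item.get("url", ""))]
```

A check based on the file extension is cheap but not exhaustive: a server may deliver a PDF from a URL without a `.pdf` suffix, in which case inspecting the response's content type after download would be more reliable.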

You can replace the logic for triggering the workflow and returning its results.
For example, to trigger the workflow through a webhook and receive the result in the webhook response, simply replace the Manual Trigger and Write to Disk nodes.
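
If the workflow is exposed through a webhook, a client could call it roughly as sketched below. The webhook URL, the `html` field in the JSON body, and the function names are assumptions for illustration; the real path and payload depend on how the Webhook node is configured.

```python
import json
import urllib.request


def build_webhook_request(webhook_url: str, html: str) -> urllib.request.Request:
    """Build a JSON POST request carrying the HTML to convert."""
    # The "html" key is an assumed payload shape, not one defined by the template.
    payload = json.dumps({"html": html}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call_webhook(webhook_url: str, html: str, timeout: float = 30.0) -> str:
    """Send the request and return the response body as text."""
    request = build_webhook_request(webhook_url, html)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

For example, `call_webhook("https://n8n.example.com/webhook/html-to-text", "<h1>Hello</h1>")` would return whatever the workflow sends back as the webhook response, where the URL shown is a placeholder.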