Transcribe long audio files beyond 25MB limit with FileFlows and OpenAI Whisper

Created by

Julien DEL RIO

Last update

Last update 3 days ago

Who's it for

This template is designed for content creators, podcasters, businesses, and researchers who need to transcribe long audio recordings that exceed OpenAI Whisper's 25 MB file size limit (~20 minutes of audio).

How it works

This workflow combines n8n, FileFlows, and OpenAI Whisper API to transcribe audio files of any length:

User uploads an MP3 file through a web form and provides an email address
n8n splits the file into 4 MiB chunks and uploads them to FileFlows
FileFlows uses FFmpeg to segment the audio into 15-minute chunks (safely under the 25 MB API limit)
Each segment is transcribed using OpenAI's Whisper API (configured for French by default)
All transcriptions are merged into a single text file
The complete transcription is automatically emailed to the user

Processing time: Typically 10-15 minutes for a 1-hour audio file.

Requirements

n8n instance (self-hosted or cloud)
FileFlows with Docker and FFmpeg installed
OpenAI API key (Whisper API access)
Gmail account for email delivery
Network access between n8n and FileFlows

Setup

Complete setup instructions, including FileFlows workflow import, credentials configuration, and storage setup, are provided in the workflow's sticky notes.

Cost

OpenAI Whisper API: $0.006 per minute. A 1-hour recording costs approximately $0.36.