Transcribe and Analyze Meetings or Podcasts with ElevenLabs and Google Gemini AI

Created by

Davide

Last update

Last update 3 months ago

Key Advantages

✅ Automation of Repetitive Tasks: Eliminates manual transcription, note-taking, and reporting.
✅ High-Accuracy Transcription: Powered by ElevenLabs, supporting multiple languages and long files.
✅ Structured Insights: AI-powered analysis (Gemini) transforms raw transcripts into professional summaries with action items and key decisions.
✅ Seamless Integration: Works with Google Sheets, Google Docs, and Gmail for easy workflow adoption.
✅ Scalable & Reusable: Handles multiple recordings in batch and integrates smoothly into existing processes.
✅ Improved Productivity: Teams save time by receiving clear, well-structured meeting recaps without effort.

The workflow operates in a linear sequence, processing one audio URL at a time from a Google Sheet.

Trigger & Data Fetch: The process starts manually. It first reads from a specified Google Sheet to get a list of audio file URLs that haven't been processed yet (marked with "DONE").
Transcription: For each audio URL, it sends a request to the ElevenLabs Speech-to-Text API, passing the cloud storage URL of the audio file.
Document Creation & Storage: Upon receiving the transcription text from ElevenLabs, the workflow:
- Creates a new Google Doc named after the transcription ID.
- Writes the full raw transcript text into this newly created document.
Analysis & Summary: The raw transcript text is passed to Google Gemini AI. The AI analyzes the text based on a detailed, pre-defined system prompt that instructs it to extract key points, decisions, action items, and insights, formatting the output as a structured summary.
Notification: The AI-generated analysis (converted from Markdown to HTML) is sent via Gmail as an email to a predefined address.
Status Update: Finally, the workflow updates the original row in the Google Sheet, marking it as "DONE" (with an 'x') and recording the ID of the newly created Google Doc for future reference. This ensures the same audio file is not processed again on the next run.

To configure this workflow, you need to complete the following steps:

Credentials Setup:
- ElevenLabs API: In the "Speech-to-Text" HTTP Request node, configure the HTTP Header Auth credentials. The header name should be xi-api-key and the value should be your valid ElevenLabs API key.
- Google Services: Ensure your Google Sheets, Google Docs, and Gmail nodes are connected to valid Google OAuth2 credentials with the necessary permissions to read, write, and send emails.
- Google Gemini AI: The "Transcript Analysis" node must be connected to valid Google Gemini (PaLM) API credentials.
Google Sheet Configuration:
- Make a copy of the provided Google Sheet template.
- In the "Get audio urls" node, update the documentId parameter to match the ID of your new Google Sheet.
- Similarly, update the documentId in the "Update row" node.
- Populate your sheet with the URLs of the audio/video files you want to process in the 'AUDIO URL' column.
Output Configuration:
- In the "Send email" node, change the sendTo parameter to the email address where you want to receive the analysis summaries.
- The "Create Doc" node specifies a folderId; change this to the ID of the specific Google Drive folder where you want the transcripts to be saved.

Contact me for consulting and support or add me on Linkedin.