This template is designed for podcasters, researchers, educators, product teams, and support teams who work with audio content and want to turn it into searchable knowledge. It is especially useful for users who need automated transcription, structured summaries, and conversational access to audio data.
This workflow starts with a public form where users upload an audio file.
The audio is sent to AssemblyAI for speech-to-text processing, including speaker labels and bullet-point summarization.
Once transcription is complete, the full text is converted into a document, split into chunks, and embedded using Google Gemini.
The embeddings are stored in a Pinecone vector database along with metadata, making the content retrievable for future use.
In parallel, the workflow logs uploaded file information into Google Sheets for tracking.
A separate chat trigger allows users to ask questions about the uploaded audio files.
An AI agent retrieves relevant context from Pinecone and responds using Gemini, enabling conversational search over audio transcripts.