Split Test Agent Prompts with Supabase and OpenAI
Use Case
It's often useful to test different settings for a large language model in production and compare them against various metrics. Split testing is a good method for doing this.
What it Does
This workflow randomly assigns each chat session to one of two prompts: the baseline or the alternative. The agent then uses the same prompt for every interaction in that chat session.
How it Works
- When a message arrives, the workflow checks a table that maps session IDs to prompt assignments to see whether the chat session already exists
- If it does not, the session ID is added to the table and a prompt is randomly assigned
- These values are then used to generate a response (see the sketch after this list)
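As an illustration of the logic above, here is a minimal sketch in TypeScript, assuming the supabase-js client and the split_test_sessions table described in Setup. The prompt constants stand in for the values configured in the Define Path Values node, and getPromptForSession is a hypothetical helper for illustration, not a node in the workflow itself.

```ts
import { createClient } from '@supabase/supabase-js';

// Assumed environment variables for the Supabase project.
const supabase = createClient(
  process.env.SUPABASE_URL!,
  process.env.SUPABASE_SERVICE_ROLE_KEY!
);

// Placeholders for the prompts set in the Define Path Values node.
const BASELINE_PROMPT = 'You are a helpful assistant.';
const ALTERNATIVE_PROMPT = 'You are a friendly, concise assistant.';

// Look up the session; if it is new, randomly assign an arm and persist
// the assignment so every later message in the session reuses it.
async function getPromptForSession(sessionId: string): Promise<string> {
  const { data, error } = await supabase
    .from('split_test_sessions')
    .select('show_alternative')
    .eq('session_id', sessionId)
    .maybeSingle();
  if (error) throw error;

  let showAlternative: boolean | undefined = data?.show_alternative;
  if (showAlternative === undefined) {
    showAlternative = Math.random() < 0.5; // 50/50 split
    const { error: insertError } = await supabase
      .from('split_test_sessions')
      .insert({ session_id: sessionId, show_alternative: showAlternative });
    if (insertError) throw insertError;
  }

  return showAlternative ? ALTERNATIVE_PROMPT : BASELINE_PROMPT;
}
```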
Setup
- Create a table in Supabase called split_test_sessions. It needs to have the following columns: session_id (text) and show_alternative (bool)
- Add your Supabase, OpenAI, and PostgreSQL credentials
- Modify the Define Path Values node to set the baseline and alternative prompt values
- Activate the workflow and test it by sending messages through n8n's built-in chat
- Experiment with different chat sessions to see both prompts in action (a sketch of the underlying completion call follows this list)
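Once a prompt has been chosen for the session, generating the reply is a standard chat completion call. Below is a sketch using the official openai Node SDK that reuses the hypothetical getPromptForSession helper from above; the model name is an assumption, since the workflow lets you pick any OpenAI chat model.

```ts
import OpenAI from 'openai';

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

// Answer a chat message using whichever prompt the session was assigned.
async function respond(sessionId: string, userMessage: string): Promise<string> {
  const systemPrompt = await getPromptForSession(sessionId);
  const completion = await openai.chat.completions.create({
    model: 'gpt-4o-mini', // assumed model; match whatever the workflow uses
    messages: [
      { role: 'system', content: systemPrompt },
      { role: 'user', content: userMessage },
    ],
  });
  return completion.choices[0].message.content ?? '';
}
```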
Next Steps
- Modify the workflow to test different LLM settings, such as temperature (see the sketch below)
- Add a method to measure the efficacy of the two prompts
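For the first next step, one way to generalize the split is to give each arm a bundle of settings rather than just a prompt. This is a hypothetical sketch, with the temperature values chosen purely for illustration:

```ts
// Hypothetical per-arm settings: vary temperature (or any other
// generation parameter) alongside or instead of the prompt.
interface ArmSettings {
  prompt: string;
  temperature: number;
}

const BASELINE_ARM: ArmSettings = {
  prompt: 'You are a helpful assistant.',
  temperature: 0.2,
};
const ALTERNATIVE_ARM: ArmSettings = {
  prompt: 'You are a helpful assistant.',
  temperature: 0.9,
};

// Pick the settings bundle for a session's assigned arm.
function settingsFor(showAlternative: boolean): ArmSettings {
  return showAlternative ? ALTERNATIVE_ARM : BASELINE_ARM;
}
```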