Block prompt injection before AI agents with GPT-5.5, Slack, and Google Sheets

Created by

Last update

Last update a month ago

How it works

The workflow receives text through a webhook and runs a zero-trust screening process before the input reaches a business agent. A Code node checks for deterministic attack patterns such as instruction overrides, role hijacking, prompt leakage attempts, hidden Unicode, SQL-style payloads, and XSS-style content. A second layer extracts URLs and applies local suspicious-domain heuristics. An isolated GPT-5.5 evaluator then classifies semantic risk, including prompt injection, data exfiltration intent, and social engineering. A scoring node combines the signals and routes low-risk input to a safe response while suspicious input is blocked, logged, and sent to Slack.

How to set up

Connect OpenAI, Slack, and Google Sheets credentials. Send untrusted input to the /firewall-check webhook before your main AI agent workflow.

Requirements

OpenAI access to GPT-5.5, Slack credentials, Google Sheets, and a webhook caller.

How to customize the workflow

Add your own attack patterns, tune risk thresholds, connect URLScan, VirusTotal, or Safe Browsing, and forward safe traffic to your production agent.