GPT-5.1 Chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on harder queries, improving accuracy on math, coding, and multi-step tasks without slowing down typical conversations. The model is warmer and more conversational by default, with better instruction following and more stable short-form reasoning. GPT-5.1 Chat is designed for high-throughput, interactive workloads where responsiveness and consistency matter more than deep deliberation.

Share

Model details

Context window128,000 tokens
Max completion size28 tokens
Prompt cost / 1K tokens$0.00000125
Completion cost / 1K tokens$0.00001
Accepts
Produces

Benchmark performance

Overall

82
score
4th
placement

Cost

91
score
9th
placement

Logic

84
score
5th
placement

Speed

97
score
3rd
placement

Scoring

26
score
18th
placement

Tool Use

51
score
4th
placement

Hallucination

83
score
7th
placement

Classification

50
score
1st
placement

Structured Output

92
score
2nd
placement

Pricing

Usage pricing
Prompt
$0.00000125
Completion
$0.00001
Request
FREE
Image
FREE
Web Search
$0.010
Internal Reasoning
FREE
Input Cache Read
FREE
Input Cache Write
FREE

Best Overall scoring LLMs

xAI

Grok 4 Fast

88
score
1st
placement
Qwen

Qwen3 VL 235B A22B Instruct

86
score
2nd
placement
xAI

Grok 4.1 Fast

84
score
3rd
placement
OpenAI

GPT-5.1 Chat

82
score
4th
placement
OpenAI

GPT-5.1-Codex

82
score
4th
placement
Anthropic

Claude Haiku 4.5

80
score
5th
placement