DeepSeek
DeepSeek V3.2

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments.

Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

Model details

Context window163,840 tokens

Max completion size51 tokens

Prompt cost / 1K tokens$0.00000028

Completion cost / 1K tokens$0.0000004

Usage pricing
Prompt	$0.00000028
Completion	$0.0000004
Request	FREE
Image	FREE
Web Search	FREE
Internal Reasoning	FREE
Input Cache Read	FREE
Input Cache Write	FREE

placement

Browse all LLMs

Model details

Benchmark performanceAll scores have maximum of 100 points.

Overall

Cost

Logic

Speed

Scoring

Tool Use

Hallucination

Classification

Structured Output

Pricing

Grok 4 Fast

Qwen3 VL 235B A22B Instruct

Grok 4.1 Fast

GPT-5.1 Chat

GPT-5.1-Codex

Claude Haiku 4.5

Benchmark performance