
DeepSeek v3.2

deepseek-v3.2
Price: $0.28 (input) • $0.40 (output)

DeepSeek-V3.2 is a large language model designed to balance high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving output quality in long-context scenarios. A scalable reinforcement-learning post-training framework further improves reasoning, with reported performance in the GPT-5 class and gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task-synthesis pipeline to integrate reasoning more tightly into tool-use settings, improving compliance and generalization in interactive environments.
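The page only names DSA without describing its internals, so the sketch below illustrates the general idea of fine-grained sparse attention rather than DeepSeek's actual implementation: each query attends to only a small, scored subset of keys instead of the full context. The scoring function, the `top_k` parameter, and all names here are illustrative assumptions.

```python
# Illustrative sketch of fine-grained sparse attention (NOT DeepSeek's DSA):
# a selection pass picks a small set of keys per query, then full softmax
# attention is computed only over that subset.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sparse_attention(q, k, v, top_k=64):
    """q: (Tq, d); k, v: (Tk, d). Each query attends to only `top_k` keys."""
    d = q.shape[-1]
    # Selection scores; a real system would use a scorer much cheaper than the
    # full dot product so that selection stays cheaper than dense attention.
    sel_scores = q @ k.T                                                   # (Tq, Tk)
    top_idx = np.argpartition(-sel_scores, top_k - 1, axis=-1)[:, :top_k]  # (Tq, top_k)

    out = np.empty_like(q)
    for i in range(q.shape[0]):
        idx = top_idx[i]
        scores = (q[i] @ k[idx].T) / np.sqrt(d)   # attention over selected keys only
        out[i] = softmax(scores) @ v[idx]
    return out

# Toy usage: 1,024 context tokens, each of 16 queries attends to 64 of them.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((n, 128)) for n in (16, 1024, 1024))
print(sparse_attention(q, k, v).shape)   # (16, 128)
```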

Context window: 163,800 tokens
Max output tokens: 65,500
Knowledge cutoff: —

Modalities

Text: input and output
Image: not supported
Audio: not supported
Video: not supported

Endpoints

Chat Completions: v1/chat/completions
Responses: v1/responses
Realtime: v1/realtime
Assistants: v1/assistants
Batch: v1/batch
Fine-tuning: v1/fine-tuning
Embeddings: v1/embeddings
Image generation: v1/images/generations
Videos: v1/videos
Image edit: v1/images/edits
Speech generation: v1/audio/speech
Transcription: v1/audio/transcriptions
Translation: v1/audio/translations
Moderation: v1/moderations
Completions (legacy): v1/completions
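Because the model is exposed through an OpenAI-style Chat Completions endpoint (v1/chat/completions above), a request can be sketched with the official `openai` Python client. The base URL and API-key environment variable below are placeholders for whichever provider hosts the model; only the model ID `deepseek-v3.2` comes from this page.

```python
# Minimal Chat Completions request sketch. base_url and the env var are
# placeholders for the hosting provider; only the model ID is from this page.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",   # placeholder: your provider's endpoint
    api_key=os.environ["PROVIDER_API_KEY"],  # placeholder env var
)

resp = client.chat.completions.create(
    model="deepseek-v3.2",
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Summarize what sparse attention buys you in two sentences."},
    ],
    max_tokens=512,  # must stay within the 65,500 max output tokens listed above
)
print(resp.choices[0].message.content)
```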

Features

Streaming: supported
Function calling: not supported
Structured outputs: not supported
Fine-tuning: not supported
Distillation: not supported
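Streaming is the only feature listed as supported, and with the same OpenAI-style client it only requires `stream=True` plus iterating over the returned chunks. The same placeholder caveats as the request sketch above apply.

```python
# Streaming sketch: text arrives incrementally as chunks are generated.
# Same placeholder base_url / API-key caveats as the request example above.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",   # placeholder provider endpoint
    api_key=os.environ["PROVIDER_API_KEY"],
)

stream = client.chat.completions.create(
    model="deepseek-v3.2",
    messages=[{"role": "user", "content": "Explain KV caching in one paragraph."}],
    stream=True,
)

for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:                        # some chunks carry role/stop info and no text
        print(delta, end="", flush=True)
print()
```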