Qwen 3 Max

qwen3-max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following and long-tail knowledge coverage.

Intelligence

Higher

Speed

Slow

Price

$2.50 • $10.00

Input • Output

Cache Price

$2.50 • $2.50

Read • Write

Input

Text

Output

Text

Start using Qwen 3 Max now

Support Subscription & Pay-as-you-go matching your needs in Studio & API

View Pricing

Specifications

Context Window

256K

Max Output Tokens

33K

Released Date

2025-12-15

Modalities

Text

Input and output

Image

Not supported

Audio

Not supported

Video

Not supported

Endpoints

OpenAI Compatible

Chat Completions

/v1/chat/completions

Responses

/v1/responses

Embeddings

/v1/embeddings

Image Generation

/v1/images/generations

Image Edit

/v1/images/edits

Videos

/v1/videos

Claude Compatible

Messages

/v1/messages

Gemini Compatible

Generate Content

/v1beta/models/{model}:{operator}

Performance Rankings

View Full Leaderboard

Code Arena

Gemini 3 Pro Image

820

Gemini 3.1 Flash Image

533

Qwen3 Max

🥇

Gemini 3 Pro Image

820

🥈

Gemini 3.1 Flash Image

533

🥉

Qwen 3 MaxThis Model

Chat Arena

Gemini 3.1 Flash Image

893

Gemini 3 Pro Image

889

GPQA

Claude Mythos Preview

94.6%

Gemini 3.1 Pro

94.3%

Claude Opus 4.7

94.2%

GPT-5.5

93.6%

GPT-5.2 Pro

93.2%

135

Gemini 2.0 Flash

62.1%

136

DeepSeek R1 Distill Qwen 32B

62.1%

137

Qwen3 MaxThis Model

62.0%

138

o1-mini

60.0%

139

Claude 3.5 Sonnet

59.4%

AIME 2025

Grok-4 Heavy

100.0%

GPT-5.2

100.0%

Gemini 3 Pro

100.0%

Kimi K2-Thinking-0905

100.0%

GPT-5.2 Pro

100.0%

Qwen3 VL 30B A3B Thinking

83.1%

Gemini 2.5 Pro

83.0%

Qwen3 MaxThis Model

81.6%

Qwen3 235B A22B

81.5%

GPT-5.5 Instant

81.2%

SWE-Bench

Claude Mythos Preview

93.9%

Claude Opus 4.7

87.6%

Claude Opus 4.5

80.9%

Claude Opus 4.6

80.8%

DeepSeek-V4-Pro-Max

80.6%

Claude 3.7 Sonnet

70.3%

LongCat-Flash-Thinking-2601

70.0%

Qwen3 MaxThis Model

69.6%

Qwen3-Coder 480B A35B Instruct

69.6%

MiniMax M2

69.4%

ARC-AGI v2

GPT-5.5

85.0%

Gemini 3.1 Pro

77.1%

GPT-5.4

73.3%

Claude Opus 4.6

68.8%

Claude Sonnet 4.6

58.3%

Metric Definitions

LLM

Code Arena: Average score across coding arenas based on human votes.
Chat Arena: Human preference score from blind comparisons.
GPQA: Graduate-level science questions requiring expert knowledge.
AIME 2025: Recent math competition problems.
SWE-Bench: Real GitHub issues requiring code changes.
ARC-AGI v2: Abstract reasoning problems.

Image

IMAGE GEN: Human preference score for text-to-image generation.
IMAGE EDIT: Human preference score for image editing and transformation.

Video

Text to Video: Human preference score for text-to-video generation.
Image to Video: Human preference score for image-to-video generation.
Video to Video: Human preference score for video editing capabilities.

TTS

TTS: Human preference score for text-to-speech quality.

STT

STT: Human preference score for transcription accuracy.

Ready to use Qwen 3 Max?

Subscription & Pay-as-you-go

Powerful API Access

High Cost-Effectiveness

Interactive Studio UI

View Pricing