Sora 2

sora-2

OpenAI’s new flagship model for video generation with synced audio.

Intelligence

Higher

Speed

Slow

Price

$0.1000

per second

Input

Text, Image

Output

Audio, Video

Start using Sora 2 now

Support Subscription & Pay-as-you-go matching your needs in Studio & API

View Pricing

Specifications

Context Window

66K

Max Output Tokens

32K

Announced Date

2024-09-01

Modalities

Text

Input only

Image

Input only

Audio

Output only

Video

Output only

Endpoints

OpenAI Compatible

Chat Completions

/v1/chat/completions

Responses

/v1/responses

Embeddings

/v1/embeddings

Image Generation

/v1/images/generations

Image Edit

/v1/images/edits

Videos

/v1/videos

Claude Compatible

Messages

/v1/messages

Gemini Compatible

Generate Content

/v1beta/models/{model}:{operator}

Performance Rankings

View Full Leaderboard

Text to Video

Grok Imagine Video

724

Veo 3.1

618

WAN Video 2.6

577

Sora 2 Pro

571

Sora 2

551

🥉

WAN Video 2.6

577

Sora 2 Pro

571

Sora 2This Model

551

Veo 3.0

495

Kling v2.5 Turbo Pro

487

Image to Video

Grok Imagine Video

2,634

Kling v2.5 Turbo Pro

2,591

Veo 3.1

2,521

Kling v2.6 Pro

2,490

Veo 3.0 Fast

2,486

Hailuo 2.3

2,280

SeeDance 1 Lite

2,240

Sora 2This Model

2,223

SeeDance 1 Pro Fast

2,166

Sora 2 Pro

1,988

Video to Video

Metric Definitions

LLM

Code Arena: Average score across coding arenas based on human votes.
Chat Arena: Human preference score from blind comparisons.
GPQA: Graduate-level science questions requiring expert knowledge.
AIME 2025: Recent math competition problems.
SWE-Bench: Real GitHub issues requiring code changes.
ARC-AGI v2: Abstract reasoning problems.

Image

IMAGE GEN: Human preference score for text-to-image generation.
IMAGE EDIT: Human preference score for image editing and transformation.

Video

Text to Video: Human preference score for text-to-video generation.
Image to Video: Human preference score for image-to-video generation.
Video to Video: Human preference score for video editing capabilities.

TTS

TTS: Human preference score for text-to-speech quality.

STT

STT: Human preference score for transcription accuracy.

Ready to use Sora 2?

Subscription & Pay-as-you-go

Powerful API Access

High Cost-Effectiveness

Interactive Studio UI

View Pricing