DeepSeek V4 Pro vs DeepSeek V4 Flash

DeepSeek V4 Pro from DeepSeek and DeepSeek V4 Flash from DeepSeek are shown side by side so you can compare pricing, model IDs, context and output limits, modalities, tool use, release details, and benchmark results where data is available.

DeepSeek V4 ProDeepSeek
DeepSeek V4 FlashDeepSeek

Basic Information

Name

DeepSeek V4 ProDeepSeek V4 Pro

Model id

DeepSeek V4 Prodeepseek-v4-pro

Intro

DeepSeek V4 ProDeepSeek V4 Pro is described as a large-scale Mixture-of-Experts model with 1.6T total parameters and 49B activated parameters, while keeping a 1M-token context window for very large inputs. Its model cards emphasize advanced reasoning, coding, and long-horizon agent workflows rather than simple chat. The Pro variant is the capability-oriented member of the V4 family, making it better suited to full-codebase analysis, large research synthesis, and multi-step automation where depth matters more than the lowest possible latency.

Author

DeepSeek V4 ProDeepSeek

Released date

DeepSeek V4 Pro2026-04-24

Name

DeepSeek V4 FlashDeepSeek V4 Flash

Model id

DeepSeek V4 Flashdeepseek-v4-flash

Intro

DeepSeek V4 FlashDeepSeek V4 Flash keeps the V4 family’s 1M-token context window but uses a lighter MoE configuration, commonly described as 284B total parameters with 13B activated parameters. The emphasis is throughput: fast inference, lower cost per call, and production workloads that still need long-context handling. It is the better fit when the task volume is high and the workload benefits from V4-style long-context architecture without always requiring the deepest reasoning tier.

Author

DeepSeek V4 FlashDeepSeek

Released date

DeepSeek V4 Flash2026-04-24

Pricing

Input

DeepSeek V4 Pro$1.8/M

Output

DeepSeek V4 Pro$3.5/M

Cached input

DeepSeek V4 Pro$0.015/M

Input

DeepSeek V4 Flash$0.15/M

Output

DeepSeek V4 Flash$0.3/M

Cached input

DeepSeek V4 Flash$0.003/M

Context & Output

Context length

DeepSeek V4 Pro1M

Max output tokens

DeepSeek V4 Pro384K

Context length

DeepSeek V4 Flash1M

Max output tokens

DeepSeek V4 Flash384K

Capabilities

Reasoning

DeepSeek V4 Pro

Knowledge

DeepSeek V4 Pro2025-05-01

Attachment

DeepSeek V4 Pro

Input modalities

DeepSeek V4 ProText

Output modalities

DeepSeek V4 ProText

Temperature

DeepSeek V4 Pro

Tool use

DeepSeek V4 Pro

Reasoning

DeepSeek V4 Flash

Knowledge

DeepSeek V4 Flash2025-05-01

Attachment

DeepSeek V4 Flash

Input modalities

DeepSeek V4 FlashText

Output modalities

DeepSeek V4 FlashText

Temperature

DeepSeek V4 Flash

Tool use

DeepSeek V4 Flash

Benchmark

Intelligence

wins

40.8

DeepSeek V4 Pro

40.3

DeepSeek V4 Flash

Coding

wins

43.2

DeepSeek V4 Pro

38.7

DeepSeek V4 Flash

DeepSeek V4 ProDeepSeek
DeepSeek V4 FlashDeepSeek

Knowledge & Reasoning

GPQA

wins90.5%

HLE

wins33.5%

GPQA

89.4%

HLE

32.1%

Coding

SciCode

wins46.4%

Terminal-Bench Hard

wins41.7%

SciCode

44.9%

Terminal-Bench Hard

35.6%

Instruction Following & Agent Tasks

IFBench

71.3%

AA-LCR

wins65%

Tau2

94.2%

IFBench

wins79.2%

AA-LCR

63%

Tau2

wins95.0%

FAQ

Which model is better, DeepSeek V4 Pro or DeepSeek V4 Flash?+

DeepSeek V4 Pro and DeepSeek V4 Flash should be compared by workload. This page places price, context, output limits, capabilities, and benchmark data side by side.

Is DeepSeek V4 Flash cheaper than DeepSeek V4 Pro?+

Use the pricing rows above to compare input, output, and cached input prices for DeepSeek V4 Pro and DeepSeek V4 Flash.

Which model supports a longer context length?+

DeepSeek V4 Pro lists 1M context length, while DeepSeek V4 Flash lists 1M.

Can I access both models through TokenHub?+

If both models are available in the TokenHub catalog, you can route requests to DeepSeek V4 Pro and DeepSeek V4 Flash through the TokenHub API.