DeepSeek V4 Pro vs DeepSeek V4 Flash

Name: DeepSeek V4 Pro vs DeepSeek V4 Flash benchmark dataset
Creator: TokenHub

DeepSeek V4 Pro from DeepSeek and DeepSeek V4 Flash from DeepSeek are shown side by side so you can compare pricing, model IDs, context and output limits, modalities, tool use, release details, and benchmark results where data is available.

DeepSeek V4 ProDeepSeek

DeepSeek V4 FlashDeepSeek

Basic Information

Name

DeepSeek V4 Pro

Model id

deepseek-v4-pro

Intro

DeepSeek V4 Pro is described as a large-scale Mixture-of-Experts model with 1.6T total parameters and 49B activated parameters, while keeping a 1M-token context window for very large inputs. Its model cards emphasize advanced reasoning, coding, and long-horizon agent workflows rather than simple chat. The Pro variant is the capability-oriented member of the V4 family, making it better suited to full-codebase analysis, large research synthesis, and multi-step automation where depth matters more than the lowest possible latency.

Author

DeepSeek

Released date

2026-04-24

Name

DeepSeek V4 Flash

Model id

deepseek-v4-flash

Intro

DeepSeek V4 Flash keeps the V4 family’s 1M-token context window but uses a lighter MoE configuration, commonly described as 284B total parameters with 13B activated parameters. The emphasis is throughput: fast inference, lower cost per call, and production workloads that still need long-context handling. It is the better fit when the task volume is high and the workload benefits from V4-style long-context architecture without always requiring the deepest reasoning tier.

Author

DeepSeek

Released date

2026-04-24

Pricing

Input

$1.8/M

Output

$3.5/M

Cached input

$0.015/M

Input

$0.15/M

Output

$0.3/M

Cached input

$0.003/M

Context & Output

Context length

Max output tokens

384K

Context length

Max output tokens

384K

Capabilities

Reasoning

Knowledge

2025-05-01

Attachment

Input modalities

Text

Output modalities

Text

Temperature

Tool use

Reasoning

Knowledge

2025-05-01

Attachment

Input modalities

Text

Output modalities

Text

Temperature

Tool use

Benchmark

Intelligence

wins

40.8

DeepSeek V4 Pro

40.3

DeepSeek V4 Flash

Coding

wins

43.2

DeepSeek V4 Pro

38.7

DeepSeek V4 Flash

DeepSeek V4 ProDeepSeek

DeepSeek V4 FlashDeepSeek

Knowledge & Reasoning

GPQA

wins90.5%

HLE

wins33.5%

GPQA

89.4%

HLE

32.1%

Coding

SciCode

wins46.4%

Terminal-Bench Hard

wins41.7%

SciCode

44.9%

Terminal-Bench Hard

35.6%

Instruction Following & Agent Tasks

IFBench

71.3%

AA-LCR

wins65%

Tau2

94.2%

IFBench

wins79.2%

AA-LCR

63%

Tau2

wins95.0%

FAQ

Which model is better, DeepSeek V4 Pro or DeepSeek V4 Flash?+

DeepSeek V4 Pro and DeepSeek V4 Flash should be compared by workload. This page places price, context, output limits, capabilities, and benchmark data side by side.

Is DeepSeek V4 Flash cheaper than DeepSeek V4 Pro?+

Use the pricing rows above to compare input, output, and cached input prices for DeepSeek V4 Pro and DeepSeek V4 Flash.

Which model supports a longer context length?+

DeepSeek V4 Pro lists 1M context length, while DeepSeek V4 Flash lists 1M.

Can I access both models through TokenHub?+

If both models are available in the TokenHub catalog, you can route requests to DeepSeek V4 Pro and DeepSeek V4 Flash through the TokenHub API.