POST /v1/chat/completionsDeepSeek V4 Pro
deepseek-v4-proDeepSeek V4 Pro is described as a large-scale Mixture-of-Experts model with 1.6T total parameters and 49B activated parameters, while keeping a 1M-token context window for very large inputs. Its model cards emphasize advanced reasoning, coding, and long-horizon agent workflows rather than simple chat. The Pro variant is the capability-oriented member of the V4 family, making it better suited to full-codebase analysis, large research synthesis, and multi-step automation where depth matters more than the lowest possible latency.
Total Context
1Mtokens
Max Output
384Ktokens
Released
Apr 24, 2026
Modalities
DeepSeek V4 Pro Price
| Input Price | Output Price | Cache Read |
|---|---|---|
| $1.8/M | $3.5/M | $0.015/M |
DeepSeek V4 Pro API
DeepSeek V4 Pro Benchmark
40.8
/100
Artificial Analysis Intelligence Index
Artificial Analysis broad capability aggregate
Index score
43.2
/100
Artificial Analysis Coding Index
Artificial Analysis software task aggregate
Index score
Knowledge & Reasoning
GPQA
Advanced science problem solving
90.5%
HLE
Broad expert-level exam set
33.5%
Coding & Engineering
SciCode
Scientific coding challenges
46.4%
Terminal-Bench Hard
Hard terminal task execution
41.7%
Instruction Following & Agent Tasks
IFBench
Prompt constraint adherence
71.3%
AA-LCR
Long-context reasoning
65%
τ²-Bench
Agent workflow tasks
94.2%
Metrics sourced from Artificial Analysis
Model Comparison
DeepSeek V4 Pro FAQ
DeepSeek V4 Pro: capabilities, use cases, limits, and TokenHub guidance.
What is DeepSeek V4 Pro?+
DeepSeek V4 Pro is a DeepSeek model for flagship reasoning, coding, and agent work.
Which workloads suit DeepSeek V4 Pro?+
Best for complex coding, agent workflows and long-context analysis, especially when maximum answer quality is the priority.
Which feature stands out?+
Key strength: top-tier reasoning and agentic coding within DeepSeek V4 and switchable thinking and non-thinking modes.
When should teams avoid DeepSeek V4 Pro?+
It uses more compute, so latency and cost can be higher. For speed and cost efficiency, consider DeepSeek V4 Flash.
What should I verify in TokenHub?+
Use the exact ID shown by TokenHub; follow your account docs and verify current features.
Media and Discussions
Selected public videos and posts related to this model.
X (Twitter)
Reddit
YouTube