POST /v1/chat/completionsGPT-4.1 Mini
gpt-4.1-miniGPT-4.1 Mini brings the GPT-4.1 family’s coding and instruction-following improvements into a faster, lower-cost form. It is suitable for high-volume developer tools, structured generation, extraction, and product features that do not require the full model. The main distinction is production efficiency while retaining the 4.1 generation’s task discipline.
Total Context
1Mtokens
Max Output
32.8Ktokens
Released
Apr 14, 2025
Modalities
GPT-4.1 Mini Price
| Input Price | Output Price | Cache Read |
|---|---|---|
| $0.4/M | $1.6/M | $0.1/M |
GPT-4.1 Mini API
GPT-4.1 Mini Benchmark
GPT-4.1 mini
16.3
/100
Artificial Analysis Intelligence Index
Artificial Analysis broad capability aggregate
Index score
18.5
/100
Artificial Analysis Coding Index
Artificial Analysis software task aggregate
Index score
46.3
/100
Artificial Analysis Math Index
Artificial Analysis math reasoning aggregate
Index score
Knowledge & Reasoning
MMLU-Pro
Advanced multi-task knowledge
78.1%
GPQA
Advanced science problem solving
66.4%
HLE
Broad expert-level exam set
4.6%
Coding & Engineering
LiveCodeBench
Live coding problems
48.3%
SciCode
Scientific coding challenges
40.4%
Terminal-Bench Hard
Hard terminal task execution
7.6%
Math
MATH-500
Advanced math problem solving
92.5%
AIME
Competition math problems
43%
AIME 2025
Competition math problems
46.3%
Instruction Following & Agent Tasks
IFBench
Prompt constraint adherence
38.3%
AA-LCR
Long-context reasoning
42.3%
τ²-Bench
Agent workflow tasks
52.9%
Metrics sourced from Artificial Analysis
Frequently asked questions about GPT-4.1 Mini
Understand what GPT-4.1 Mini is, its best uses, distinguishing strengths, practical tradeoffs, and safe TokenHub integration guidance.
How should developers understand the role of GPT-4.1 Mini?+
GPT-4.1 Mini is a smaller, faster GPT-4.1-family model for efficient instruction following and tool-enabled applications. It has been retired from ChatGPT, while API availability may remain; check TokenHub’s current listing.
When does GPT-4.1 Mini deliver the most practical value?+
Best-fit scenarios include high-volume application requests, strict instruction following, and tool-enabled application workflows. Test representative inputs and define measurable acceptance criteria before production.
What are the most useful characteristics of GPT-4.1 Mini?+
Key strengths include fast response times, cost-efficient scaling, and strong handling of long context. This combination is especially useful for strict instruction following.
What are the practical limits of GPT-4.1 Mini?+
Consider another model when the task requires the provider’s strongest reasoning capability, quality matters more than speed or cost, or the workflow cannot include human review for important decisions. Verify important factual, legal, financial, medical, or operational outputs with qualified human review.
How should developers call GPT-4.1 Mini through TokenHub?+
In TokenHub, select the exact model identifier displayed for GPT-4.1 Mini, use the endpoint documented for your account, and authenticate with your TokenHub credentials. Confirm whether the TokenHub entry exposes the input types, tool behavior, and output controls your application needs.
Media and Discussions
Selected public videos and posts related to this model.
X (Twitter)
Reddit
YouTube