GPT-4o

gpt-4o

GPT-4o is OpenAI’s multimodal flagship from the GPT-4o generation, built for text and image input with strong general intelligence. Official docs describe it as a versatile high-intelligence model suitable for a broad range of language and vision tasks. It remains useful where multimodal understanding and natural interaction matter more than the newest reasoning stack.

Total Context

128Ktokens

Max Output

16.4Ktokens

Released

May 13, 2024

Modalities

GPT-4o Price

Input PriceOutput PriceCache Read
$2.5/M$10/M$1.25/M

GPT-4o API

POST /v1/messages

GPT-4o Benchmark

9.6

/100

Artificial Analysis Intelligence Index

Artificial Analysis broad capability aggregate

Index score

16.6

/100

Artificial Analysis Coding Index

Artificial Analysis software task aggregate

Index score

Knowledge & Reasoning

GPQA

Advanced science problem solving

52.1%

HLE

Broad expert-level exam set

2.9%

Coding & Engineering

LiveCodeBench

Live coding problems

31.7%

SciCode

Scientific coding challenges

33.1%

Terminal-Bench Hard

Hard terminal task execution

8.3%

Math

MATH-500

Advanced math problem solving

79.5%

AIME

Competition math problems

11.7%

Instruction Following & Agent Tasks

IFBench

Prompt constraint adherence

36.0%

AA-LCR

Long-context reasoning

35%

τ²-Bench

Agent workflow tasks

28.9%

Metrics sourced from Artificial Analysis

Media and Discussions

Selected public videos and posts related to this model.

X (Twitter)

View post on X
View post on X
View post on X

Reddit

YouTube

Watch on YouTube
Watch on YouTube
Watch on YouTube

Frequently asked questions about GPT-4o

Understand what GPT-4o is, its best uses, distinguishing strengths, practical tradeoffs, and safe TokenHub integration guidance.

What kind of model is GPT-4o?+

GPT-4o is OpenAI’s earlier omni model for general text and visual understanding, now positioned as an older API option. It has been retired from ChatGPT, while API availability may remain; check TokenHub’s current listing.

What should teams use GPT-4o for?+

Best-fit scenarios include analysis of text and visual inputs, responsive interactive assistants, and general-purpose content generation. Test representative inputs and define measurable acceptance criteria before production.

Where does GPT-4o have a clear technical advantage?+

Key strengths include combined text and image understanding, broad general-purpose capability, and responsive conversational behavior. This combination is especially useful for responsive interactive assistants.

When should a team choose another model instead of GPT-4o?+

Consider another model when a new build should use the provider’s current recommended generation, the task needs a dedicated reasoning model, or the workflow cannot include human review for important decisions. Verify important factual, legal, financial, medical, or operational outputs with qualified human review.

What should be checked before integrating GPT-4o with TokenHub?+

In TokenHub, select the exact model identifier displayed for GPT-4o, use the endpoint documented for your account, and authenticate with your TokenHub credentials. Check the current TokenHub documentation for supported text and image inputs, because platform exposure can differ from the provider’s full model capabilities.