GLM-4.5

glm-4.5

GLM-4.5 is an agent-oriented MoE model from Z.ai, described with 355B total parameters and 32B activated parameters. Official docs highlight reasoning, coding, tool use, and browser-style agent abilities, with both thinking and non-thinking modes. It works well as the GLM line’s earlier agent foundation model before GLM-5.

Total Context

131.1Ktokens

Max Output

98.3Ktokens

Released

Jul 28, 2025

Modalities

GLM-4.5 Price

Input PriceOutput Price
$0.4286/M$2/M

GLM-4.5 API

POST /v1beta/models/{model}:generateContent

GLM-4.5 Benchmark

GLM-4.5 (Reasoning)

19.5

/100

Artificial Analysis Intelligence Index

Artificial Analysis broad capability aggregate

Index score

26.3

/100

Artificial Analysis Coding Index

Artificial Analysis software task aggregate

Index score

73.7

/100

Artificial Analysis Math Index

Artificial Analysis math reasoning aggregate

Index score

Knowledge & Reasoning

MMLU-Pro

Advanced multi-task knowledge

83.5%

GPQA

Advanced science problem solving

78.2%

HLE

Broad expert-level exam set

12.2%

Coding & Engineering

LiveCodeBench

Live coding problems

73.8%

SciCode

Scientific coding challenges

34.8%

Terminal-Bench Hard

Hard terminal task execution

22.0%

Math

MATH-500

Advanced math problem solving

97.9%

AIME

Competition math problems

87.3%

AIME 2025

Competition math problems

73.7%

Instruction Following & Agent Tasks

IFBench

Prompt constraint adherence

44.1%

AA-LCR

Long-context reasoning

48.3%

τ²-Bench

Agent workflow tasks

43.0%

Metrics sourced from Artificial Analysis

Media and Discussions

Selected public videos and posts related to this model.

X (Twitter)

View post on X
View post on X
View post on X

Reddit

YouTube

Watch on YouTube
Watch on YouTube
Watch on YouTube

GLM-4.5 FAQ

GLM-4.5: capabilities, use cases, limits, and TokenHub guidance.

What is GLM-4.5?+

GLM-4.5 is a Z.AI model for reasoning, coding, and native agent workflows.

Which workloads suit GLM-4.5?+

Best for code reasoning, agent workflows and tool-heavy automation, especially when deep reasoning is the priority.

Which feature stands out?+

Key strength: a unified focus on reasoning, coding, and native agents.

When should teams avoid GLM-4.5?+

It belongs to an older generation and may lack newer capabilities. For the latest capabilities matter, consider GLM-5.

What should I verify in TokenHub?+

Confirm TokenHub availability; prefer the current successor for new work.