No Result

View All Result

No Result

View All Result

No Result

View All Result

GLM-4.5 (Reasoning)

← AI Models

Z AI

2025-07-28

MIT

355B params

Modality:

Intelligence

26.4

#199/532

Coding

26.3

#234/86

Math

73.7

#84/265

Speed

51 tok/s

TTFT: 778.00s

Pricing

$0.49 / $1.90

per 1M tokens (in/out)

GLM-4.5 is an Agentic, Reasoning, and Coding (ARC) foundation model designed for intelligent agents, featuring 355 billion total parameters with 32 billion active parameters using MoE architecture. Trained on 23T tokens through multi-stage training, it is a hybrid reasoning model that provides two modes: thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. The model unifies agentic, reasoning, and coding capabilities with 128K context length support. It achieves exceptional performance with a score of 63.2 across 12 industry-standard benchmarks, placing 3rd among all proprietary and open-source models. Released under MIT open-source license allowing commercial use and secondary development.

GLM-4.5 (Reasoning) is Z AI’s model designed for advanced reasoning tasks. It processes at 51.229 tokens per second and is priced at $0.49 per million input tokens, targeting professional users.

When to Use GLM-4.5 (Reasoning)

✓ Best For

Complex reasoning tasks
Mathematical problem solving
Coding assistance

✗ Not Ideal For

Basic conversational applications
High-speed real-time processing

How GLM-4.5 (Reasoning) Compares

Intelligence Index · Higher is better

AlibabaAmazonZ AIOpenAIKimi

Benchmark Profile

Output Speed · tok/s

AlibabaZ AI

Math Index

AnthropicAlibabaZ AIAllen Institute for AI

Intelligence · Coding · Math

Intelligence Coding Math

All Benchmark Scores (27)

Benchmark	Score
Intelligence Index	26.4
Coding Index	26.3
Math Index	73.7
MMLU-Pro	835%
GPQA	782%
LiveCodeBench	738%
HLE	122%
SWE-Bench Verified	64.2%
SciCode	34.8%
IFBench	44.1%
LCR	48.3%
TerminalBench Hard	22%
Tau2	43%
AIME	87.3%
AIME 2025	73.7%
MATH 500	97.9%
AA-Index	67.7%
AIME 2024	91%
BFCL-v3	77.8%
BrowseComp	26.4%
Humanity's Last Exam	14.4%
MATH-500	98.2%
TAU-bench Airline	60.4%
TAU-bench Retail	79.7%
Terminal-Bench	37.5%
Arena Chat	21.4
Arena Coding	7.4

Data: Artificial Analysis · Updated: April 10, 2026

Frequently Asked Questions (15)

When was GLM-4.5 (Reasoning) released?

GLM-4.5 (Reasoning) was released on July 28, 2025.

Who created GLM-4.5 (Reasoning)?

GLM-4.5 (Reasoning) was created by Z AI.

How intelligent is GLM-4.5 (Reasoning)?

GLM-4.5 (Reasoning) scores 26 on the Artificial Analysis Intelligence Index, placing it below average among other open weight models of similar size (median: 27).

How fast is GLM-4.5 (Reasoning)?

GLM-4.5 (Reasoning) generates output at 46.4 tokens per second (based on the median across providers serving the model), which is below average compared to other open weight models of similar size (median: 57.7 t/s).

What is the latency of GLM-4.5 (Reasoning)?

GLM-4.5 (Reasoning) has a time to first token (TTFT) of 2.87s (based on the median across providers serving the model), which is at the higher end compared to other open weight models of similar size (median: 2.17s).

How much does GLM-4.5 (Reasoning) cost?

GLM-4.5 (Reasoning) costs $0.49 per 1M input tokens (better than average, median: $0.60) and $1.90 per 1M output tokens (better than average, median: $2.20), based on the median across providers serving the model.

What is GLM-4.5 (Reasoning) API pricing?

GLM-4.5 (Reasoning) costs $0.49 per 1M input tokens and $1.90 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $0.84 per 1M tokens. Pricing may vary by provider.

How verbose is GLM-4.5 (Reasoning)?

When evaluated on the Intelligence Index, GLM-4.5 (Reasoning) generated 61M output tokens, which is somewhat higher than average compared to other open weight models of similar size (median: 39M).

Is GLM-4.5 (Reasoning) a reasoning model?

Yes, GLM-4.5 (Reasoning) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

What input modalities does GLM-4.5 (Reasoning) support?

GLM-4.5 (Reasoning) supports text input.

What output modalities does GLM-4.5 (Reasoning) support?

GLM-4.5 (Reasoning) supports text output.

Can GLM-4.5 (Reasoning) process images?

No, GLM-4.5 (Reasoning) does not support image input. It can only process text.

Is GLM-4.5 (Reasoning) multimodal?

No, GLM-4.5 (Reasoning) is not multimodal. It only supports text input.

What is the context window of GLM-4.5 (Reasoning)?

GLM-4.5 (Reasoning) has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.

Is GLM-4.5 (Reasoning) open source?

Yes, GLM-4.5 (Reasoning) is open weights. The model weights are publicly available and can be downloaded for self-hosting.

GLM-4.5 (Reasoning)

When to Use GLM-4.5 (Reasoning)

✓ Best For

✗ Not Ideal For

How GLM-4.5 (Reasoning) Compares

Intelligence Index · Higher is better

Benchmark Profile

Output Speed · tok/s

Math Index

Intelligence · Coding · Math

All Benchmark Scores (27)

Frequently Asked Questions (15)

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

Follow Us