No Result

View All Result

No Result

View All Result

No Result

View All Result

GLM-4.5-Air

← AI Models

Z AI

2025-07-28

MIT

106B params

Modality:

Intelligence

23.2

#227/520

Coding

23.8

#178/426

Math

80.7

#62/265

Speed

106 tok/s

TTFT: 632.00s

Pricing

$0.20 / $1.10

per 1M tokens (in/out)

GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It features 106 billion total parameters with 12 billion active parameters using MoE architecture. Like GLM-4.5, it is a hybrid reasoning model providing thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. Despite its compact design, GLM-4.5-Air delivers competitive performance with a score of 59.8 across 12 industry-standard benchmarks, ranking 6th overall while maintaining superior efficiency. It supports 128K context length and is released under MIT open-source license allowing commercial use.

GLM-4.5-Air is Z AI’s model designed for high-performance tasks. It processes at 106.41 tokens per second and is priced at $0.2 per million input tokens, targeting professional users.

When to Use GLM-4.5-Air

✓ Best For

Complex mathematical computations.
High-speed coding tasks.
Data analysis requiring extensive context.

✗ Not Ideal For

Applications requiring low latency.
Basic conversational AI tasks.

How GLM-4.5-Air Compares

Intelligence Index · Higher is better

OpenAIAmazonZ AIKorea Telecom

Benchmark Profile

Coding Index

DeepSeekAmazonZ AIxAIMistral

Output Speed · tok/s

OpenAIMistralZ AIMiniMax

Math Index

AlibabaZ AIAnthropicMotif Technologies

Intelligence · Coding · Math

Intelligence Coding Math

All Benchmark Scores (25)

Benchmark	Score
Intelligence Index	23.2
Coding Index	23.8
Math Index	80.7
MMLU-Pro	815%
GPQA	733%
LiveCodeBench	684%
HLE	68%
SWE-Bench Verified	57.6%
SciCode	30.6%
IFBench	37.6%
LCR	43.7%
TerminalBench Hard	20.5%
Tau2	46.5%
AIME	67.3%
AIME 2025	80.7%
MATH 500	96.5%
AA-Index	64.8%
AIME 2024	89.4%
BFCL-v3	76.4%
BrowseComp	21.3%
Humanity's Last Exam	10.6%
MATH-500	98.1%
TAU-bench Airline	60.8%
TAU-bench Retail	77.9%
Terminal-Bench	30%

Data: Artificial Analysis · Updated: April 2, 2026

Frequently Asked Questions (15)

When was GLM-4.5-Air released?

GLM-4.5-Air was released on July 28, 2025.

Who created GLM-4.5-Air?

GLM-4.5-Air was created by Z AI.

How intelligent is GLM-4.5-Air?

GLM-4.5-Air scores 23 on the Artificial Analysis Intelligence Index, placing it well above average among other open weight models of similar size (median: 15).

How fast is GLM-4.5-Air?

GLM-4.5-Air generates output at 94.3 tokens per second (based on the median across providers serving the model), which is above average compared to other open weight models of similar size (median: 77.3 t/s).

What is the latency of GLM-4.5-Air?

GLM-4.5-Air has a time to first token (TTFT) of 1.24s (based on the median across providers serving the model), which is better than average compared to other open weight models of similar size (median: 1.57s).

How much does GLM-4.5-Air cost?

GLM-4.5-Air costs $0.20 per 1M input tokens (better than average, median: $0.35) and $1.10 per 1M output tokens (somewhat higher than average, median: $0.75), based on the median across providers serving the model.

What is GLM-4.5-Air API pricing?

GLM-4.5-Air costs $0.20 per 1M input tokens and $1.10 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $0.42 per 1M tokens. Pricing may vary by provider.

How verbose is GLM-4.5-Air?

When evaluated on the Intelligence Index, GLM-4.5-Air generated 68M output tokens, which is at the higher end compared to other open weight models of similar size (median: 7.3M).

Is GLM-4.5-Air a reasoning model?

Yes, GLM-4.5-Air is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

What input modalities does GLM-4.5-Air support?

GLM-4.5-Air supports text input.

What output modalities does GLM-4.5-Air support?

GLM-4.5-Air supports text output.

Can GLM-4.5-Air process images?

No, GLM-4.5-Air does not support image input. It can only process text.

Is GLM-4.5-Air multimodal?

No, GLM-4.5-Air is not multimodal. It only supports text input.

What is the context window of GLM-4.5-Air?

GLM-4.5-Air has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.

Is GLM-4.5-Air open source?

Yes, GLM-4.5-Air is open weights. The model weights are publicly available and can be downloaded for self-hosting.

GLM-4.5-Air

When to Use GLM-4.5-Air

✓ Best For

✗ Not Ideal For

How GLM-4.5-Air Compares

Intelligence Index · Higher is better

Benchmark Profile

Coding Index

Output Speed · tok/s

Math Index

Intelligence · Coding · Math

All Benchmark Scores (25)

Frequently Asked Questions (15)

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

Follow Us