GLM-5.1 (Non-reasoning)

Z AI

2026-04-07

MIT

Modality: Text

Intelligence

43.8

#58/523

Coding

35.8

#86/429

Math

—

Speed

55 tok/s

TTFT: 1.02s

Pricing

$1.40 / $4.40

per 1M tokens (in/out)

GLM-5.1 is Z.AI’s next-generation flagship foundation model designed for long-horizon agentic engineering tasks. Built on a 754B MoE architecture (40B active parameters), it can work continuously and autonomously on a single task for up to 8 hours, completing the full loop from planning and execution to iterative optimization and delivery. GLM-5.1 achieves state-of-the-art on SWE-Bench Pro (58.4) and demonstrates strong performance across coding, reasoning, and agentic benchmarks. It supports 200K context length, 128K max output tokens, thinking mode, function calling, structured output, context caching, and MCP integration. Overall performance is aligned with Claude Opus 4.6 with particular strengths in sustained execution and complex engineering optimization.

GLM-5.1 (Non-reasoning) is Z AI’s model for general natural language processing applications. It generates output at about 55 tokens per second and is priced at $1.40 per million input tokens and $4.40 per million output tokens, targeting professional users.

When to Use GLM-5.1 (Non-reasoning)

✓ Best For

Text generation and summarization.
Chatbot and virtual assistant development.
Content creation for marketing and social media.

✗ Not Ideal For

Complex mathematical problem-solving.
Tasks requiring advanced reasoning capabilities.

How GLM-5.1 (Non-reasoning) Compares

Intelligence Index · Higher is better

[Chart comparing models from xAI, KwaiKAT, Z AI, Alibaba, and Xiaomi]

Benchmark Profile

Coding Index

[Chart comparing models from Z AI, OpenAI, Xiaomi, and Mistral]

Output Speed · tok/s

[Chart comparing models from Alibaba, Anthropic, and Z AI]

Intelligence · Coding · Math

All Benchmark Scores (9)

Benchmark	Score
Intelligence Index	43.8
Coding Index	35.8
GPQA	83.9%
HLE	25.6%
SciCode	36.1%
IFBench	52%
LCR	44.3%
TerminalBench Hard	35.6%
Tau2	97.1%

Data: Artificial Analysis · Updated: April 10, 2026

Frequently Asked Questions (15)

When was GLM-5.1 (Non-reasoning) released?

GLM-5.1 (Non-reasoning) was released on April 7, 2026.

Who created GLM-5.1 (Non-reasoning)?

GLM-5.1 (Non-reasoning) was created by Z AI.

How intelligent is GLM-5.1 (Non-reasoning)?

GLM-5.1 (Non-reasoning) scores 44 on the Artificial Analysis Intelligence Index, placing it well above average among other open weight non-reasoning models of similar size (median: 21).

How fast is GLM-5.1 (Non-reasoning)?

GLM-5.1 (Non-reasoning) generates output at 46.2 tokens per second (based on the median across providers serving the model), which is below average compared to other open weight non-reasoning models of similar size (median: 52.1 t/s).

What is the latency of GLM-5.1 (Non-reasoning)?

GLM-5.1 (Non-reasoning) has a time to first token (TTFT) of 1.60s (based on the median across providers serving the model), which is better than average compared to other open weight non-reasoning models of similar size (median: 2.15s).
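Speed and latency together give a rough sense of how long a full response takes. The following is a minimal sketch, assuming a simple model in which total time equals TTFT plus output tokens divided by throughput. It ignores network overhead and provider-to-provider variation, and the function name is illustrative rather than part of any API.

```python
MEDIAN_TTFT_S = 1.60       # seconds, median across providers (quoted above)
MEDIAN_SPEED_TOK_S = 46.2  # output tokens per second, median across providers


def estimated_response_time(output_tokens, ttft_s=MEDIAN_TTFT_S, speed=MEDIAN_SPEED_TOK_S):
    """Rough end-to-end time: wait for the first token, then stream the rest."""
    return ttft_s + output_tokens / speed


# A 500-token answer at the median figures
print(round(estimated_response_time(500), 1))  # 12.4
```

Under these assumptions, throughput dominates for long outputs, while TTFT matters most for short, interactive replies.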

How much does GLM-5.1 (Non-reasoning) cost?

GLM-5.1 (Non-reasoning) costs $1.40 per 1M input tokens (at the higher end, median: $0.60) and $4.40 per 1M output tokens (at the higher end, median: $2.40), based on the median across providers serving the model.

What is GLM-5.1 (Non-reasoning) API pricing?

GLM-5.1 (Non-reasoning) costs $1.40 per 1M input tokens and $4.40 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $2.15 per 1M tokens. Pricing may vary by provider.
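The blended figure follows from weighting the two prices by the 3:1 ratio. The sketch below reproduces that arithmetic in Python using only the prices quoted above. The function names and the example request size are illustrative assumptions, not part of any provider's API.

```python
INPUT_PRICE_PER_M = 1.40   # USD per 1M input tokens (quoted above)
OUTPUT_PRICE_PER_M = 4.40  # USD per 1M output tokens (quoted above)


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output ratio."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens, output_tokens):
    """Estimated USD cost of one request at the quoted median prices."""
    return (input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


print(round(blended_price(INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M), 2))  # 2.15
print(round(request_cost(30_000, 10_000), 4))  # 0.086
```

Workloads that generate more output than the 3:1 ratio assumes will see an effective rate closer to the $4.40 output price.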

How verbose is GLM-5.1 (Non-reasoning)?

When evaluated on the Intelligence Index, GLM-5.1 (Non-reasoning) generated 76M output tokens, which is at the higher end compared to other open weight non-reasoning models of similar size (median: 9.2M).

Is GLM-5.1 (Non-reasoning) a reasoning model?

No, GLM-5.1 (Non-reasoning) is not a reasoning model. It provides direct responses without extended chain-of-thought reasoning.

What input modalities does GLM-5.1 (Non-reasoning) support?

GLM-5.1 (Non-reasoning) supports text input.

What output modalities does GLM-5.1 (Non-reasoning) support?

GLM-5.1 (Non-reasoning) supports text output.

Can GLM-5.1 (Non-reasoning) process images?

No, GLM-5.1 (Non-reasoning) does not support image input. It can only process text.

Is GLM-5.1 (Non-reasoning) multimodal?

No, GLM-5.1 (Non-reasoning) is not multimodal. It supports only text input and text output.

What is the context window of GLM-5.1 (Non-reasoning)?

GLM-5.1 (Non-reasoning) has a context window of 200k tokens. This determines how much text and conversation history the model can process in a single request.
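In practice, the prompt and the requested output usually share this window. The following is a minimal sketch of a budget check, assuming the 200k-token window stated here and the 128K maximum output given in the model description on this page. Token counts are taken as given because the tokenizer is not described here, and the function name is hypothetical.

```python
CONTEXT_WINDOW = 200_000  # tokens, as stated above
MAX_OUTPUT = 128_000      # tokens, from the model description on this page


def fits_in_context(prompt_tokens, requested_output_tokens):
    """Return True if the request stays within both limits.

    Assumes prompt and output tokens count against the same window,
    which is a common convention but is not confirmed by this page.
    """
    if requested_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


print(fits_in_context(150_000, 40_000))  # True
print(fits_in_context(150_000, 60_000))  # False
```

A check like this is best run before sending a request, so that long conversation histories can be trimmed or summarized ahead of time.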

Is GLM-5.1 (Non-reasoning) open source?

Yes, GLM-5.1 (Non-reasoning) is an open-weights model released under the MIT license. The model weights are publicly available and can be downloaded for self-hosting.

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
