Question 1

When was Hermes 4 - Llama-3.1 405B (Non-reasoning) released?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) was released on August 27, 2025.

Question 2

Who created Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) was created by Nous Research.

Question 3

How intelligent is Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) scores 18 on the Artificial Analysis Intelligence Index, placing it below average among other open weight non-reasoning models of similar size (median: 20).

Question 4

How fast is Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) generates output at 33.9 tokens per second (based on the median across providers serving the model), which is at the lower end compared to other open weight non-reasoning models of similar size (median: 54.2 t/s).

Question 5

What is the latency of Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) has a time to first token (TTFT) of 2.53s (based on the median across providers serving the model), which is somewhat higher than average compared to other open weight non-reasoning models of similar size (median: 2.25s).

Question 6

How much does Hermes 4 - Llama-3.1 405B (Non-reasoning) cost?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) costs $1.00 per 1M input tokens (somewhat higher than average, median: $0.60) and $3.00 per 1M output tokens (somewhat higher than average, median: $2.33), based on the median across providers serving the model.

Question 7

What is Hermes 4 - Llama-3.1 405B (Non-reasoning) API pricing?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) costs $1.00 per 1M input tokens and $3.00 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $1.50 per 1M tokens. Pricing may vary by provider.

Question 8

How verbose is Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

When evaluated on the Intelligence Index, Hermes 4 - Llama-3.1 405B (Non-reasoning) generated 3.9M output tokens, which is very competitive compared to other open weight non-reasoning models of similar size (median: 9.1M).

Question 9

Is Hermes 4 - Llama-3.1 405B (Non-reasoning) a reasoning model?

Accepted Answer

No, Hermes 4 - Llama-3.1 405B (Non-reasoning) is not a reasoning model. It provides direct responses without extended chain-of-thought reasoning.

Question 10

What input modalities does Hermes 4 - Llama-3.1 405B (Non-reasoning) support?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) supports text input.

Question 11

What output modalities does Hermes 4 - Llama-3.1 405B (Non-reasoning) support?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) supports text output.

Question 12

Can Hermes 4 - Llama-3.1 405B (Non-reasoning) process images?

Accepted Answer

No, Hermes 4 - Llama-3.1 405B (Non-reasoning) does not support image input. It can only process text.

Question 13

Is Hermes 4 - Llama-3.1 405B (Non-reasoning) multimodal?

Accepted Answer

No, Hermes 4 - Llama-3.1 405B (Non-reasoning) is not multimodal. It only supports text input.

Question 14

What is the context window of Hermes 4 - Llama-3.1 405B (Non-reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Non-reasoning) has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.

Question 15

Is Hermes 4 - Llama-3.1 405B (Non-reasoning) open source?

Accepted Answer

Yes, Hermes 4 - Llama-3.1 405B (Non-reasoning) is open weights. The model weights are publicly available and can be downloaded for self-hosting.

Benchmark	Score
Intelligence Index	17.6
Coding Index	18.1
Math Index	15.3
MMLU-Pro	729%
GPQA	536%
LiveCodeBench	546%
HLE	42%
SciCode	34.6%
IFBench	34.8%
LCR	20%
TerminalBench Hard	9.8%
Tau2	26.6%
AIME 2025	15.3%

Hermes 4 – Llama-3.1 405B (Non-reasoning)

When to Use Hermes 4 – Llama-3.1 405B (Non-reasoning)

✓ Best For

✗ Not Ideal For

How Hermes 4 – Llama-3.1 405B (Non-reasoning) Compares

Intelligence Index · Higher is better

Benchmark Profile

Coding Index

Output Speed · tok/s

Math Index

Intelligence · Coding · Math

All Benchmark Scores (13)

Frequently Asked Questions (15)

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.