Question 1

When was Hermes 4 - Llama-3.1 405B (Reasoning) released?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) was released on August 27, 2025.

Question 2

Who created Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) was created by Nous Research.

Question 3

How intelligent is Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) scores 19 on the Artificial Analysis Intelligence Index, placing it below average among other open weight models of similar size (median: 27).

Question 4

How fast is Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) generates output at 32.6 tokens per second (based on the median across providers serving the model), which is at the lower end compared to other open weight models of similar size (median: 54.8 t/s).

Question 5

What is the latency of Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) has a time to first token (TTFT) of 2.42s (based on the median across providers serving the model), which is somewhat higher than average compared to other open weight models of similar size (median: 2.25s).

Question 6

How much does Hermes 4 - Llama-3.1 405B (Reasoning) cost?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) costs $1.00 per 1M input tokens (somewhat higher than average, median: $0.60) and $3.00 per 1M output tokens (somewhat higher than average, median: $2.20), based on the median across providers serving the model.

Question 7

What is Hermes 4 - Llama-3.1 405B (Reasoning) API pricing?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) costs $1.00 per 1M input tokens and $3.00 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $1.50 per 1M tokens. Pricing may vary by provider.

Question 8

How verbose is Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

When evaluated on the Intelligence Index, Hermes 4 - Llama-3.1 405B (Reasoning) generated 39M output tokens, which is somewhat higher than average compared to other open weight models of similar size (median: 17M).

Question 9

Is Hermes 4 - Llama-3.1 405B (Reasoning) a reasoning model?

Accepted Answer

Yes, Hermes 4 - Llama-3.1 405B (Reasoning) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

Question 10

What input modalities does Hermes 4 - Llama-3.1 405B (Reasoning) support?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) supports text input.

Question 11

What output modalities does Hermes 4 - Llama-3.1 405B (Reasoning) support?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) supports text output.

Question 12

Can Hermes 4 - Llama-3.1 405B (Reasoning) process images?

Accepted Answer

No, Hermes 4 - Llama-3.1 405B (Reasoning) does not support image input. It can only process text.

Question 13

Is Hermes 4 - Llama-3.1 405B (Reasoning) multimodal?

Accepted Answer

No, Hermes 4 - Llama-3.1 405B (Reasoning) is not multimodal. It only supports text input.

Question 14

What is the context window of Hermes 4 - Llama-3.1 405B (Reasoning)?

Accepted Answer

Hermes 4 - Llama-3.1 405B (Reasoning) has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.

Question 15

Is Hermes 4 - Llama-3.1 405B (Reasoning) open source?

Accepted Answer

Yes, Hermes 4 - Llama-3.1 405B (Reasoning) is open weights. The model weights are publicly available and can be downloaded for self-hosting.

Benchmark	Score
Intelligence Index	18.6
Coding Index	16
Math Index	69.7
MMLU-Pro	829%
GPQA	727%
LiveCodeBench	686%
HLE	103%
SciCode	25.2%
IFBench	32.7%
LCR	20.7%
TerminalBench Hard	11.4%
Tau2	22.2%
AIME 2025	69.7%

Hermes 4 – Llama-3.1 405B (Reasoning)

When to Use Hermes 4 – Llama-3.1 405B (Reasoning)

✓ Best For

✗ Not Ideal For

How Hermes 4 – Llama-3.1 405B (Reasoning) Compares

Intelligence Index · Higher is better

Benchmark Profile

Coding Index

Output Speed · tok/s

Math Index

Intelligence · Coding · Math

All Benchmark Scores (13)

Frequently Asked Questions (15)

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.