Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)

NVIDIA
2025-05-20
Intelligence: 14.4 (#358/523)
Math: 50 (#140/265)

Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is a text-only reasoning model from NVIDIA. It scores 50 on the Math Index (ranked #140 of 265) and 14.4 on the Intelligence Index (ranked #358 of 523). It is listed at $0 per million tokens, and its open weights can be downloaded for self-hosting.

When to Use Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)

✓ Best For

  • Mathematical problem solving
  • Advanced reasoning tasks
  • Cost-effective AI applications

✗ Not Ideal For

  • High-speed processing needs
  • Coding tasks

How Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) Compares

[Chart: Intelligence Index, higher is better. Providers shown: Kimi, MBZUAI Institute of Foundation Models, NVIDIA, Alibaba]

Benchmark Profile

[Chart: Math Index. Providers shown: OpenAI, LG AI Research, NVIDIA, DeepSeek, InclusionAI]

[Chart: Intelligence, Coding, Math]

All Benchmark Scores (12)

Benchmark            Score
Intelligence Index   14.4
Math Index           50
MMLU-Pro             55.6%
GPQA                 40.8%
LiveCodeBench        49.3%
HLE                  5.1%
SciCode              10.1%
IFBench              25.5%
Tau2                 11.7%
AIME                 70.7%
AIME 2025            50%
MATH 500             94.7%

Data: Artificial Analysis · Updated: April 10, 2026

Frequently Asked Questions (15)

When was Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) released?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) was released on May 20, 2025.
Who created Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) was created by NVIDIA.
How intelligent is Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) scores 14 (estimated) on the Artificial Analysis Intelligence Index, placing it below average among other open weight models of similar size (median: 15).
Is Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) a reasoning model?
Yes, Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.
What input modalities does Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) support?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) supports text input.
What output modalities does Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) support?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) supports text output.
Can Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) process images?
No, Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) does not support image input. It can only process text.
Is Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) multimodal?
No, Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is not multimodal. It only supports text input.
What is the context window of Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) has a context window of 130k tokens. This determines how much text and conversation history the model can process in a single request.
Is Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) open source?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is an open-weights model. The model weights are publicly available and can be downloaded for self-hosting.
How many parameters does Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) have?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) has 4.51 billion parameters.
What is the license for Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is released under the NVIDIA Open Model License Agreement, which allows commercial use.
How does Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) perform on benchmarks?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) achieves a score of 14 on the Artificial Analysis Intelligence Index. This composite benchmark evaluates models across reasoning, knowledge, mathematics, and coding.
What is the knowledge cutoff date for Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) has a knowledge cutoff of June 2023. The model's training data includes information up to this date.
Is Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) available via API?
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) is an open weights model that can be self-hosted.
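
For anyone planning to self-host, the parameter count and context window quoted in the answers above translate into two quick back-of-the-envelope checks. The Python sketch below is illustrative only: the function names and the bytes-per-parameter table are assumptions introduced here, "130k" is taken to mean exactly 130,000 tokens, and the memory figure covers weights only (no KV cache, activations, or runtime overhead).

```python
# Rough sizing helpers for self-hosting. All names here are hypothetical.
PARAMS = 4.51e9            # parameter count stated in the FAQ
CONTEXT_WINDOW = 130_000   # "130k tokens" from the FAQ, assumed to be exactly 130,000

# Bytes per parameter for common weight formats (weights only).
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, precision: str) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[precision] / 1e9


def fits_context(prompt_tokens: int, max_new_tokens: int,
                 window: int = CONTEXT_WINDOW) -> bool:
    """True if the prompt plus the requested output stays within the window."""
    return prompt_tokens + max_new_tokens <= window


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        gb = weight_memory_gb(PARAMS, precision)
        print(f"{precision}: ~{gb:.2f} GB of weights")
    # 120,000 + 8,000 tokens stays under the assumed 130,000-token window.
    print(fits_context(prompt_tokens=120_000, max_new_tokens=8_000))
    # 125,000 + 8,000 tokens exceeds it.
    print(fits_context(prompt_tokens=125_000, max_new_tokens=8_000))
```

Because this is a reasoning model, its chain-of-thought counts toward the generated tokens, so `max_new_tokens` should leave room for the thinking as well as the final answer.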