Question 1

When was Qwen3.5 4B (Reasoning) released?

Accepted Answer

Qwen3.5 4B (Reasoning) was released on March 2, 2026.

Question 2

Who created Qwen3.5 4B (Reasoning)?

Accepted Answer

Qwen3.5 4B (Reasoning) was created by Alibaba.

Question 3

How intelligent is Qwen3.5 4B (Reasoning)?

Accepted Answer

Qwen3.5 4B (Reasoning) scores 27 on the Artificial Analysis Intelligence Index, placing it well above average among other open weight models of similar size (median: 15).

Question 4

How fast is Qwen3.5 4B (Reasoning)?

Accepted Answer

Qwen3.5 4B (Reasoning) generates output at 207.2 tokens per second (based on the median across providers serving the model), which is well above average compared to other open weight models of similar size (median: 96.9 t/s).

Question 5

What is the latency of Qwen3.5 4B (Reasoning)?

Accepted Answer

Qwen3.5 4B (Reasoning) has a time to first token (TTFT) of 0.64s (based on the median across providers serving the model), which is very competitive compared to other open weight models of similar size (median: 1.90s).

Question 6

How much does Qwen3.5 4B (Reasoning) cost?

Accepted Answer

Qwen3.5 4B (Reasoning) costs $0.03 per 1M input tokens (very competitive, median: $0.18) and $0.15 per 1M output tokens (very competitive, median: $0.40), based on the median across providers serving the model.

Question 7

What is Qwen3.5 4B (Reasoning) API pricing?

Accepted Answer

Qwen3.5 4B (Reasoning) costs $0.03 per 1M input tokens and $0.15 per 1M output tokens (based on the median across providers serving the model). For a blended rate (3:1 input to output ratio), this is $0.06 per 1M tokens. Pricing may vary by provider.

Question 8

How verbose is Qwen3.5 4B (Reasoning)?

Accepted Answer

When evaluated on the Intelligence Index, Qwen3.5 4B (Reasoning) generated 240M output tokens, which is at the higher end compared to other open weight models of similar size (median: 23M).

Question 9

Is Qwen3.5 4B (Reasoning) a reasoning model?

Accepted Answer

Yes, Qwen3.5 4B (Reasoning) is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.

Question 10

What input modalities does Qwen3.5 4B (Reasoning) support?

Accepted Answer

Qwen3.5 4B (Reasoning) supports text, image, and video input.

Question 11

What output modalities does Qwen3.5 4B (Reasoning) support?

Accepted Answer

Qwen3.5 4B (Reasoning) supports text output.

Question 12

Can Qwen3.5 4B (Reasoning) process images?

Accepted Answer

Yes, Qwen3.5 4B (Reasoning) supports image input and can analyze, describe, and answer questions about images.

Question 13

Is Qwen3.5 4B (Reasoning) multimodal?

Accepted Answer

Yes, Qwen3.5 4B (Reasoning) is multimodal. It can process text, image, and video input and generate text output.

Question 14

What is the context window of Qwen3.5 4B (Reasoning)?

Accepted Answer

Qwen3.5 4B (Reasoning) has a context window of 260k tokens. This determines how much text and conversation history the model can process in a single request.

Question 15

Is Qwen3.5 4B (Reasoning) open source?

Accepted Answer

Yes, Qwen3.5 4B (Reasoning) is open weights. The model weights are publicly available and can be downloaded for self-hosting.

Benchmark	Score
Intelligence Index	27.1
Coding Index	17.5
GPQA	771%
HLE	78%
SciCode	16.1%
IFBench	52%
LCR	55.7%
TerminalBench Hard	18.2%
Tau2	92.1%

Qwen3.5 4B (Reasoning)

When to Use Qwen3.5 4B (Reasoning)

✓ Best For

✗ Not Ideal For

How Qwen3.5 4B (Reasoning) Compares

Intelligence Index · Higher is better

Benchmark Profile

Coding Index

Output Speed · tok/s

Intelligence · Coding · Math

All Benchmark Scores (9)

Frequently Asked Questions (15)

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.