Qwen3 235B A22B (Non-reasoning)
Speed
61 tok/s
TTFT: 1.14s
Pricing
$0.70 / $2.80
per 1M tokens (in/out)
Qwen3 235B A22B is a large language model developed by Alibaba, featuring a Mixture-of-Experts (MoE) architecture with 235 billion total parameters and 22 billion activated parameters. It achieves competitive results in benchmark evaluations of coding, math, general capabilities, and more, compared to other top-tier models.
Qwen3 235B A22B (Non-reasoning) is Alibaba’s model designed for various AI applications. It processes at 60.742 tokens per second and is priced at $0.7 per million input tokens, targeting professional users.
Read more ▼
Frequently Asked Questions (15)
When was Qwen3 235B A22B (Non-reasoning) released?
Qwen3 235B A22B (Non-reasoning) was released on April 28, 2025.
Who created Qwen3 235B A22B (Non-reasoning)?
Qwen3 235B A22B (Non-reasoning) was created by Alibaba.
How intelligent is Qwen3 235B A22B (Non-reasoning)?
Qwen3 235B A22B (Non-reasoning) scores 17 on the Artificial Analysis Intelligence Index, placing it below average among other open weight non-reasoning models of similar size (median: 20).
How fast is Qwen3 235B A22B (Non-reasoning)?
Qwen3 235B A22B (Non-reasoning) generates output at 56.4 tokens per second (based on Alibaba's API), which is above average compared to other open weight non-reasoning models of similar size (median: 54.6 t/s).
What is the latency of Qwen3 235B A22B (Non-reasoning)?
Qwen3 235B A22B (Non-reasoning) has a time to first token (TTFT) of 2.75s (based on Alibaba's API), which is at the higher end compared to other open weight non-reasoning models of similar size (median: 2.25s).
How much does Qwen3 235B A22B (Non-reasoning) cost?
Qwen3 235B A22B (Non-reasoning) costs $0.70 per 1M input tokens (somewhat higher than average, median: $0.60) and $2.80 per 1M output tokens (somewhat higher than average, median: $2.33), based on Alibaba's API.
What is Qwen3 235B A22B (Non-reasoning) API pricing?
Qwen3 235B A22B (Non-reasoning) costs $0.70 per 1M input tokens and $2.80 per 1M output tokens (based on Alibaba's API). For a blended rate (3:1 input to output ratio), this is $1.23 per 1M tokens. Pricing may vary by provider.
How verbose is Qwen3 235B A22B (Non-reasoning)?
When evaluated on the Intelligence Index, Qwen3 235B A22B (Non-reasoning) generated 4.1M output tokens, which is very competitive compared to other open weight non-reasoning models of similar size (median: 9.1M).
Is Qwen3 235B A22B (Non-reasoning) a reasoning model?
No, Qwen3 235B A22B (Non-reasoning) is not a reasoning model. It provides direct responses without extended chain-of-thought reasoning.
What input modalities does Qwen3 235B A22B (Non-reasoning) support?
Qwen3 235B A22B (Non-reasoning) supports text input.
What output modalities does Qwen3 235B A22B (Non-reasoning) support?
Qwen3 235B A22B (Non-reasoning) supports text output.
Can Qwen3 235B A22B (Non-reasoning) process images?
No, Qwen3 235B A22B (Non-reasoning) does not support image input. It can only process text.
Is Qwen3 235B A22B (Non-reasoning) multimodal?
No, Qwen3 235B A22B (Non-reasoning) is not multimodal. It only supports text input.
What is the context window of Qwen3 235B A22B (Non-reasoning)?
Qwen3 235B A22B (Non-reasoning) has a context window of 33k tokens. This determines how much text and conversation history the model can process in a single request.
Is Qwen3 235B A22B (Non-reasoning) open source?
Yes, Qwen3 235B A22B (Non-reasoning) is open weights. The model weights are publicly available and can be downloaded for self-hosting.