Gemini 3.5 Flash

← AI Models
Google
2026-05-19
Context: 1M
Modality:
Intelligence
55.3
#10/529
Coding
45
#32/435
Math
Speed
279 tok/s
TTFT: 14.17s
Pricing
$1.50 / $9.00
per 1M tokens (in/out)
Google Preferred Source

Gemini 3.5 Flash is Google’s latest AI model designed for various applications. It processes at a speed of 279.461 tokens per second and is priced at $1.5 per million input tokens, targeting professional users.

When to Use Gemini 3.5 Flash

✓ Best For

  • Data analysis and processing tasks.
  • Software development and coding assistance.
  • Mathematical problem-solving.

✗ Not Ideal For

  • High-complexity mathematical computations.
  • Users requiring extremely fast response times.

How Gemini 3.5 Flash Compares

Intelligence Index · Higher is better

OpenAIAlibabaGoogleMiniMax

Benchmark Profile

Coding Index

XiaomiOpenAIGoogleAlibaba

Output Speed · tok/s

OpenAIGoogleAmazon

Intelligence · Coding · Math

Intelligence Coding Math

All Benchmark Scores (10)

BenchmarkScore
Intelligence Index 55.3
Coding Index 45
GPQA 922%
HLE 41%
SciCode 53.1%
IFBench 76.3%
LCR 69.3%
TerminalBench Hard 40.9%
Tau2 95.3%
Arena Coding 16.3

Data: Artificial Analysis · Updated: May 19, 2026

Frequently Asked Questions (15)

When was Gemini 3.5 Flash released?
Gemini 3.5 Flash was released on May 19, 2026.
Who created Gemini 3.5 Flash?
Gemini 3.5 Flash was created by Google.
How intelligent is Gemini 3.5 Flash?
Gemini 3.5 Flash scores 55 on the Artificial Analysis Intelligence Index, placing it well above average among other reasoning models in a similar price tier (median: 36).
How fast is Gemini 3.5 Flash?
Gemini 3.5 Flash generates output at 284.2 tokens per second (based on Google's API), which is well above average compared to other reasoning models in a similar price tier (median: 63.7 t/s).
What is the latency of Gemini 3.5 Flash?
Gemini 3.5 Flash has a time to first token (TTFT) of 18.55s (based on Google's API), which is at the higher end compared to other reasoning models in a similar price tier (median: 2.70s).
How much does Gemini 3.5 Flash cost?
Gemini 3.5 Flash costs $1.50 per 1M input tokens (better than average, median: $1.65) and $9.00 per 1M output tokens (somewhat higher than average, median: $8.00), based on Google's API.
What is Gemini 3.5 Flash API pricing?
Gemini 3.5 Flash costs $1.50 per 1M input tokens and $9.00 per 1M output tokens (based on Google's API). For a blended rate (3:1 input to output ratio), this is $3.38 per 1M tokens. Pricing may vary by provider.
How verbose is Gemini 3.5 Flash?
When evaluated on the Intelligence Index, Gemini 3.5 Flash generated 73M output tokens, which is somewhat higher than average compared to other reasoning models in a similar price tier (median: 36M).
Is Gemini 3.5 Flash a reasoning model?
Yes, Gemini 3.5 Flash is a reasoning model. It uses extended thinking or chain-of-thought reasoning to work through complex problems before providing an answer.
What input modalities does Gemini 3.5 Flash support?
Gemini 3.5 Flash supports text and image input.
What output modalities does Gemini 3.5 Flash support?
Gemini 3.5 Flash supports text output.
Can Gemini 3.5 Flash process images?
Yes, Gemini 3.5 Flash supports image input and can analyze, describe, and answer questions about images.
Is Gemini 3.5 Flash multimodal?
Yes, Gemini 3.5 Flash is multimodal. It can process text and image input and generate text output.
What is the context window of Gemini 3.5 Flash?
Gemini 3.5 Flash has a context window of 1.0M tokens. This determines how much text and conversation history the model can process in a single request.
Is Gemini 3.5 Flash open source?
No, Gemini 3.5 Flash is proprietary. The model weights are not publicly available.