gemini-3.5-flash

Common Name: Gemini 3.5 Flash

Google
-10%On SaleReleased on May 19 12:00 AMSupportedTool InvocationSupportedReasoning
CompareTry in Chat

Google's native multimodal reasoning model based on the Gemini 3 Flash foundation, optimized for agentic workflows, coding tasks, and long-context enterprise processes with configurable thinking levels.

Specifications

Context
1000K
Maximum Output
64K
Inputtext, image, audio, video
Outputtext

Performance (7-day Average)

Collecting…
Collecting…
Collecting…

Pricing

Standard
Batch
Flex
priority
Input/MTokens
$1.35
$0.675
$0.675
$2.43
Cached Input/MTokens
$0.135
$0.0675
$0.072
$0.243
Output/MTokens
$8.10
$4.05
$4.05
$14.58
Thinking Output/MTokens
$8.10
$4.05
$4.05
$14.58

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

$1.80/$10.80/M-10%
ctx1.0Mmax64Kavailtps
InOutCap

Gemini 3 Pro Image Preview with native image generation capabilities alongside text understanding.

$0.09/$0.36/M-10%
ctx1.0Mmax66Kavailtps
InOutCap

A lightweight version of Gemini 2.5 Flash optimized for speed and cost efficiency with 1M token context support.

$0.27/$2.25/M-10%
ctx1.0Mmax66Kavailtps

Google's most efficient workhorse model designed for speed and low-cost. Improved across key benchmarks for reasoning, multimodality, code and long context while being 20-30% more efficient.

$1.125/$9.00/M-10%
ctx1.0Mmax66Kavailtps

Google's most intelligent AI model with adaptive thinking capabilities. Among the world's best models for coding and tasks requiring advanced reasoning.