Realtime AICost OptimizationCoding AgentsMultimodalGeminiEvalsBenchmarksDX CostAgent InfrastructureMulti-Agent SystemsAgent Product LaunchAntigravity
Gemini 2.5 Flash
Fast, cost-efficient multimodal model
Gemini 2.5 Flash is a Google DeepMind multimodal Gemini 2.5 model release optimized for fast, cost-efficient use and exposed through the Gemini API.
Pricing
Model profile · Current snapshot
Input / 1M
$0.30
Output / 1M
$2.50
Blended / 1M
$0.85
Output TPS
212
TTFT (s)
0.39
Model Intelligence
Context window
1,048,576 tokens
Arena ranking
14
Benchmarkable
Yes
Model level
release
Intelligence Index
14.1
Math Index
60.3
MMLU Pro
0.81
GPQA
0.68
HLE
0.05
LiveCodeBench
0.5
SciCode
0.29
MATH-500
0.93
AIME
0.5
AIME 2025
0.6
IFBench
0.39
LCR
0.46
TerminalBench Hard
0.12
TAU2
0.15
Recent stories
2 linked stories
releasePRIMARY2026-05-19
Gemini 3.5 Flash ships with 76.2% Terminal-Bench 2.1 and $1.50/$9 pricing
Google shipped Gemini 3.5 Flash as a GA model with 1M context, 65K max output, and stronger agentic benchmarks than Gemini 3.1 Pro. Watch task-level cost, since third-party evals show it can exceed Gemini 3.1 Pro and GPT-5.5 Medium on some jobs.
releaseSECONDARY2026-05-19
Google launches Antigravity 2.0 with CLI, SDK, and single-call Managed Agents
Google launched Antigravity 2.0 as a desktop app plus CLI/SDK stack for multi-agent workflows, and added Managed Agents to the Gemini API with persistent Linux sandboxes. Try it for agent orchestration and API-based sandboxing, but verify harness costs and runtime fit.