Realtime AI Cost Optimization Coding Agents Multimodal Gemini Evals Benchmarks DX Cost Agent Infrastructure Multi-Agent Systems Agent Product Launch Antigravity

Gemini 2.5 Flash

Fast, cost-efficient multimodal model

Visit site

Gemini 2.5 Flash is a Google DeepMind multimodal Gemini 2.5 model release optimized for fast, cost-efficient use and exposed through the Gemini API.

Pricing

Model profile · Current snapshot

Input / 1M

$0.30

Output / 1M

$2.50

Blended / 1M

$0.85

Output TPS

212

TTFT (s)

0.39

Model Intelligence

Context window

1,048,576 tokens

Arena ranking

Benchmarkable

Yes

Model level

release

Intelligence Index

14.1

Math Index

60.3

MMLU Pro

0.81

GPQA

0.68

HLE

0.05

LiveCodeBench

0.5

SciCode

0.29

MATH-500

0.93

AIME

0.5

AIME 2025

0.6

IFBench

0.39

LCR

0.46

TerminalBench Hard

0.12

TAU2

0.15

Recent stories

2 linked stories

releasePRIMARY2026-05-19

Gemini 3.5 Flash ships with 76.2% Terminal-Bench 2.1 and $1.50/$9 pricing

Google shipped Gemini 3.5 Flash as a GA model with 1M context, 65K max output, and stronger agentic benchmarks than Gemini 3.1 Pro. Watch task-level cost, since third-party evals show it can exceed Gemini 3.1 Pro and GPT-5.5 Medium on some jobs.

releaseSECONDARY2026-05-19

Google launches Antigravity 2.0 with CLI, SDK, and single-call Managed Agents

Google launched Antigravity 2.0 as a desktop app plus CLI/SDK stack for multi-agent workflows, and added Managed Agents to the Gemini API with persistent Linux sandboxes. Try it for agent orchestration and API-based sandboxing, but verify harness costs and runtime fit.