Gemini 3.1 Flash TTS Preview

Preview text-to-speech release in the Gemini Flash line.

Visit site

Preview Gemini Flash text-to-speech model release from Google for audio generation.

Pricing

Model profile · Current snapshot

Input / 1M

$0.50

Output / 1M

$3.00

Blended / 1M

$1.13

Output TPS

185

TTFT (s)

5.52

Model Intelligence

Arena ranking

Benchmarkable

Yes

Model level

release

Intelligence Index

46.4

Coding Index

42.6

Math Index

MMLU Pro

0.89

GPQA

0.9

HLE

0.35

LiveCodeBench

0.91

SciCode

0.51

AIME 2025

0.97

IFBench

0.78

LCR

0.66

TerminalBench Hard

0.39

TAU2

0.8

Recent stories

1 linked story

releasePRIMARY2026-04-15

Gemini 3.1 Flash TTS launches with Audio Tags, 70+ languages and API preview

Google released Gemini 3.1 Flash TTS with inline Audio Tags, multi-speaker control and 70+ languages, and opened preview access through the Gemini API and AI Studio with rollout to Vertex AI and Google Vids. Independent evals ranked it near the top of current speech leaderboards, but it runs slower and costs more than the leading system.