Gemini
Google DeepMind's family of multimodal AI models
Recent stories
Google DeepMind showed an experimental pointer that lets Gemini act directly on screen elements with motion, speech, and shorthand commands. The demos move assistance from chat into live workspace control, but the feature was presented as an experiment rather than a shipped product.
Multiple posts preview a Google video model called Gemini Omni with remix, templates, and chat editing, plus demos that keep chalkboard math readable. The clips are still unofficial, but creators are watching the text-fidelity claim closely.
A creator thread resurfaced Google Stitch as a free Labs tool that turns detailed prompts into prototypes and exports HTML, CSS, Tailwind, React, and Figma files. The prompt pack matters because it shows designers can move from a one-line brief to landing pages, auth flows, dashboards, and pricing screens without starting in Figma.
A Hermes and Kimi hackathon build mapped a local filmmaking pipeline with prompt packets, browser workers, Syncthing handoff, image ranking, and taste memory. It matters because subscription-only tools can be folded into a reusable production loop, but the taste model is still early and creator-specific.
Several creator comparisons say Grok's Quality mode now looks close to Nano Banana Pro, especially on skin texture and realism. One Grok-compatible creator service also said it is ending its $5 plan, moving to annual pricing, and adding 9:16 support with $0.15 generations.
Freepik published a Cuco B. Hops breakdown that moves from Nano Banana 2 character sheets to Seedance 2.0 scenes inside one workspace. Teams can use it as a repeatable template for cross-shot character consistency.
Gemini 3.1 Flash TTS added Audio Tags, 70-plus language support, and SynthID watermarking for generated speech. The preview spans Gemini API, AI Studio, Vertex AI, and Google Vids, so teams can test delivery control before adopting it.
Amir Mushich published a Nano Banana prompt that keeps official logo geometry while rendering brands as beveled glass sculptures against an open sky. Follow-up examples showed the setup working across multiple logos with only small variable changes, so creators can reuse it for mockup work.
Creators documented repeatable Seedance 2.0 workflows that start with Midjourney, Nano Banana 2, or Gemini references, then use timeline prompts, frame extraction, and Omni Reference. The chains now cover action previs, music videos, and stylized scene changes, so teams can copy the workflow across editors.
Three builder threads shared reusable layers around model APIs: per-user usage gateways, audits for Gemini-enabled GCP keys, and config-driven routing that swaps providers without app rewrites. Wrapping rate limits, key scope, and model choice in one layer helps teams ship multi-user apps without scattering provider logic.
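The config-driven routing idea above can be sketched briefly. This is a minimal illustration, not any of the builders' actual code: all names (`Route`, `CONFIG`, `PROVIDERS`, `complete`, and the model strings) are hypothetical, and real provider entries would wrap SDK clients, rate limits, and key scoping behind the same callable signature.

```python
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class Route:
    provider: str
    model: str

# Config maps logical task names to provider/model pairs, so swapping
# providers is a config edit rather than an app rewrite.
CONFIG: Dict[str, Route] = {
    "chat": Route(provider="google", model="gemini-flash"),
    "summarize": Route(provider="other", model="small-model"),
}

# Stand-in provider callables (hypothetical); each takes (model, prompt)
# and returns text, so app code never touches provider-specific logic.
PROVIDERS: Dict[str, Callable[[str, str], str]] = {
    "google": lambda model, prompt: f"[google/{model}] {prompt}",
    "other": lambda model, prompt: f"[other/{model}] {prompt}",
}

def complete(task: str, prompt: str) -> str:
    """Route a request by task name; provider choice lives in CONFIG."""
    route = CONFIG[task]
    return PROVIDERS[route.provider](route.model, prompt)

print(complete("chat", "hello"))
```

Because the app only ever calls `complete`, adding per-user rate limiting or key-scope checks means wrapping that one function, which is the "single layer" the threads describe.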
Google says its new realtime voice model improves understanding in noisy environments, handling of long conversations, and function calling, and it is rolling out to Gemini Live, Search Live, and AI Studio. Voice creators can test it for lower-latency spoken interactions.