Skip to content
AI Primer
release

Google releases Nano Banana 2 Lite and Gemini Omni Flash

Google shipped Nano Banana 2 Lite for image generation and Gemini Omni Flash for conversational video generation and editing in the Gemini API and AI Studio. The release sets image generation at about 4 seconds and $0.034 per 1K image, while Omni Flash adds multi-turn video edits at $0.10 per second.

5 min read
Google releases Nano Banana 2 Lite and Gemini Omni Flash
Google releases Nano Banana 2 Lite and Gemini Omni Flash

TL;DR

You can jump straight to Google's blog post, skim the Nano Banana 2 Lite docs, and check the Omni Flash docs. The product connective tissue sits in AI Studio and the Interactions API, where GoogleDeepMind's workflow demo shows Google treating image generation plus conversational video editing as one stack, not two isolated releases.

What shipped

Google's official framing was simple: one low-latency image model, one video model with conversational edits.

Nano Banana 2 Lite

The image model's whole pitch is throughput. Google AI Studio's announcement called it a high-velocity model for developer pipelines, and GoogleDeepMind's speed post set the target at 4-second text-to-image output.

A few concrete numbers landed immediately:

Third-party scores made the tradeoff legible. Arena's ranking post put Nano Banana 2 Lite at #5 overall in text-to-image with a 1251 score, and fal's availability post claimed two-second generation and editing at 1K across 14 aspect ratios on its own surface.

Gemini Omni Flash

Omni Flash is the more interesting release. It turns video generation and editing into a chat loop, which is Christmas come early for people building media agents.

Google's own feature list in GoogleDeepMind's Omni Flash feature list broke the model into four capabilities:

  • Conversational video editing
  • Multimodal referencing and combined inputs
  • Real-world knowledge
  • Direct links between text, graphics, and video actions

The API details were just as specific in _philschmid's launch summary:

  • Price: $0.10 per second
  • Clip length: up to 10 seconds
  • Model slug: gemini-omni-flash-preview
  • Session behavior: each edit builds on prior context, for up to three sequential edits

Third-party ranking posts gave Google a decent day-one brag sheet. Arena's Omni Flash video edit ranking scored it 1347 in Video Edit Arena, #2 overall and nearly 40 points ahead of the next-best model there. fal's launch post also emphasized synchronized audio and scene-preserving edits from mixed inputs.

Where it shows up

Google did not keep either model inside its own UI for long.

Day-one and near-day-one rollouts included:

Early caveats

The launch posts were clean. The replies and follow-up threads were where the rough edges showed up.

According to rohanpaul_ai's caveat thread, Omni Flash currently lacks API audio reference support, and video references up to 3 seconds were documented but not yet processing correctly. fofrAI's trim and frame-rate reply added that prep helpers trim clips to 10 seconds, resize them, and reduce frame rate to 24 fps rather than increasing it.

The model lineup is also narrower than the branding might suggest. OfficialLoganK's reply said only a Flash version of Omni is available across products right now, with no parallel higher-tier Omni variant on launch day.

Hands-on posts surfaced one more limitation: DynamicWebPaige's translation demo found that video translations kept the speaker's voice, but also kept an American accent across German, French, and Hindi outputs.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 6 threads
TL;DR4 posts
What shipped3 posts
Nano Banana 2 Lite5 posts
Gemini Omni Flash4 posts
Where it shows up4 posts
Early caveats2 posts
Share on X