Skip to content
AI Primer
workflow

Gemini Omni tests avatar, scene-mood, and object edits in creator workflows

Creators used Gemini Omni in Flow for avatar generation, weather and style transformations, annotation overlays, and object edits, while others posted failures and quality gaps. Treat it as a transformation and editing model rather than a direct Seedance replacement.

6 min read
Gemini Omni tests avatar, scene-mood, and object edits in creator workflows
Gemini Omni tests avatar, scene-mood, and object edits in creator workflows

TL;DR

You can watch Google's launch thread, skim the Flow update post, and see the product surfaces on stage slides from I/O. The weird split showed up fast: one creator got clean day, style, and weather swaps, another changed hats on each clap, and someone else immediately broke the mirror VFX. There is also already a small pile of creator-native patterns, from avatar capture to annotation overlays to building custom tools inside Flow.

What shipped

Google's own framing was broad: a model that can "create anything from any input," starting with video, with character consistency, reference-based styling, and video reimagining as the first visible behaviors.

The concrete day-one surfaces were simple:

Google also kept using the "Nano Banana for video" shorthand through creators and execs, including OfficialLoganK's post, but the hands-on clips quickly made it clear that the editing surface is the part people grabbed first.

Avatars and character consistency

The cleanest creator win so far is identity persistence. venturetwins' avatar demo showed a single captured face and voice reused across multiple scenes, and ozansihay's Flow screenshot showed that Flow now has a dedicated avatar capture flow.

The early character-consistency examples clustered around four repeatable patterns:

  1. Record one selfie clip, then reuse that person as a character, according to venturetwins' avatar demo.
  2. Move the same character across lighting, weather, and style changes, as in ai_artworkgen's test.
  3. Swap outfits or scene context while keeping the subject recognizable, per GoogleDeepMind's character-consistency post.
  4. Generate fast avatar-based clips inside Flow, though they cost credits, according to bennash's avatar post.

The strongest public example was ai_artworkgen's four-way test, which kept the same subject while changing the clip to nighttime, 3D animation, and a snow scene. That kind of continuity is more useful to ad creatives and short-form teams than another pure text-to-video beauty shot.

Conversational edits

A lot of the better Omni demos started with existing footage, not a blank prompt. venturetwins' clap-to-hat demo used a live-action clip and asked for a timed subject edit, while ai_artworkgen's follow-up ran two very different passes on the same source video: a Blair Witch found-footage remix and a labeled geology explainer.

That editing behavior breaks down into a few distinct modes:

That is why several creators, including MayorKingAI and bilawalsidhu, kept arguing that Seedance is the wrong comparison frame. The clips getting shared hardest are edit passes and remixes.

Where it breaks

The failure reports are not subtle. bennash's clothing-edit complaint said patriotic outfit swaps would not go through at all, techhalla's mirror VFX test produced a visibly wrong result, and bennash's broader complaint called Omni itself a bust even while praising Flow's character tools.

The more technical complaints were about control, not raw wow factor:

That leaves a pretty specific early picture: Omni looks strongest when the ask is "transform this clip" or "keep this character, change the world," and weaker when the ask is "obey this dense production spec exactly."

The creator playbook already has a shape

The most interesting part is how quickly creators converged on reusable patterns. minchoi's roundup condensed the first day into a ten-item list, and venturetwins' feature thread mapped five hands-on behaviors that lined up closely with the official product framing.

Across those posts, the repeatable workflow menu already looks like this:

  1. Avatar from one selfie video, per minchoi's avatar example.
  2. Remove objects from live-action footage, per minchoi's remove-objects example.
  3. Add props or effects, like fireworks, per minchoi's fireworks example.
  4. Replace a character or shift scene mood, per minchoi's character-replace example.
  5. Change camera angle, per minchoi's angle-change post.
  6. Zoom from artwork into impossible macro worlds, per minchoi's Mona Lisa example.
  7. Generate explainers or short concept pieces from a single prompt, as in chrisfirst's explainer-video post.
  8. Push rapid visual transitions in POV footage, per egeberkina's biking-video test.

Flow is also starting to look like the real product wrapper around those behaviors. GoogleDeepMind's Flow update tied Omni to batch editing and improved character consistency, tranmautritam's post showed people using Flow to vibe-code custom creative tools, and bennash's avatar note added a practical caveat that fast avatar generations already consume credits.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 6 threads
TL;DR6 posts
What shipped2 posts
Avatars and character consistency3 posts
Conversational edits5 posts
Where it breaks5 posts
The creator playbook already has a shape10 posts
Share on X