Google's Gemini AI software product for chat and multimodal assistance; no separate official Google source for a distinct 'Gemini Omni' product was verifiable with the available research access.

Recent stories
Creators are using Gemini Omni to read a reference design and generate a final prompt for another video model while preserving face, voice, lip sync, and gestures. Use it to separate style translation from generation, but plan around the current 10-second output limit.
Creators compared Gemini Omni camera-path renders with Earth Studio output and shared zero-gravity, photo-roll, and other footage-edit demos. The tests matter because they frame Omni as a footage-transform and shot-planning tool, with output details still drifting between runs.
Creator posts showed Gemini Omni handling 3D camera trajectories, tracked label overlays, and character-sheet swaps from single references. That widens Omni from scene edits into repeatable previsualization and explainer workflows, though the evidence is still mostly community demos.