MODEL10 stories

Gemini

Stories, products, and related signals connected to this tag in Explore.

Stories

Google reportedly tests Gemini Omni video editing with chat remix and templates

Multiple posts preview a Google video model called Gemini Omni with remix, templates, and chat editing, plus demos that keep chalkboard math readable. The clips are still unofficial, but creators are watching the text-fidelity claim closely.

WORKFLOW6th May

Google Stitch supports 550 monthly UI generations with HTML, React, and Figma exports

A creator thread resurfaced Google Stitch as a free Labs tool that turns detailed prompts into prototypes and exports HTML, CSS, Tailwind, React, and Figma files. The prompt pack matters because it shows designers can move from one-line brief to landing pages, auth flows, dashboards, and pricing screens without starting in Figma.

RELEASE4w ago

Gemini 3.1 Flash TTS adds Audio Tags, 70-language support, and SynthID

Gemini 3.1 Flash TTS added Audio Tags, 70-plus language support, and SynthID watermarking for generated speech. The preview spans Gemini API, AI Studio, Vertex AI, and Google Vids, so teams can test delivery control before adopting it.

WORKFLOW1mo ago

Indie builders add AI gateway wrappers for per-user limits, GCP key audits, and provider routing

Three builder threads shared reusable layers around model APIs: per-user usage gateways, audits for Gemini-enabled GCP keys, and config-driven routing that swaps providers without app rewrites. Wrapping rate limits, key scope, and model choice in one layer helps teams ship multi-user apps without scattering provider logic.

RELEASE1mo ago

Gemini 3.1 Flash Live launches with 90.8% ComplexFuncBench audio score

Google says its new realtime voice model improves noisy-environment understanding, long conversations and function calling, and it's rolling into Gemini Live, Search Live and AI Studio. Voice creators can test it for lower-latency spoken interactions.

NEWS1mo ago

Glass launches Mac editor that connects Claude, ChatGPT and Gemini without API keys

Glass says its Mac editor can tap existing Claude, ChatGPT and Gemini subscriptions inside one coding workspace, avoiding separate API keys and usage meters. Compare the flat-subscription workflow against Cursor-style billing before you move a product build.

RELEASE1mo ago

Lyria 3 Pro launches 3-minute song tools in Gemini API and AI Studio

Google is rolling out Lyria 3 Pro for full songs and Lyria 3 Clip for 30-second generations in the Gemini API and AI Studio. Musicians can now map intros, verses, choruses and bridges instead of stitching short music clips together.

RELEASE1mo ago

SentrySearch ships Gemini video search with in-out timestamps and ffmpeg clip trimming

SentrySearch uses Gemini's native video embeddings to index footage without transcription, find matching scenes fast, and trim clips automatically. Editors can move from natural-language search to selects, rough cuts and future EDL exports with less manual logging.

RELEASE1mo ago

Google AI Studio adds Antigravity agent, multiplayer apps, and one-click database support

Google rolled out a Build upgrade with backend support, Google sign-in, multiplayer, and an Antigravity coding agent. Creatives can prototype collaborative apps faster, with design mode and Figma integration already on the roadmap.

WORKFLOW1mo ago

Gemini, Nano Banana Pro, Kling, and Veo power a 7-step historical short-film workflow

A filmmaker shared a seven-step pipeline that uses Gemini for research, Nano Banana Pro for consistent scenes, Kling for image-to-video, Veo for speaking shots, and CapCut for finish. The sequence is useful if you want research, references, motion, and sound separated into controllable stages.