Google's next generation family of multimodal generative AI models
Gemini is Google's next generation family of multimodal generative AI models, available through the Gemini API and Google AI Studio.
No exact model 'Gemini' with unified public pricing; pricing varies by Gemini variant (e.g., Gemini 2.5 Pro ranges $1–2.5 input/$6–15 output per 1M tokens). Platform, tier, and subscription (Google AI Pro $19.99/mo) also affect rates.
Model family pricing is available, but no per-token rate for generic 'Gemini'; see linked Google docs for latest details.
Google says its new realtime voice model improves noisy-environment understanding, long conversations and function calling, and it's rolling into Gemini Live, Search Live and AI Studio. Voice creators can test it for lower-latency spoken interactions.
Google is rolling out Lyria 3 Pro for full songs and Lyria 3 Clip for 30-second generations in the Gemini API and AI Studio. Musicians can now map intros, verses, choruses and bridges instead of stitching short music clips together.
Glass says its Mac editor can tap existing Claude, ChatGPT and Gemini subscriptions inside one coding workspace, avoiding separate API keys and usage meters. Compare the flat-subscription workflow against Cursor-style billing before you move a product build.
SentrySearch uses Gemini's native video embeddings to index footage without transcription, find matching scenes fast, and trim clips automatically. Editors can move from natural-language search to selects, rough cuts and future EDL exports with less manual logging.
A filmmaker shared a seven-step pipeline that uses Gemini for research, Nano Banana Pro for consistent scenes, Kling for image-to-video, Veo for speaking shots, and CapCut for finish. The sequence is useful if you want research, references, motion, and sound separated into controllable stages.
Photoshop on the web added AI Assistant beta for chat-based edits, and Adobe also rolled out AI Markup targeting, Firefly sync, speech generation, and Topaz Astra upscaling. Try it to rough in edits faster, target exact regions, and move drafts toward polished assets with fewer manual steps.