MODEL28 stories

Gemini

Google Gemini model family.

Stories

Google introduces Gemini Intelligence on Android with browser use, AppFunctions, and Rambler

Google unveiled Gemini Intelligence at the Android Show with cross-app task automation, Gemini in Chrome, Rambler voice cleanup, custom widgets, and AppFunctions. The rollout moves Gemini into core Android workflows on Pixel and Galaxy devices this summer.

NEWS12th May

Google DeepMind tests Gemini pointer demos in AI Studio with PDF bullets and recipe doubling

Google DeepMind published Gemini pointer experiments in AI Studio that act on whatever the cursor highlights, turning PDFs, tables, images, and recipes into direct actions. The shift matters because it moves assistant UX from separate chat panes into in-place pointing and voice commands.

RELEASE7th May

Google updates Gemini Interactions API with steps schema and Api-Revision 2026-05-26

Google is replacing the Gemini Interactions API’s older outputs-and-roles structure with a steps schema for multi-step agent workflows. The change matters because SDK upgrades, migration work, and schema assumptions in existing tooling may break before the new interface reaches GA.

RELEASE7th May

Google releases Gemini 3.1 Flash Lite GA with 1M context and $0.25 input pricing

Google moved Gemini 3.1 Flash Lite from preview to GA, and OpenRouter added the model with 1 million context and low-cost multimodal pricing. The preview endpoint now has a shutdown schedule, and users should verify whether the GA model differs from the March preview.

NEWS1w ago

Gemini API adds multimodal File Search with page citations

Google expanded Gemini API File Search to index text and images together, add custom metadata filtering, and return page-level citations. RAG builders can use it for tighter retrieval control and more auditable answers.

RELEASE1w ago

Gemini API ships Webhooks and field-level Interactions errors

Google added Webhooks to the Gemini API and upgraded Interactions API errors with exact field paths, bad values, enum lists, and type mismatches. The changes target long-running tasks and agent integrations where polling and opaque validation failures slow debugging.

RELEASE2w ago

Google AI Studio adds multi-chat and web search to Build mode

Google AI Studio added multi-chat threads and web search grounding to Build mode, so Gemini coding sessions can branch while pulling live docs into the workspace. The feature improves in-browser prototyping loops, but it is currently scoped to AI Studio rather than the Gemini API itself.

NEWS2w ago

Gemini adds Grounding with Exa for websites, docs, people, and company search

Gemini models can now use Grounding with Exa to search websites, technical docs, papers, people, and companies through Exa's index. That gives Gemini a new agent-style grounding path alongside Google's first-party search tooling.

NEWS3w ago

Google launches Gemini Enterprise Agent Platform with Agent Studio and 200+ models

Google introduced Gemini Enterprise Agent Platform as the evolution of Vertex AI, with Agent Studio, shared agent management, and Model Garden access to 200-plus models. Enterprises now get one stack for building, governing, and deploying agents across Gemini and Workspace surfaces.

RELEASE3w ago

Google launches Deep Research Max with MCP, native charts, and 85.9% BrowseComp

Google added Deep Research and Deep Research Max to the Gemini API with collaborative planning, multimodal inputs, MCP support, and native charts. The agents push cited web-plus-private-data reports into developer workflows, and Max is tuned for slower overnight runs.

NEWS3w ago

Google AI Studio adds Pro and Ultra plan support with higher quotas

Google enabled Pro and Ultra subscriptions inside AI Studio, turning consumer plans into a higher-quota bridge before direct API billing. The rollout still has quota bugs and does not yet support Workspace accounts, so check access before migrating.

RELEASE4w ago

Gemini 3.1 Flash TTS launches with Audio Tags, 70+ languages and API preview

Google released Gemini 3.1 Flash TTS with inline Audio Tags, multi-speaker control and 70+ languages, and opened preview access through the Gemini API and AI Studio with rollout to Vertex AI and Google Vids. Independent evals ranked it near the top of current speech leaderboards, but it runs slower and costs more than the leading system.

RELEASE4w ago

Google DeepMind releases Gemini Robotics-ER 1.6 with 93% instrument reading

Google DeepMind shipped Gemini Robotics-ER 1.6 to the Gemini API and AI Studio with better visual-spatial reasoning, multi-view success detection, and gauge reading. The model's 93% instrument-reading score targets robots that need to reason over cluttered scenes and physical constraints.

RELEASE1mo ago

Google releases Veo 3.1 Lite in Gemini API at $0.05 per second

Google released Veo 3.1 Lite in Gemini API and AI Studio with 720p and 1080p output, 4-8 second clips, and text-to-video plus image-to-video support. Watch the April 7 Veo 3.1 Fast pricing drop if you need lower video generation costs.

RELEASE1mo ago

hankweave adds harness switching for Agents SDK, Codex, and Gemini aliases

Hankweave added short aliases that route the same prompt and code job into Anthropic's Agents SDK, Codex, or Gemini-style harnesses with unified logs and control. The release treats harness choice as a first-class variable instead of forcing teams to rebuild orchestration for each model stack.

RELEASE1mo ago

Gemini 3.1 Flash Live launches with 90.8% audio tool-use score and 128K context

Google launched Gemini 3.1 Flash Live in AI Studio, the API, and Gemini Live with stronger audio tool use, lower latency, and 128K context. Voice-agent teams should benchmark quality, latency, and thinking settings before switching.

RELEASE1mo ago

ARC Prize launches ARC-AGI-3: Gemini 3.1 Pro scores 0.37%

ARC-AGI-3 swaps static puzzles for interactive game-like environments and posts initial frontier scores below 1%, with Gemini 3.1 Pro at 0.37%. Teams can use it to inspect agent reasoning, but score interpretation still depends heavily on the human-efficiency metric and no-harness setup.

RELEASE1mo ago

Google launches Lyria 3 Pro API at $0.08 per song

Lyria 3 Pro and Lyria 3 Clip are now in Gemini API and AI Studio, with Lyria 3 Pro priced at $0.08 per song and able to structure tracks into verses and choruses. That gives developers a clearer path to longer-form music features, with watermarking and prompt design built in.

RELEASE1mo ago

Gemini API adds OpenAI-compatible Veo 3.1 video and image endpoints

Google extended its OpenAI compatibility layer so existing OpenAI SDK code can call Veo 3.1 video generation and Gemini image models with only base URL and model changes. It lowers migration cost for teams that want multimodal fallbacks without rewriting client code.

RELEASE1mo ago

Google AI Studio updates Build with Antigravity and one-click Firebase

Google rebuilt AI Studio Build around the Antigravity coding agent, one-click Firebase auth and databases, multiplayer backends, and persistent sessions. It pushes AI Studio closer to production app scaffolding and gives Firebase Studio users a clear migration path.

RELEASE1mo ago

Gemini API adds one-call tool chaining and Maps grounding for Gemini 3

Google now lets Gemini chain built-in tools like Search, Maps, File Search, and URL Context with custom functions inside a single API call. This removes orchestration glue for agent builders and brings Maps grounding into AI Studio for faster prototyping.

NEWS1mo ago

Google adds Grounding with Google Maps to AI Studio UI for Gemini APIs

Google is adding Grounding with Google Maps to AI Studio’s UI, and a Google reply says the Maps grounding capability already exists in the API. If you build location-heavy Gemini apps, start designing around map lookups instead of stitching search and geocoding manually.

NEWS2mo ago

Report: Meta reportedly delays Avocado to May after reasoning and coding tests trail Gemini 3.0

Reports say Meta pushed Avocado from March to at least May after internal reasoning, coding, and writing tests missed current frontier targets. Expect more delayed launches at the top end, and watch for products that route some features through competitor models.

RELEASE2mo ago

Google releases Gemini Embedding 2 preview with one vector space for text, image, video, audio, and PDFs

Google launched Gemini Embedding 2 in preview, unifying multiple modalities and 100+ languages in one embedding space with flexible output dimensions. Try it to simplify cross-modal RAG and search pipelines, but compare it with late-interaction systems before committing.

NEWS2mo ago

Google adds Gemini API spend caps in AI Studio with project-level dollar limits

Google AI Studio now lets developers set experimental per-project spend caps for Gemini API usage. Use it as a native billing guardrail, but account for roughly 10-minute enforcement lag and possible batch-job overshoot.

NEWS2mo ago

Google Workspace adds Gemini agents to Docs, Sheets, Slides, and Drive for Google AI Pro and Ultra

Google rolled Gemini deeper into Docs, Sheets, Slides, and Drive, including spreadsheet task execution, editable slide generation, and grounded file search. Teams should test approval flows, context sourcing, and current feature limits before broad rollout.

RELEASE2mo ago

Gemini Embedding 2 enters preview with 8,192-token multimodal vectors and 3,072-dim outputs

Google put Gemini Embedding 2 into public preview with one vector space for text, images, video, audio, and PDFs, plus 3072, 1536, and 768 output sizes. Use it to replace multi-model retrieval pipelines with one API for RAG and cross-media search.

NEWS2mo ago

Google LiteRT-LM PR adds Gemma4 NPU support ahead of an expected release

A Google bot-authored LiteRT-LM pull request references Gemma4 and AIcore NPU support, while multiple posts claim a largest version around 120B total and 15B active parameters. Engineers targeting on-device inference should wait for a formal model card before locking plans.