Skip to content
AI Primer
breaking

Google DeepMind tests Gemini pointer demos in AI Studio with PDF bullets and recipe doubling

Google DeepMind published Gemini pointer experiments in AI Studio that act on whatever the cursor highlights, turning PDFs, tables, images, and recipes into direct actions. The shift matters because it moves assistant UX from separate chat panes into in-place pointing and voice commands.

3 min read
Google DeepMind tests Gemini pointer demos in AI Studio with PDF bullets and recipe doubling
Google DeepMind tests Gemini pointer demos in AI Studio with PDF bullets and recipe doubling

TL;DR

You can try the experiments through Google AI Studio, read the official DeepMind blog post, and cross-check the Chrome angle in Google's Gemini in Chrome post. GoogleDeepMind's main demo shows the pointer turning highlighted text into a tweet draft, while GoogleDeepMind's second demo uses the same pattern on handwritten notes and paused video frames.

Pointing as context

The core claim is simple: the cursor stops being a location marker and becomes a selector for model context. In GoogleDeepMind's code-block example, hovering over part of a code snippet is enough to trigger an explanation flow for that exact block.

Google's examples all follow the same interaction loop:

  • point at something on screen
  • let Gemini infer the local context
  • issue a short spoken or typed command
  • get the result inline, next to the pointer or in a side panel

That is a cleaner UX than the usual copy-paste into a chat box, which is why kimmonismus' reaction immediately framed it as a possible break from classic chatbot windows.

Shorthand actions

The interesting part is the action inventory. Across GoogleDeepMind's launch thread and GoogleDeepMind's follow-up, the demos show at least six concrete transforms:

  1. PDF passage to bullet points for an email
  2. Table to pie chart
  3. Recipe to doubled ingredient list
  4. Scribbled note to interactive to-do list
  5. Paused restaurant video frame to booking link
  6. Selected text to tweet draft

Google also says the interface is built around natural shorthand instead of full prompts. In GoogleDeepMind's shorthand explanation, the examples are short imperative phrases like "fix this," "move that," and "book this table," with the pointer supplying the missing nouns.

AI Studio and Chrome

The product status is still experimental. demishassabis called it a prototype, while GoogleDeepMind's thread sends users to Google AI Studio rather than announcing a standalone product.

One extra wrinkle is surface area. koltregaskes' roundup pairs the pointer blog with Google's separate Gemini in Chrome post, which suggests the broader story is Gemini moving closer to the browser and the cursor, not just a one-off research video.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 1 thread
Pointing as context1 post
Share on X