AI Primer

DiffusionGemma

An experimental open model that explores an exceptionally fast approach to text generation

DiffusionGemma is an experimental open-weights generative model family from Google DeepMind, based on the 26B A4B Mixture-of-Experts Gemma 4 architecture. It uses discrete text diffusion to generate blocks of tokens in parallel. It accepts text, image, and video inputs and produces text output, with a context window of up to 256K tokens.
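
As a rough illustration of what generating a block of tokens in parallel can look like, the toy sketch below fills a block of masked positions over a few refinement steps, committing the most confident proposals at each step. It is not DiffusionGemma's actual algorithm or API: `toy_denoiser`, `generate_block`, the `<mask>` token, and the random confidence scores are all hypothetical stand-ins for a real model.

```python
import random

MASK = "<mask>"  # hypothetical placeholder for a not-yet-generated position


def toy_denoiser(block, vocab, rng):
    """Hypothetical stand-in for a model.

    Proposes a (token, confidence) pair for every masked slot at once.
    A real model would score all positions in a single forward pass.
    """
    proposals = {}
    for i, tok in enumerate(block):
        if tok == MASK:
            proposals[i] = (rng.choice(vocab), rng.random())
    return proposals


def generate_block(block_len, steps, vocab, seed=0):
    """Fill a fully masked block in `steps` parallel refinement rounds."""
    rng = random.Random(seed)
    block = [MASK] * block_len
    for step in range(steps):
        proposals = toy_denoiser(block, vocab, rng)
        if not proposals:
            break  # nothing left to fill
        # Commit an even share of the remaining masked slots this round,
        # so the final round always fills whatever is left.
        remaining_steps = steps - step
        k = -(-len(proposals) // remaining_steps)  # ceiling division
        most_confident = sorted(
            proposals, key=lambda i: proposals[i][1], reverse=True
        )[:k]
        for i in most_confident:
            block[i] = proposals[i][0]
    return block
```

Calling `generate_block(8, 4, ["the", "cat", "sat"])` fills eight positions in four rounds, two positions per round, rather than one token at a time as in left-to-right decoding.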

Pricing

Official site · Jul 2, 2026, 7:01 AM

Official Google and Google DeepMind materials publish no usage price for DiffusionGemma. The official announcement states no per-token, per-image, or subscription rate, and the model appears to be offered as open weights rather than as a priced hosted API. Pricing is therefore recorded as non-normalized.


Model Intelligence

Context window: 256,000 tokens
Benchmarkable: No
Model level: family



© 2026 AI Primer. All rights reserved.