Skip to content
AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Filters

Category

Tags

Breaking

ElevenLabs launches Speech Engine at 8¢ per minute for chat-to-voice agents

ElevenLabs launched Speech Engine, a layer that adds transcription, speech synthesis, turn-taking, and interruption handling on top of an existing chat agent. The release pairs SDKs, one-command setup, and 8¢-per-minute pricing for production voice agents.

ElevenLabs launches Speech Engine at 8¢ per minute for chat-to-voice agents
New
Voice Agents·20th May·3 min read
Breaking

Thinking Machines introduces interaction models with 200 ms full-duplex audio, video, and tool use

Thinking Machines previewed interaction models that process audio, video, and text in 200 ms micro-turns, letting the system listen, speak, and react at the same time. The demos matter because the interaction loop is trained into the model instead of stitched together from separate speech and tool layers.

Thinking Machines introduces interaction models with 200 ms full-duplex audio, video, and tool use
New
Multimodal·1w ago·6 min read
Breaking

OpenAI adds GPT-Realtime-2, Translate, and Whisper to the Realtime API

OpenAI added GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper to the Realtime API. The update gives voice agents live reasoning, translation, and transcription, but it remains API-only rather than part of ChatGPT voice mode.

OpenAI adds GPT-Realtime-2, Translate, and Whisper to the Realtime API
New
Voice Agents·2w ago·6 min read
See all stories →

Briefs forMay 21

AI Primer mascot

Daily AI Digest

Get the best stories delivered
to your inbox

Skills Spotlighttop by stars

AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.