Skip to content
AI Primer

An AI audio platform for generating, cloning, and transforming speech and other voice content.

Screenshot of ElevenLabs website

Recent stories

7 linked stories
releasePRIMARY2026-06-15
ElevenAPI launches Music v2 with inpainting and 15¢-per-minute pricing

ElevenLabs launched Music v2 on ElevenAPI with track generation, reference matching, inpainting, and multilingual output. It gives developers a priced API for commercial music creation and section-level editing.

newsSECONDARY2026-05-28
Artificial Analysis launches AA-WER Streaming with Cartesia Ink-2 at 3.7% WER

Artificial Analysis launched AA-WER Streaming to benchmark streaming speech-to-text models on accuracy and latency for voice agents. The first leaderboard puts Cartesia Ink-2 and ElevenLabs Scribe v2 on the price-latency frontier, so teams should compare cost against latency before choosing a model.

releasePRIMARY2026-05-20
ElevenLabs launches Speech Engine at 8¢ per minute for chat-to-voice agents

ElevenLabs launched Speech Engine, a layer that adds transcription, speech synthesis, turn-taking, and interruption handling on top of an existing chat agent. The release pairs SDKs, one-command setup, and 8¢-per-minute pricing for production voice agents.

newsPRIMARY2026-05-07
ElevenLabs cuts Flash TTS 55%, Scribe 45%, and Agents 20% with pay-as-you-go billing

ElevenLabs lowered self-serve pricing for ElevenAPI and ElevenAgents and added pay-as-you-go billing. The biggest listed drops are to $0.05 per 1,000 tokens for Flash TTS, $0.22 for Scribe v2 speech-to-text, and $0.08 per minute for agent calls.

newsPRIMARY2026-04-28
ElevenLabs releases Agent Templates with 50+ support, SDR, and training workflows

ElevenLabs launched Agent Templates, a library of pre-configured conversational agents for support, education, sales, and internal enablement. That shortens the setup path for teams that want to deploy voice or chat agents without starting from a blank flow.

releasePRIMARY2026-04-09
ElevenLabs adds on-prem and on-device deployment options

ElevenLabs added on-prem and on-device deployment options alongside its existing VPC and cloud paths for the voice stack. The rollout gives government, automotive, and edge teams more data-boundary choices, with VPC available now and the new modes in early access.

releaseSECONDARY2026-03-29
Mistral releases Voxtral TTS with 3-second cloning and 68.4% win rate vs ElevenLabs Flash v2.5

Voxtral TTS uses separate semantic and acoustic token models, a 2.14 kbps codec, and 3-25 second reference audio for cloning across nine languages. Try it if you want a hybrid speech pipeline with more control and faster acoustic synthesis than all-autoregressive generation.

AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.