Multimodal Speech To Text Voice Agents Voice AI ElevenLabs Enterprise Adoption Reliability Voxtral Mistral Agent Product Launch DX Tooling DX Cost Realtime AI Benchmarks

ElevenLabs

AI audio platform

Visit site

An AI audio platform for generating, cloning, and transforming speech and other voice content.

Recent stories

7 linked stories

releasePRIMARY2026-06-15

ElevenAPI launches Music v2 with inpainting and 15¢-per-minute pricing

ElevenLabs launched Music v2 on ElevenAPI with track generation, reference matching, inpainting, and multilingual output. It gives developers a priced API for commercial music creation and section-level editing.

newsSECONDARY2026-05-28

Artificial Analysis launches AA-WER Streaming with Cartesia Ink-2 at 3.7% WER

Artificial Analysis launched AA-WER Streaming to benchmark streaming speech-to-text models on accuracy and latency for voice agents. The first leaderboard puts Cartesia Ink-2 and ElevenLabs Scribe v2 on the price-latency frontier, so teams should compare cost against latency before choosing a model.

releasePRIMARY2026-05-20

ElevenLabs launches Speech Engine at 8¢ per minute for chat-to-voice agents

ElevenLabs launched Speech Engine, a layer that adds transcription, speech synthesis, turn-taking, and interruption handling on top of an existing chat agent. The release pairs SDKs, one-command setup, and 8¢-per-minute pricing for production voice agents.

newsPRIMARY2026-05-07

ElevenLabs cuts Flash TTS 55%, Scribe 45%, and Agents 20% with pay-as-you-go billing

ElevenLabs lowered self-serve pricing for ElevenAPI and ElevenAgents and added pay-as-you-go billing. The biggest listed drops are to $0.05 per 1,000 tokens for Flash TTS, $0.22 for Scribe v2 speech-to-text, and $0.08 per minute for agent calls.

newsPRIMARY2026-04-28

ElevenLabs releases Agent Templates with 50+ support, SDR, and training workflows

ElevenLabs launched Agent Templates, a library of pre-configured conversational agents for support, education, sales, and internal enablement. That shortens the setup path for teams that want to deploy voice or chat agents without starting from a blank flow.

releasePRIMARY2026-04-09

ElevenLabs adds on-prem and on-device deployment options

ElevenLabs added on-prem and on-device deployment options alongside its existing VPC and cloud paths for the voice stack. The rollout gives government, automotive, and edge teams more data-boundary choices, with VPC available now and the new modes in early access.

releaseSECONDARY2026-03-29

Mistral releases Voxtral TTS with 3-second cloning and 68.4% win rate vs ElevenLabs Flash v2.5

Voxtral TTS uses separate semantic and acoustic token models, a 2.14 kbps codec, and 3-25 second reference audio for cloning across nine languages. Try it if you want a hybrid speech pipeline with more control and faster acoustic synthesis than all-autoregressive generation.