AI audio platform for text-to-speech, voice cloning, speech-to-speech, dubbing, and conversational voice agents.

Recent stories
ElevenLabs lowered self-serve pricing for ElevenAPI and ElevenAgents and added pay-as-you-go billing. The largest listed reductions bring Flash TTS to $0.05 per 1,000 tokens, Scribe v2 speech-to-text to $0.22, and agent calls to $0.08 per minute.
ElevenLabs launched Agent Templates, a library of pre-configured conversational agents for support, education, sales, and internal enablement. That shortens the setup path for teams that want to deploy voice or chat agents without starting from a blank flow.
ElevenLabs added on-prem and on-device deployment options alongside its existing VPC and cloud paths for the voice stack. The rollout gives government, automotive, and edge teams more data-boundary choices, with VPC available now and the new modes in early access.
Voxtral TTS uses separate semantic and acoustic token models, a 2.14 kbps codec, and 3 to 25 seconds of reference audio for voice cloning across nine languages. Try it if you want a hybrid speech pipeline with more control and faster acoustic synthesis than all-autoregressive generation.
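
For teams sizing a budget against the ElevenLabs pricing story above, the arithmetic can be sketched as follows. This is a rough illustration, not an official calculator: it uses only the two rates quoted with explicit units ($0.05 per 1,000 tokens for Flash TTS and $0.08 per minute for agent calls), the helper name `estimate_monthly_cost` is hypothetical, and it assumes flat pay-as-you-go billing with no tiers, minimums, or taxes.

```python
from decimal import Decimal

# Rates as quoted in the pricing story; units follow that text as written.
FLASH_TTS_PER_1K_TOKENS = Decimal("0.05")  # USD per 1,000 tokens
AGENT_CALL_PER_MINUTE = Decimal("0.08")    # USD per minute of agent calls


def estimate_monthly_cost(tts_tokens: int, agent_minutes: int) -> Decimal:
    """Hypothetical helper: flat pay-as-you-go estimate in USD.

    Assumes no volume tiers, minimum charges, or taxes.
    """
    tts_cost = FLASH_TTS_PER_1K_TOKENS * Decimal(tts_tokens) / Decimal(1000)
    agent_cost = AGENT_CALL_PER_MINUTE * Decimal(agent_minutes)
    # Round to whole cents for display.
    return (tts_cost + agent_cost).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Example workload: 2,000,000 TTS tokens and 1,500 agent minutes.
    print(estimate_monthly_cost(tts_tokens=2_000_000, agent_minutes=1_500))
```

Decimal is used instead of float so that cent-level totals do not pick up binary rounding noise. Actual invoices may differ if the published price list applies conditions not mentioned in the story.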