AI Primer

ElevenLabs launches Speech Engine at 8¢ per minute for chat-to-voice agents

ElevenLabs launched Speech Engine, a layer that adds transcription, speech synthesis, turn-taking, and interruption handling on top of an existing chat agent. The release bundles SDKs, one-command setup, and 8¢-per-minute pricing for production voice agents.


TL;DR

The product page, the Speech Engine cookbook, and the ai.engineer London walkthrough cover the launch in detail. The interesting part is how little of the existing agent stack ElevenLabs asks teams to change: ElevenLabs' integration post says the text agent stays as it is, while ElevenLabsDevs' server SDK post says sendResponse() already accepts streamed output from three major model vendors.

Speech Engine

ElevenLabs is selling this as an overlay, not a new agent framework.

The core pitch is consistent across both launch threads:

  • keep the existing chat agent and LLM logic in place
  • let ElevenLabs handle speech-to-text and text-to-speech
  • add turn-taking and interruption handling in the middle
  • stream replies back over a WebSocket connection

That makes this a fairly opinionated voice runtime for teams that already have a text agent and do not want to rewire the stack around a different orchestration system.
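
To make the overlay idea concrete, here is a minimal Python sketch of the pattern described above: an unmodified text agent wrapped by a voice layer that transcribes input, streams the reply to speech, and stops speaking when the caller interrupts. Every name in it (text_agent, VoiceOverlay, handle_turn, interrupt) is hypothetical and chosen for illustration; none comes from the ElevenLabs SDK, and the transcribe and speak callables are stand-ins for whatever speech-to-text and text-to-speech services sit on either side. A real deployment would presumably detect interruptions from live audio and stream over a WebSocket rather than flip a flag in-process.

```python
from typing import Callable, Iterator, List


def text_agent(user_text: str) -> Iterator[str]:
    """Hypothetical stand-in for an existing chat agent that streams its reply."""
    for word in f"You said: {user_text}".split():
        yield word + " "


class VoiceOverlay:
    """Illustrative wrapper: adds a voice layer without changing the agent."""

    def __init__(
        self,
        agent: Callable[[str], Iterator[str]],
        transcribe: Callable[[bytes], str],  # assumed speech-to-text hook
        speak: Callable[[str], None],        # assumed text-to-speech hook
    ) -> None:
        self.agent = agent
        self.transcribe = transcribe
        self.speak = speak
        self.interrupted = False

    def interrupt(self) -> None:
        """Signal that the user started talking over the agent (barge-in)."""
        self.interrupted = True

    def handle_turn(self, audio: bytes) -> List[str]:
        """Run one turn and return the chunks that were actually spoken."""
        self.interrupted = False
        user_text = self.transcribe(audio)
        spoken: List[str] = []
        for chunk in self.agent(user_text):
            if self.interrupted:
                break  # drop the rest of the reply once the user cuts in
            self.speak(chunk)
            spoken.append(chunk)
        return spoken


if __name__ == "__main__":
    # Fake "audio" is just UTF-8 text here; speak() prints instead of synthesizing.
    overlay = VoiceOverlay(text_agent, lambda a: a.decode("utf-8"), print)
    overlay.handle_turn(b"hello there")
```

The point of the design is that text_agent never learns it is being voiced: the overlay owns the turn boundary and the interruption flag, which is the division of labor the launch threads describe.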

SDKs and scaffolding

The implementation details are more concrete than the headline.

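The launch materials break the developer path into a few pieces: SDKs, one-command project setup, and a server SDK whose sendResponse() accepts streamed output from three major model vendors.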

Languages, pricing, and migration

The rest of the launch fills in the production story.

According to ElevenLabs' languages post and transcription post, Speech Engine supports expressive voices in more than 70 languages, with transcription tuned for conversational latency and messy real-world audio. Pricing starts at 8 cents per minute through ElevenAPI, with lower rates at scale, per ElevenLabs' pricing post.
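
For budgeting, the starting rate translates into a simple back-of-the-envelope estimate. The Python sketch below assumes usage is billed linearly at the 8-cent list rate; the volume discounts mentioned in the pricing post are not modeled because their tiers are not given here, and the function name and parameters are illustrative only.

```python
from decimal import Decimal

# Starting list rate from the launch post; discounted tiers are not modeled.
LIST_RATE_USD_PER_MIN = Decimal("0.08")


def estimate_monthly_cost(
    calls_per_day: int,
    avg_minutes_per_call: Decimal,
    days: int = 30,
) -> Decimal:
    """Estimate monthly spend in USD, assuming linear per-minute billing."""
    total_minutes = calls_per_day * avg_minutes_per_call * days
    return (total_minutes * LIST_RATE_USD_PER_MIN).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # 500 calls a day at 3 minutes each is 45,000 minutes over 30 days.
    print(estimate_monthly_cost(500, Decimal("3")))
```

At that volume the estimate works out to $3,600 a month at list price, before any scale discount.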

The enterprise checklist is also explicit. ElevenLabs' security post lists SOC 2, HIPAA, GDPR, EU data residency, and zero retention mode. For teams that want more than the API layer, ElevenLabs' migration post says Speech Engine projects can move into ElevenAgents later for deployment channels, monitoring, analytics, and the broader agent toolset.
