Skip to content
AI Primer
release

xAI launches Voice Agent Builder with $0.05/min pricing and SIP routing

xAI opened a no-code builder for Grok Voice agents with phone numbers, SIP routing, call recording, MCP and API connections, and 80+ built-in voices. The beta prices audio at $0.05 per minute, plus $0.01 per minute for xAI-provided telephony.

4 min read
xAI launches Voice Agent Builder with $0.05/min pricing and SIP routing
xAI launches Voice Agent Builder with $0.05/min pricing and SIP routing

TL;DR

You can watch the console flow in xAI's launch clip, skim TestingCatalog's screenshot of the new Voice Agents page, and WesRoth's walkthrough adds the deployment details that matter most for engineering teams: SIP, call transcripts, tool logs, and the telephony surcharge.

Voice Agents

The console UI in TestingCatalog's screenshot exposes Voice Agents as a beta product with prebuilt templates for customer support, sales, appointment scheduling, personal assistant, and lead qualification. The same screen also shows the broader voice stack in the sidebar: agents, text-to-speech, speech-to-text, and a voice library.

The attached demo in xAI's launch clip ends on a Deploy button, which is the clearest signal that xAI is selling this as an end-to-end surface rather than a raw model primitive.

Builder flow

Between ai_for_success's launch thread and WesRoth's walkthrough, the builder flow breaks into a short, pretty standard voice-agent stack:

  • instructions for the agent
  • connected company knowledge
  • APIs and MCP tools
  • guardrails
  • voice selection or cloning
  • phone deployment
  • recorded calls with transcripts and tool-use review

The video attached to WesRoth's walkthrough shows that last part clearly: a live phone interaction with real-time transcription and visible tool execution logs.

Pricing and routing

The pricing model is simple enough to quote in one line. Grok Voice is $0.05 per minute, with no separate platform fee, and calls placed on xAI-provided numbers add $0.01 per minute in telephony charges, according to WesRoth's pricing breakdown.

Routing is a little more interesting. WesRoth's deployment details says every account includes a phone number, while existing business numbers can be connected through SIP, which puts the product closer to a deployable contact surface than a sandbox voice demo.

Voices and connectors

The launch pitch in ai_for_success's feature rundown bundles together the two things most builders check first:

  • 25-plus supported languages
  • 80-plus built-in voices
  • optional custom voice cloning
  • Gmail integration
  • Google Calendar integration
  • Outlook integration
  • Notion integration
  • MCP support

The sample templates in TestingCatalog's console screenshot line up with that connector story. The personal assistant template explicitly points at email and calendar access from connected apps.

Outbound calling caveat

One launch wrinkle surfaced in the thread itself. In a reply about outbound calling compliance, ai_for_success said outbound calling still needs legal clearance and compliance steps, with a hope that broader support lands later.

That is not in tension with the launch materials, which focus on inbound-style phone deployment, SIP connection, and call review. It does add one concrete boundary: the beta shipped with telephony hooks, but the thread discussion suggests outbound workflows are still gated by compliance realities rather than just product switches.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 1 thread
TL;DR1 post
Share on X