Skip to content
AI Primer
release

Exa launches Agent API at less than half the cost of GPT-5.5 and Opus

Exa launched Agent, an API that combines its search stack, mixed-model orchestration, and agent harness for deep web research. Exa says it can handle Opus- and GPT-5.5-class browsing tasks at less than half the cost.

3 min read
Exa launches Agent API at less than half the cost of GPT-5.5 and Opus
Exa launches Agent API at less than half the cost of GPT-5.5 and Opus

TL;DR

You can jump straight to the launch post, and Exa spent most of the announcement budget on a specific claim: its harness plus search stack can hit frontier browsing quality without paying frontier model prices. The benchmark list is unusually explicit for a short launch thread, and ExaAILabs' use-case post says the target workloads already include finance and go-to-market research.

Mixed-model harness

The core product move is packaging Exa's own search and content-retrieval stack behind an agent endpoint instead of selling raw retrieval primitives. ExaAILabs' benchmark post says Agent combines an "agent-first" search engine, token-efficient contents, and a harness designed for efficiency.

That framing matters because Exa is not pitching a single new base model. The launch post says /agent orchestrates "a mixture of cost-effective models," which puts the product closer to a research harness with model routing than to a standalone model release.

Benchmarks and workloads

Exa attached the launch to two kinds of evidence:

The interesting part is the combination. Deep research APIs usually show up as premium, slow, expensive surfaces. WilliamBryk's thread instead frames Agent as "close to the cost of a search API," which is a much more direct pitch at teams already paying for search and retrieval infrastructure.

API availability

The release was not a waitlist announcement. ExaAILabs' availability post says Agent is available in the API today, with links to start building and the blog post.

That post also clarifies the product boundary: this is an API launch first, not a consumer-facing research app. The announcement language stays focused on builders, benchmarks, and endpoint economics, not on a chat surface or bundled UI.

Share on X