TOPIC · 40 stories

Search

Search systems, retrieval quality, and query handling.

Stories

Firecrawl launches Research Index with /search/research API and 18% arXivQA recall gain

Firecrawl released a research-specific search index with 3M+ arXiv papers, GitHub artifacts, and a /search/research interface across API, CLI, MCP, and SDKs. It combines literature retrieval, claim verification, and code lookup in one surface for research agents.

RELEASE · 1w ago

Exa launches Agent API at less than half the cost of GPT-5.5 and Opus

Exa launched Agent, an API that combines its search stack, mixed-model orchestration, and agent harness for deep web research. Exa says it can handle Opus- and GPT-5.5-class browsing tasks at less than half the cost.

RELEASE · 2w ago

Perplexity Computer adds native Deep Research with Search as Code

Perplexity made Deep Research a native skill inside Computer and tied it to the same harness, long-running sandboxes, tools, connectors, and licensed data. The update collapses multi-step research into one persistent agent interface instead of a separate mode.

RELEASE · 3w ago

Perplexity Computer adds hybrid agentic inference with local-cloud model splits

Perplexity said Computer will split tasks between on-device models and frontier cloud models, keeping some data on the local machine while escalating harder work remotely. That matters for privacy-sensitive workflows and for reducing token-heavy cloud usage on laptop-class hardware.

RELEASE · 3w ago

Perplexity launches Search as Code in Agent API with WANDR 0.386 and Python search pipelines

Perplexity replaced one-shot search calls with Search as Code, a Python-based search runtime in its Agent API that is also now the default in Computer. The change matters because agents can batch, rank, filter, and aggregate search steps inside code, and Perplexity says the system scored 0.386 on WANDR versus 0.152 for the next system.
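The batch, filter, aggregate, and rank pattern described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not Perplexity's runtime or API: `Hit`, `fake_search`, and `run_pipeline` are hypothetical names, and the search backend is an in-memory stub with made-up example URLs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    url: str
    title: str
    score: float


# Hypothetical in-memory stand-in for a real search backend.
_CORPUS = {
    "vector db pricing": [
        Hit("https://a.example/pricing", "Pricing overview", 0.9),
        Hit("https://b.example/blog", "Blog post", 0.4),
    ],
    "vector db benchmarks": [
        Hit("https://a.example/pricing", "Pricing overview", 0.7),
        Hit("https://c.example/bench", "Benchmark report", 0.8),
    ],
}


def fake_search(query: str) -> list[Hit]:
    """Return canned hits for a query (assumption: no network access)."""
    return _CORPUS.get(query, [])


def run_pipeline(queries: list[str], min_score: float, top_k: int) -> list[Hit]:
    # Batch: issue every query and pool the hits.
    pooled = [hit for q in queries for hit in fake_search(q)]
    # Filter: drop low-scoring hits.
    kept = [hit for hit in pooled if hit.score >= min_score]
    # Aggregate: keep only the best-scoring hit per URL.
    best: dict[str, Hit] = {}
    for hit in kept:
        if hit.url not in best or hit.score > best[hit.url].score:
            best[hit.url] = hit
    # Rank: highest score first, then truncate to top_k.
    return sorted(best.values(), key=lambda h: h.score, reverse=True)[:top_k]
```

Keeping these steps in one function means the agent receives a single ranked, deduplicated list rather than one raw result set per query.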

RELEASE · 4w ago

Firecrawl launches /monitor webhooks with up to 90% lower token use

Firecrawl launched /monitor, a URL watcher that only pings agents when tracked pages actually change and can send results by webhook. Use it for change-only ingestion to cut LLM token spend on monitored pages.
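Change-only ingestion can be illustrated with a generic content-fingerprint check. The sketch below is not Firecrawl's API or implementation: `ChangeWatcher` and `observe` are hypothetical names, and it assumes the caller has already fetched the page text by some other means.

```python
import hashlib


class ChangeWatcher:
    """Track a content fingerprint per URL and report only real changes."""

    def __init__(self) -> None:
        # Maps URL -> SHA-256 hex digest of the last content seen.
        self._seen: dict[str, str] = {}

    def observe(self, url: str, content: str) -> bool:
        """Return True if the content is new or differs from the last snapshot."""
        digest = hashlib.sha256(content.encode("utf-8")).hexdigest()
        changed = self._seen.get(url) != digest
        self._seen[url] = digest
        return changed
```

A caller would forward a page to the LLM only when `observe` returns `True`, so unchanged pages cost no tokens. Note that the first observation of a URL counts as a change in this sketch.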

NEWS · 4w ago

Firecrawl integrates into Vercel Marketplace with scraping, search, and dynamic-site access

Firecrawl is now available through Vercel Marketplace and Agent Marketplace for apps and agents that need live web data. The integration reduces setup friction for teams adding scraping, search, and structured retrieval to deployed AI workflows.

NEWS · 4w ago

SynthID adds OpenAI, ElevenLabs, and Kakao partners as Search and Chrome gain verification

Google expanded SynthID with new model partners and pushed verification into Search, Chrome, and Pixel video provenance flows. That matters because AI-content authentication is moving from isolated model outputs into mainstream browser and distribution surfaces.

NEWS · 1mo ago

Turbopuffer reports $100M run-rate and a 95% Cursor code-search cost cut

Turbopuffer said it crossed a $100M run-rate while staying profitable on less than $1M raised, and said Cursor moved production search onto the stack with a 95% cost reduction. The milestone matters because AI products increasingly compete on retrieval quality and cost, not just model output.

RELEASE · 1mo ago

OpenRouter adds openrouter:web_search and Parallel results at $0.005 per request

OpenRouter replaced its old web plugin path with agentic web search and fetch tools that use a common schema across models. Migrate to the new tools if you need multi-search turns, domain filtering, or Parallel- or Exa-native routing.

RELEASE · 1mo ago

Hermes Agent supports X Premium+ login with Grok 4.3 and X Search

Nous Research expanded Hermes Agent so X Premium+ and SuperGrok logins can unlock Grok 4.3, X Search, and media tools without separate keys. Bookmarks and full X API access still sit outside the OAuth path.

RELEASE · 1mo ago

Firecrawl adds Highlights to /scrape with 100x fewer tokens

Firecrawl added a Highlights mode to /scrape that returns matching text, code, or tables for a query instead of full-page payloads. The company says it benchmarked the feature on 10,000 URLs against Exa Highlights, and it aims the mode at lower-token agent retrieval.
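The idea of returning only query-matched passages rather than a full page can be sketched with simple term overlap. This is an illustrative stand-in, not Firecrawl's method: the `highlights` function is hypothetical, and a production system would likely use a stronger relevance model than word matching.

```python
import re


def highlights(page_text: str, query: str, max_passages: int = 2) -> list[str]:
    """Return the passages that share the most words with the query."""
    terms = set(re.findall(r"\w+", query.lower()))
    # Treat blank-line-separated blocks as passages.
    passages = [p.strip() for p in page_text.split("\n\n") if p.strip()]
    scored = []
    for idx, passage in enumerate(passages):
        words = set(re.findall(r"\w+", passage.lower()))
        overlap = len(terms & words)
        if overlap:
            # Negative overlap sorts best matches first; idx keeps page order on ties.
            scored.append((-overlap, idx, passage))
    scored.sort()
    return [passage for _, _, passage in scored[:max_passages]]
```

Only the selected passages would be passed to the model, which is where the token saving comes from.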

RELEASE · 1mo ago

Perplexity adds Finance Search to Agent API with live data and FinSearchComp T1 lead

Perplexity added Finance Search to the Agent API with licensed real-time market data and cited web sources in one tool call. The company says it led FinSearchComp T1 on live-data accuracy and lowest cost per correct answer, so teams building finance agents should evaluate it against their current stack.

RELEASE · 1mo ago

Firecrawl adds Question format to /scrape with grounded answers and 100x fewer tokens

Firecrawl introduced a /scrape mode that answers a question directly from a URL instead of returning chunks for a separate retrieval loop. It targets docs and pricing pages, and teams should use it when they want grounded answers with lower token usage.

RELEASE · 1mo ago

TinyFish opens Search and Fetch for free with MCP, CLI, and <0.5 s p50

TinyFish opened its Search and Fetch features for free with generous rate limits across REST, MCP, CLI, and SDKs. The change gives agent builders cheaper web retrieval while returning structured search JSON or rendered markdown instead of raw HTML.

RELEASE · 2mo ago

Google AI Studio adds multi-chat and web search to Build mode

Google AI Studio added multi-chat threads and web search grounding to Build mode, so Gemini coding sessions can branch while pulling live docs into the workspace. The feature improves in-browser prototyping loops, but it is currently scoped to AI Studio rather than the Gemini API itself.

RELEASE · 2mo ago

GitHub Copilot adds semantic indexing to all workspaces and cross-repo search in @code

GitHub expanded semantic indexing beyond GitHub and Azure DevOps remotes, so Copilot can search across more workspace types and repositories inside @code. That improves agent context retrieval in local workflows, while the same release also adds chat-history recall and prompt-eval tooling.

NEWS · 2mo ago

Gemini adds Grounding with Exa for websites, docs, people, and company search

Gemini models can now use Grounding with Exa to search websites, technical docs, papers, people, and companies through Exa's index. That gives Gemini a new agent-style grounding path alongside Google's first-party search tooling.

RELEASE · 2mo ago

LightOn releases LateOn and DenseOn at 149M params with BEIR 57.22

LightOn open-sourced DenseOn and LateOn plus the training pipeline behind them, including 1.4 billion query-document pairs and decontaminated BEIR results. Teams can use the small open retrieval models and reproduced data mixtures instead of opaque closed-data baselines.

NEWS · 2mo ago

OpenRouter adds Firecrawl web search with full-page markdown grounding

OpenRouter added Firecrawl as a search provider, letting models ground responses in scraped full web pages instead of snippet-only search. The launch folds crawling into the existing plugin settings flow and includes a capped free plan on the Firecrawl side.

WORKFLOW · 2mo ago

LongTracer opens local STS+NLI claim checks for RAG validation

LongTracer open-sourced local STS+NLI claim checks, while qi published a private search engine with a Claude Code plugin and LM Studio users shared MCP search configs for Qwen. Use these stacks to ground retrieval and verify answers without a second judge model.