Skip to content
AI Primer
TOOL14 stories

Browser Automation

Stories, products, and related signals connected to this tag in Explore.

NEWS19th May
Google introduces WebMCP with Chrome DevTools for agents and Modern Web Guidance

Google introduced WebMCP as a proposed bridge between websites and coding agents, and paired it with Chrome DevTools support for agent debugging plus Modern Web Guidance. It matters because Google is trying to standardize browser-facing agent behavior, not just model APIs.

RELEASE1w ago
OpenClaw 2026.5.18 ships Grok OAuth, Android Talk Mode, and dialog-aware browser actions

OpenClaw 2026.5.18 shipped Grok OAuth and sidecar auth fixes, realtime Android Talk Mode, Telegram forum-topic delivery fixes, and better browser dialog handling. The release removes several auth and UI dead-ends that can stall long agent runs.

RELEASE1w ago
Kimi launches Web Bridge extension with Claude Code, Cursor, and Codex support

Kimi released Web Bridge, a browser extension that lets agents search, scroll, click, type, and save repeatable skills across websites. The bridge works with Kimi Code CLI plus Claude Code, Cursor, Codex, Hermes, and other agents.

NEWS2w ago
Google introduces Gemini Intelligence on Android with browser use, AppFunctions, and Rambler

Google unveiled Gemini Intelligence at the Android Show with cross-app task automation, Gemini in Chrome, Rambler voice cleanup, custom widgets, and AppFunctions. The rollout moves Gemini into core Android workflows on Pixel and Galaxy devices this summer.

RELEASE2w ago
Hyperbrowser launches CLI with under-50ms sandboxes and hx web commands

Hyperbrowser shipped a CLI that exposes sandbox lifecycle, web fetch/search/crawl, and snapshotting from the terminal. The tool matters because it turns browser automation and forkable state into shell primitives for agent workflows.

RELEASE2w ago
OpenAI launches Codex Chrome extension for background tabs and logged-in sites

OpenAI shipped a Chrome extension for Codex on macOS and Windows that can work across logged-in sites and multiple background tabs. It should speed up testing, data entry, and other web app tasks by letting Codex run more parallel browser work.

RELEASE2w ago
Navigator n1.5 claims web computer-use Pareto gains on accuracy, latency, and cost

Yutori rolled out Navigator n1.5 as a web computer-use model and said it improves the tradeoff between accuracy, latency, and cost for browser tasks. The launch matters because related environment-generation work is aimed at the long-horizon web workflows that make computer-use agents expensive and brittle.

WORKFLOW3w ago
LangChain adds Browserbase search, fetch, and browser subagents to Deep Agents

LangChain shipped a Browserbase integration that gives Deep Agents dedicated search, fetch, and browser subagents with dashboard observability. That turns web navigation into a first-class tool path for agent workflows instead of a custom one-off browser loop.

NEWS4w ago
Sigma launches private AI browser with local OpenClaw, Gemma 4, and Qwen support

Sigma added a private AI browser mode that runs OpenClaw with local models such as Gemma 4, Qwen, and Nemotron on-device. That matters because browser automation and page context can stay local instead of being routed through a hosted agent service.

RELEASE4w ago
Droids launches Automated QA with /install-qa, browser flows, and PR reports

Factory launched Automated QA in Droids, adding /install-qa and /qa to drive apps like a real user and attach screenshots, traces, and logs to PRs. The feature packages browser-based regression testing as a built-in agent workflow.

RELEASE4w ago
Browser Use launches Browser Use Box with persistent logins and Telegram control

Browser Use launched Browser Use Box, a 24/7 Browser Harness environment with persistent logins and Telegram control. It moves browser agents off laptops and into always-on remote sessions for long-running web tasks.

RELEASE4w ago
OpenClaw 2026.4.24 adds voice-call handoff and browser recovery

OpenClaw shipped a release that routes realtime voice queries to the full agent, defaults new users to V4 Flash, and adds coordinate clicks plus stale-lock recovery for browser automation. It also fixes Telegram, Slack, MCP session, and TTS issues, so update if those flows matter to your setup.

RELEASE4w ago
Cua Driver opens macOS background app control with multi-cursor support for Claude Code and Codex

Cua Driver open-sourced a macOS driver that lets agents control apps in the background with multi-player and multi-cursor support. It matters because it turns background computer use from an app-specific feature into a reusable primitive that any agent loop can adopt.

RELEASE1mo ago
Hermes Agent launches Tool Gateway with 300+ models and bundled tools

Hermes Agent added Tool Gateway, bundling 300+ models with web, browser, image, terminal, and TTS tools behind one subscription. Firecrawl, Browser Use, Fal image models, and Gemini Voice shipped at launch.

AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.