TOPIC23 stories

Browser Automation

Stories, products, and related signals connected to this tag in Explore.

Stories

VS Code ships Copilot browser tools for web app validation

VS Code shipped a Copilot update with browser-agent tools for web app validation, plus a preview Agents window for parallel workflows, BYOK model discovery, and cost visibility. Version 1.128 also adds grouped and workspace-less chats in the Agents window.

RELEASE3rd July

Browser Use CLI 3.0 releases direct CDP control with 6× smaller context

Browser Use CLI 3.0 shipped direct Chrome DevTools Protocol control through browser-harness with a 6× smaller context path. Try it with Claude Code, Codex, cloud browsers, or local Chrome sessions to cut browser-agent context overhead.

RELEASE2w ago

Gemini 3.5 Flash adds Computer Use with 78.4 OSWorld score

Google released built-in Computer Use for Gemini 3.5 Flash across browser, mobile, and desktop. Try it for agent workflows, but watch for timeout issues on long design-from-scratch runs.

RELEASE3w ago

OpenAI opens Codex Computer Use and Chrome extension in the EEA, UK, and Switzerland

OpenAI expanded Codex in Europe with Computer Use, the Chrome extension, Memory, and Chronicle. The rollout broadens browser and desktop automation outside the U.S., though some memory features remain opt-in or preview-only.

WORKFLOW4w ago

Codex users report Appshots, browser control, and parallel PR workflows

OpenAI shipped a docs agent that can hand off guides to Codex, and users published Appshots, browser-control, parallel PR, and multi-tree workflows. Watch the examples for ways to structure Codex around orchestrated tasks, while code-review and plugin gaps remain.

RELEASE4w ago

Kimi Work launches desktop agent with 300 local swarms and WebBridge

Kimi Work launched for macOS and Windows with up to 300 local agents, WebBridge browser control, finance data tools, and a memory system. It matters because the app bundles multi-agent desktop automation locally instead of routing the workflow through a hosted IDE.

NEWS1mo ago

Browser Use adds cloud profiles and geo proxies, with 484 browsers in <2s

Browser Use launched synced cloud profiles for logged-in sessions, added geo-targeted proxies, and showed a 484-browser startup demo that finished in under two seconds. The update matters because hosted browser agents can now keep authenticated state and regional routing without custom session-management work.

RELEASE1mo ago

Browser Use launches browser infrastructure at $0.02/hour with subsecond cold starts

Browser Use rebuilt its runtime around a custom Chromium fork, Firecracker fork, and custom Linux kernel, claiming $0.02 per hour pricing with subsecond cold starts. The shift targets the infrastructure bottlenecks behind browser agents rather than model quality alone.

RELEASE1mo ago

Hyperbrowser launches AgentRank to test Claude, GPT, and Gemini on real websites

Hyperbrowser launched AgentRank, an open-source tool that runs Claude, GPT, and Gemini agents against a site to show where they get stuck. It matters because teams can turn agent website compatibility into a repeatable eval instead of an anecdotal demo.

NEWS1mo ago

Google introduces WebMCP with Chrome DevTools for agents and Modern Web Guidance

Google introduced WebMCP as a proposed bridge between websites and coding agents, and paired it with Chrome DevTools support for agent debugging plus Modern Web Guidance. It matters because Google is trying to standardize browser-facing agent behavior, not just model APIs.

RELEASE1mo ago

OpenClaw 2026.5.18 ships Grok OAuth, Android Talk Mode, and dialog-aware browser actions

OpenClaw 2026.5.18 shipped Grok OAuth and sidecar auth fixes, realtime Android Talk Mode, Telegram forum-topic delivery fixes, and better browser dialog handling. The release removes several auth and UI dead-ends that can stall long agent runs.

RELEASE1mo ago

Kimi launches Web Bridge extension with Claude Code, Cursor, and Codex support

Kimi released Web Bridge, a browser extension that lets agents search, scroll, click, type, and save repeatable skills across websites. The bridge works with Kimi Code CLI plus Claude Code, Cursor, Codex, Hermes, and other agents.

NEWS1mo ago

Google introduces Gemini Intelligence on Android with browser use, AppFunctions, and Rambler

Google unveiled Gemini Intelligence at the Android Show with cross-app task automation, Gemini in Chrome, Rambler voice cleanup, custom widgets, and AppFunctions. The rollout moves Gemini into core Android workflows on Pixel and Galaxy devices this summer.

RELEASE2mo ago

Hyperbrowser launches CLI with under-50ms sandboxes and hx web commands

Hyperbrowser shipped a CLI that exposes sandbox lifecycle, web fetch/search/crawl, and snapshotting from the terminal. The tool matters because it turns browser automation and forkable state into shell primitives for agent workflows.

RELEASE2mo ago

OpenAI launches Codex Chrome extension for background tabs and logged-in sites

OpenAI shipped a Chrome extension for Codex on macOS and Windows that can work across logged-in sites and multiple background tabs. It should speed up testing, data entry, and other web app tasks by letting Codex run more parallel browser work.

RELEASE2mo ago

Navigator n1.5 claims web computer-use Pareto gains on accuracy, latency, and cost

Yutori rolled out Navigator n1.5 as a web computer-use model and said it improves the tradeoff between accuracy, latency, and cost for browser tasks. The launch matters because related environment-generation work is aimed at the long-horizon web workflows that make computer-use agents expensive and brittle.

WORKFLOW2mo ago

LangChain adds Browserbase search, fetch, and browser subagents to Deep Agents

LangChain shipped a Browserbase integration that gives Deep Agents dedicated search, fetch, and browser subagents with dashboard observability. That turns web navigation into a first-class tool path for agent workflows instead of a custom one-off browser loop.

NEWS2mo ago

Sigma launches private AI browser with local OpenClaw, Gemma 4, and Qwen support

Sigma added a private AI browser mode that runs OpenClaw with local models such as Gemma 4, Qwen, and Nemotron on-device. That matters because browser automation and page context can stay local instead of being routed through a hosted agent service.

RELEASE2mo ago

Browser Use launches Browser Use Box with persistent logins and Telegram control

Browser Use launched Browser Use Box, a 24/7 Browser Harness environment with persistent logins and Telegram control. It moves browser agents off laptops and into always-on remote sessions for long-running web tasks.

RELEASE2mo ago

Droids launches Automated QA with /install-qa, browser flows, and PR reports

Factory launched Automated QA in Droids, adding /install-qa and /qa to drive apps like a real user and attach screenshots, traces, and logs to PRs. The feature packages browser-based regression testing as a built-in agent workflow.

RELEASE2mo ago

OpenClaw 2026.4.24 adds voice-call handoff and browser recovery

OpenClaw shipped a release that routes realtime voice queries to the full agent, defaults new users to V4 Flash, and adds coordinate clicks plus stale-lock recovery for browser automation. It also fixes Telegram, Slack, MCP session, and TTS issues, so update if those flows matter to your setup.

RELEASE2mo ago

Cua Driver opens macOS background app control with multi-cursor support for Claude Code and Codex

Cua Driver open-sourced a macOS driver that lets agents control apps in the background with multi-player and multi-cursor support. It matters because it turns background computer use from an app-specific feature into a reusable primitive that any agent loop can adopt.

RELEASE2mo ago

Hermes Agent launches Tool Gateway with 300+ models and bundled tools

Hermes Agent added Tool Gateway, bundling 300+ models with web, browser, image, terminal, and TTS tools behind one subscription. Firecrawl, Browser Use, Fal image models, and Gemini Voice shipped at launch.