TOPIC50 stories

Agent runtime infrastructure

Backend primitives and platform services designed for autonomous agents as the primary consumer — agent-native storage, sandboxes, queues, and runtime infra.

Stories

Filter stories

RELEASE19th July

ChatGPT Work desktop adds cloud vs local run controls

OpenAI staff said ChatGPT Work runs in the cloud on web and mobile, while desktop can now choose cloud or computer execution. The clarification followed confusion about closed-laptop and local-environment behavior.

WORKFLOW18th July

Slate, LangChain, and AI SDK add graph control for agent runtimes

New releases and practitioner posts described graph-shaped agent runtimes instead of simple loops. Slate, LangChain's open-sourced agent factory, and AI SDK's Python package each add pieces for durable orchestration.

NEWS2w ago

Shepherd supports agent-run rollback beyond git rewind

A Stanford-linked Shepherd thread described rollback for long agent runs that restores files, packages, dev servers, and process state beyond git-style rewinds. Replies flagged KV-cache warmth and registered database inverse steps as cost and recovery boundaries.

RELEASE2w ago

AI SDK adds HarnessAgent for Pi, Claude, Codex, and OpenCode

AI SDK added HarnessAgent as a common interface for Pi, Claude, Codex, OpenCode, and other harnesses. Use it to run local or cloud software-factory jobs through official SDKs while subscriptions cover token usage.

RELEASE2w ago

Vercel adds FUSE Sandbox mounts and Agent Runs MCP/CLI access

Vercel shipped FUSE-based Sandbox mounts for S3 and network filesystems and opened Agent Runs through MCP and CLI. Use it to connect remote state, sandbox execution, and agent-readable Eve traces for self-improving workflows.

NEWS2w ago

OpenUI integrates Mastra, CopilotKit, and Eve through AG-UI generative components

Mastra published an OpenUI guide, CopilotKit showed Fable 5 agents rendering AG-UI React components, and OpenUI Adam targeted Vercel Eve. The cluster creates a shared component-streaming surface across React, design systems, and filesystem agents.

RELEASE2w ago

harbor exec launches agentic-map-reduce CLI via npx skills add harbor-exec

harbor exec launched an agentic-map-reduce CLI installed with npx skills add harbor-exec. Use it to run sandboxed agents for trace analysis, session mining, search, and rollout aggregation.

RELEASE2w ago

Browser Use CLI 3.0 releases direct CDP control with 6× smaller context

Browser Use CLI 3.0 shipped direct Chrome DevTools Protocol control through browser-harness with a 6× smaller context path. Try it with Claude Code, Codex, cloud browsers, or local Chrome sessions to cut browser-agent context overhead.

NEWS3w ago

Apify integrates x402 with 20,000 Actors for USDC-paid runs

Apify added more than 20,000 Actors to the x402 flow, letting agents pay in USDC and run tools on demand through HTTP 402 responses. That gives agents a way to buy web automation tasks without pre-provisioned API keys or a manual checkout step, so builders can test paid tool use directly.

RELEASE3w ago

Vercel AI SDK Harness API adds OpenCode and Deep Agents in one interface

Vercel extended the AI SDK Harness API to cover OpenCode and Deep Agents, adding more agent runtimes to the unified interface introduced in AI SDK 7. The change matters because apps can swap supported runtimes without rewriting integration code, though ACP is still awkward for some cloud deployments.

RELEASE3w ago

Rivet releases agentOS v0.2.0 with WebAssembly sandboxing and 1738x cheaper claim

Rivet released agentOS v0.2.0, a Rust rewrite of its WebAssembly-based sandbox and orchestration stack with multiplayer workflows and one-prompt deployment. The release targets self-hosted and cloud agent runtimes, and Rivet claims 1738x lower cost than SaaS sandboxes.

RELEASE3w ago

Vercel releases AI SDK 7 with approvals, durability, and telemetry

Vercel shipped AI SDK 7 with approvals, durability, telemetry, and other production agent primitives. Early adapter feedback points to breaking changes and migration work for SDKs that wrap the old APIs.

RELEASE4w ago

Latitude launches MIT-licensed agent monitoring with Signals clustering and MCP access

Latitude released an open-source platform for monitoring AI agents in production, with plain-English trace search, repeated-failure clustering, and MCP access from coding agents. That gives teams a self-hostable way to inspect token burn, surface recurring failures, and turn production traces into evals and fixes.

RELEASE4w ago

Google ships Interactions API in GA as Gemini default with background agents

Google put the Interactions API into GA as the new default for Gemini, adding background execution, managed agents, remote sandboxes, and multimodal tools. Builders now get one stateful interface for models, long-running jobs, and future Gemini Omni support.

NEWS4w ago

Fugu Ultra testers report 30-minute runs and 17x GLM cost after launch

Sakana launched Fugu Ultra on AI Gateway and published a technical report, with early testers sharing mixed results. Reports mention polished outputs on some tasks, but also 30-minute runs, uneven coding quality, and much higher cost than GLM-5.2.

NEWS4w ago

ComputeSDK releases 2026 100k Scale Invitational results across 6 sandbox providers

ComputeSDK published results from its 2026 100k Scale Invitational after weeks of reruns and infra tuning across Modal, Tensorlake, Northflank, Declaw AI, E2B, and Isorun. It matters because sandbox and agent infra claims now have a shared public concurrency target instead of vendor-specific load demos.

RELEASE4w ago

Omnigent opens live Claude Code and Codex sessions with phone control

Databricks open-sourced Omnigent, a meta-harness that runs Claude Code, Codex, Cursor, Pi, and custom agents in one live session with a collaborative web UI. The release centralizes supervision, cost control, and cross-agent review instead of splitting work across separate tools.

NEWS1mo ago

ENPIRE launches 8-agent Codex robot fleet for physical autoresearch

ENPIRE launched a physical autoresearch setup that gives eight Codex agents robots, GPUs, and real-world APIs for tasks like zip ties and part sorting. It matters because it moves long-horizon agent evaluation from browser-only loops into embodied experimentation with explicit safety controls.