Agent Product Launch
Stories about new releases or feature additions for a specific coding-agent product (e.g. Claude Code memory, Codex computer-use, Hermes Agent v0.9.0).
Stories
Filter storiesA day after MiniMax M3 launched, OpenCode, Hermes Agent, Flowith, Atomic Chat, Kilo Code, Cloudflare AI Gateway, and Vercel AI Gateway shipped support. That breadth shows M3 plugged into agent harnesses and routing layers immediately, not just its own API.
Alibaba released Qwen 3.7 Plus as a multimodal agent model for GUI, CLI, coding, and browser tasks. It ships with browser demos and immediate Cline support, giving teams another frontier-style agent model to compare against M3 and closed-source tools.
Nous Research moved Hermes Agent's native Windows build out of beta with direct PowerShell installation and a dedicated guide. Windows users now have a first-party install path instead of relying on WSL or other workarounds.
OpenClaw 2026.5.28 added Claude Opus 4.8 and Krea support while cutting fresh-install size 52.8% and speeding both cold and warm turns. It also expanded /subagents inspection, which should make delegated runs easier to debug.
Claude Code 2.1.154 added Dynamic Workflows, a research-preview mode that writes orchestration scripts and runs hundreds of subagents in one session. Anthropic also shipped 2.1.156 to fix Opus 4.8 thinking-block API errors, so teams should watch for workflow and API stability.
Anthropic released Claude Opus 4.8 across Claude, the API, and major clouds with higher coding scores and a cheaper 2.5x-speed Fast mode. Use it for coding workloads that want better benchmark performance without a price increase over 4.7.
xAI broadened Grok Build Beta while Toad and Kilo Code shipped direct support and published concrete build demos. That matters because Grok Build is moving from a standalone beta into terminal, editor, and web workflows engineers can actually wire into daily use.
Rollout posts say Grok Build CLI is reaching SuperGrok and X Premium+ users beyond the earlier higher tier. That broadens access to xAI's command-line agent and X search client without a new API launch.
Watchers spotted claude-mythos-1-preview references in Claude, Claude Code, and Claude Security, with one screenshot also showing adaptive thinking. That matters because Anthropic appears to be testing a coding- and security-focused access path before any wider rollout.
Grok Build 0.1.218 shipped shortcut and help fixes, while early testers reported strong terminal UX but missing long-run control, browser use, and reliable self-verification. That matters because xAI is already competitive on TUI ergonomics even as core agent controls remain incomplete.
OpenClaw 2026.5.22 shipped leaner gateway and model startup paths, bringing /models to about 5 ms, while also adding locked dependency shrinkwraps and safer Windows rollbacks. That matters because it targets both startup latency and release-install trust for local agent operators.
Cursor opened a Python and TypeScript SDK for building custom agents on Composer 2.5 and paired the launch with a 90% usage discount for the long weekend. Artificial Analysis data still shows Composer 2.5 leading on cost per task, making the SDK launch an efficiency play for builders.
Letta Code can now run fully locally with an embedded server, removing the login and Docker requirement while keeping memory sync via `/memory-repository`. That gives developers a local-first agent harness with optional Ollama and LM Studio support instead of forcing everything through Letta’s hosted API.
LangChain opened a private beta for Managed Deep Agents, a model-agnostic deployment layer built on deepagents with durable execution, sandboxes, and a context hub. The release turns deep-agent rollout into a single config-and-deploy flow and adds an auth proxy boundary for agent actions.
Cognition added native Windows VMs to Devin so it can build, run, and test Windows applications with MSBuild, IIS, PowerShell, and SQL Server. The rollout lets Devin handle enterprise codebases where Linux sandboxes are not enough.
xAI put Grok Build 0.1 into the API with 256K context and $1 per million input plus $2 output pricing, while OpenCode and Kilo wired it into coding workflows. Early web-build tests in Kilo landed around $0.07 to $0.14 per task.
Zed v1.3.5 adds Terminal Threads, turning CLI agents and long-running shell jobs into managed sidebar threads. Zed says this becomes the main path for Claude Code inside the editor as older subscription sign-in flows change.
ElevenLabs launched Speech Engine, a layer that adds transcription, speech synthesis, turn-taking, and interruption handling on top of an existing chat agent. The release pairs SDKs, one-command setup, and 8¢-per-minute pricing for production voice agents.
OpenClaw 2026.5.19 moves Android Talk Mode onto realtime Gateway relay voice sessions and adds device-code xAI OAuth for headless machines, alongside Telegram and browser-dialog fixes. The update tightens remote agent usability across voice, auth, and browser actions.
Google launched Antigravity 2.0 as a desktop app plus CLI/SDK stack for multi-agent workflows, and added Managed Agents to the Gemini API with persistent Linux sandboxes. Try it for agent orchestration and API-based sandboxing, but verify harness costs and runtime fit.
A day after leaks previewed Spark, Google officially launched Gemini Spark as a persistent personal agent that runs on dedicated cloud VMs and will connect to MCP tools. It matters because Google is moving Gemini from chat responses toward long-running delegated work across consumer and enterprise surfaces.
Cursor released Composer 2.5 in its editor and says it is stronger on long-running tasks, with included usage doubled for a week. Early comparisons place it near Opus 4.7-class coding, and Cursor says a much larger model is still training with 10x more compute.
Cognition launched Devin Auto-Triage to watch issues across Slack, Linear, GitHub, schedules, webhooks, and observability tools. Teams can use it as an always-on investigation flow that returns context, next steps, or a PR.
OpenClaw 2026.5.18 shipped Grok OAuth and sidecar auth fixes, realtime Android Talk Mode, Telegram forum-topic delivery fixes, and better browser dialog handling. The release removes several auth and UI dead-ends that can stall long agent runs.
GitHub made remote control generally available for Copilot CLI and code sessions, so users can monitor runs, approve actions, and answer prompts remotely. That turns long-running coding jobs into asynchronous workflows instead of terminal-bound sessions.
Manus upgraded scheduled work so recurring jobs can continue inside the same task and drive background updates in Manus-built web apps. That matters because long-lived automations can retain context between runs instead of rebuilding state each time.
Nous Research shipped Hermes Agent v0.14.0 with Grok subscription access, Codex as an OpenAI runtime, LINE, native video generation, and a Windows beta. This matters because Hermes is moving beyond point integrations into a broader agent runtime with new access paths and deployment surfaces.
Nous Research expanded Hermes Agent so X Premium+ and SuperGrok logins can unlock Grok 4.3, X Search, and media tools without separate keys. Bookmarks and full X API access still sit outside the OAuth path.
Nous Research added SuperGrok support to Hermes Agent, letting users plug a Grok subscription directly into the framework. It broadens Hermes beyond OpenAI runtimes and local setups into another mainstream agent model path.
Zed users can now sign in with a ChatGPT subscription and use the same OpenAI limits they get in Codex, alongside ACP, Codex CLI, or API-key flows. It removes a separate billing step for teams switching between editor-native and Codex-native workflows.
OpenAI rolled out Codex in the ChatGPT mobile app, letting users start work, review outputs, approve steps, and steer remote sessions from iPhone or Android. The preview keeps execution on a laptop, Mac mini, devbox, or SSH target while syncing screenshots, diffs, and terminal state back to mobile.
Hermes Agent can now route core tool calls through the Codex app-server when it is using OpenAI models. The integration gives Hermes users access to Codex runtime behavior with a `hermes update`, without changing the rest of their agent stack.
A week after Personal Computer launched on Mac, Perplexity added Snowflake as a live data source for Computer. The integration pushes the product into governed analytics workflows, while admins still control access, definitions, and shared data logic.
Kimi released Web Bridge, a browser extension that lets agents search, scroll, click, type, and save repeatable skills across websites. The bridge works with Kimi Code CLI plus Claude Code, Cursor, Codex, Hermes, and other agents.
xAI’s early Grok Build beta adds a coding CLI with plan mode, skills, plugins, and parallel subagents for app building and workflow automation. It gives the coding-agent field another serious terminal product, but the SuperGrok Heavy paywall sharply limits real-world evaluation.
Linear added Code Intelligence so Linear Agent can use repositories as shared product context for the wider team. The public beta is free on Business and Enterprise plans, and early reactions describe it as codebase navigation for non-engineers.
Cline open-sourced the runtime behind its extension and CLI as the Cline SDK, then rebuilt the CLI on top with agent teams, cron jobs, connectors, and example apps. The harness score gives teams a new reference point if they want to compare agent tooling on Terminal-Bench 2.0.
holaOS shipped Beta 0.1, adding Multi Workspaces, Sub Agents, a dashboard, and a kickoff flow on top of its agent-computer base. The release targets long-running workstreams that need persistent context instead of one-chat sessions.
OpenAI launched a 30-day migration offer that grants eligible enterprise customers two free months of Codex usage for new users. The promotion is meant to pull coding teams onto Codex as rival agent workflows get more expensive.
Anthropic rolled fast mode for Opus 4.7 into Claude Code and tools including Cursor, v0, Droid, Conductor, and OpenRouter. Use it where latency matters, but watch pricing: Cursor disclosed a 6x multiplier and others treat it as premium.
Anthropic shipped Claude Code 2.1.140 with a /goal fix for hook-restricted sessions, case-insensitive subagent matching, and prompt/token reductions. The update should reduce failures in managed settings and background runs.
OpenAI showed Codex working across apps in the background without taking over the Mac, and early users applied it to Telegram BotFather setup and front-end testing. That matters because Codex is moving from repo-only work into authenticated desktop workflows and UI-driven task loops.
Claude Code 2.1.139 shipped a research-preview agent view plus a `/goal` mode that keeps working across turns while showing elapsed time, turns, and token counts. The update turns parallel Claude sessions into a built-in control plane, so teams can drop tmux-and-scripts workarounds.
Anthropic made Claude Platform on AWS generally available, exposing the native Claude API with AWS authentication, billing, CloudTrail, and commitment retirement. It lets teams use Managed Agents and related Claude features inside existing AWS governance workflows.
Nous Research added early computer-use support to Hermes Agent through CUA, enabling background desktop control without taking over keyboard, mouse, or screen input. The feature opens computer use to local or alternative models instead of tying the workflow to frontier-only modes.
Hermes Agent added an official LINE gateway and OpenRouter published Pareto Code setup docs while users shared Discord and mobile SSH/TUI workflows. The change matters because Hermes is moving from ranking chatter into more concrete distribution channels and repeatable operator setups.
Amp’s sqs said the team paused adding more users to the Amp Neo beta to improve stability while early testers kept posting real-project demos. The update matters because it turns yesterday’s scaling complaints into an explicit access constraint for the remote coding-agent beta.
OpenCode made Ring 2.6 1T available in the editor with reasoning enabled and free access for a limited period. Follow-on posts from Kilo and others claim frontier-level results on AIME 26, ClawEval, Gaia2-search, and Tau2-Bench Telecom.
Nous said Hermes Agent hit No. 1 among AI apps on OpenRouter after v0.13.0 shipped and added credential pools for rotating provider keys. Independent posts also tracked migrations from OpenClaw and early routing support in the same stack.
Amp paused wider Neo rollout after hitting scaling issues, but beta users still showed remote sessions running from a home Mac mini through the web UI, including over airplane Wi-Fi. That makes Neo notable as a local-hosted coding-agent model, even if the control plane is not yet stable enough for broader access.