Orchestration
Control flow, handoffs, workflows, and agent runtime design.
Stories
Sakana made Fugu Ultra available through Vercel AI Gateway, while new technical writeups described the trained routing head and multi-step orchestration behind it. The integration matters because teams can invoke Fugu’s model-selection workflow through existing gateway plumbing instead of standing up custom routing.
Hermes Agent launched Mixture of Agents 2.0, letting users combine models from different providers into presets that behave like a normal model inside the agent loop. It matters because multi-model orchestration becomes a reusable runtime primitive instead of a custom routing workflow.
Vercel extended the AI SDK Harness API to cover OpenCode and Deep Agents, adding more agent runtimes to the unified interface introduced in AI SDK 7. The change matters because apps can swap supported runtimes without rewriting integration code, though ACP is still awkward for some cloud deployments.
Rivet released agentOS v0.2.0, a Rust rewrite of its WebAssembly-based sandbox and orchestration stack with multiplayer workflows and one-prompt deployment. The release targets self-hosted and cloud agent runtimes, and Rivet claims 1738x lower cost than SaaS sandboxes.
Sakana AI launched Fugu and Fugu Ultra as OpenAI-compatible orchestration models that route, verify, and synthesize across multiple models. The release matters because Sakana is selling multi-agent coordination as a single endpoint, but it has not fully disclosed the model mix or pass-through costs.
Builders released reusable loop artifacts this week, including a Loop Library Skill, repo templates, and published control-loop definitions for docs sweeps, onboarding checks, and error triage. It matters because teams are turning one-shot prompting into persistent agent runs with explicit stop conditions and shared repo state.
OpenHands added Agent Client Protocol support to its Agent Canvas, SDK, and Cloud, letting teams run different coding agents through one interface across local, remote, and cloud backends. The release also underpins new OpenHands Index results, so teams can compare harness-plus-model combinations instead of model-only runs.
Databricks open-sourced Omnigent, a meta-harness that runs Claude Code, Codex, Cursor, Pi, and custom agents in one live session with a collaborative web UI. The release centralizes supervision, cost control, and cross-agent review instead of splitting work across separate tools.
Flue 1.0 Beta reorganizes the framework around workflows, autonomous agents, and channel connectors while keeping model-agnostic deployment. The release gives TypeScript teams a more opinionated base for durable, long-running agents.
Factory 2.0 expands from coding agents to end-to-end software factories spanning tickets, code, tests, deployments, and incidents. The release makes feedback loops and model routing first-class parts of engineering automation.
Kilo's Product Week bundle added Agent Manager for isolated git worktrees, Kilo Console beta, REVIEWS.md memory hooks, and a balance-based MiniMax M3 plan. The bundle puts parallel agent runs, browser control, and plan provisioning into one shipped release.
Builders shipped OpenProse workflow files, ghzinga PR tabs, cmux terminal controls, datasette-agent-edit primitives, and an agent-optimized CLI fork. These pieces turn prompt strings into reusable files, panes, and testable edit loops for coding agents.
Repo Prompt Community Edition went live on GitHub as an open-source orchestration app built around MCP-first agent control, while the legacy project was archived separately. It matters because builders now get a public harness that can swap underlying CLI agents without rewriting the control surface.
CopilotKit shipped v1.59.2 with threads, Vue packages, a React Native SDK, and updated AG-UI building blocks for fullstack agent apps. The release makes it easier to ship Cursor- and Claude-like interfaces, with new work extending generative UI into Slack, Teams, and other chat surfaces.
Three days after Dynamic Workflows launched, Claude Code users reported accidental mode triggers, a 199-agent deep-research run that burned about 50 million tokens, and steep quota hits from design workflows. The complaints matter because orchestration can now dominate cost and behavior even when the underlying model is working as expected.
CopilotKit shipped an AG-UI integration that streams Claude Agent SDK agents into web and mobile frontends with generative UI and approval checkpoints. The adapter lets teams embed terminal-first Claude agents in React, Vue, Angular, and React Native without rewriting transport or state plumbing.
A day after Claude Code introduced Dynamic Workflows, builders shipped ports and clones for Codex, Conductor, and GLM-backed CC Mirror. The rapid ports turn the feature into a reusable orchestration pattern rather than an Anthropic-only runtime.
Claude Code 2.1.154 added Dynamic Workflows, a research-preview mode that writes orchestration scripts and runs hundreds of subagents in one session. Anthropic also shipped 2.1.156 to fix Opus 4.8 thinking-block API errors, so teams should watch for workflow and API stability.
Hermes Agent added an OpenHands orchestration skill that can be installed with a single command. The addition matters because Hermes can now route work across OpenHands, Claude Code, Codex, and OpenCode through the same skills interface.
The MCP 2026-07-28 release candidate makes the protocol stateless, removing the handshake and session ID. That changes how MCP servers can be load-balanced, retried, and deployed behind standard infrastructure.
Claude Code 2.1.147 added a deterministic Workflow tool, renamed `/simplify` to `/code-review`, and tightened sandboxing; 2.1.148 followed with a fix for the Bash 127 regression. The release matters because it changes multi-agent orchestration and review behavior while restoring automation reliability for existing Claude Code setups.
Warp launched Oz orchestration across Claude Code, Codex, and Warp Agent, with subagent delegation, isolated worktrees or containers, and beta multi-harness control. Try the new '&' handoff and Agent Memory if you run long sessions that need cloud continuation.
TanStack AI said its client and server are now two-way compatible with AG-UI, so TanStack clients can talk to any AG-UI server and vice versa. This matters because agent front ends and back ends can swap across framework boundaries without a migration rewrite.
Cline open-sourced the runtime behind its extension and CLI as the Cline SDK, then rebuilt the CLI on top with agent teams, cron jobs, connectors, and example apps. The harness score gives teams a new reference point if they want to compare agent tooling on Terminal-Bench 2.0.
LangChain unveiled SmithDB, LangSmith Engine, Managed Deep Agents, and GA sandboxes at Interrupt. The stack gives agent teams a purpose-built trace database, autonomous failure triage, and managed execution environments for production workflows.
Hermes Agent 0.13.0 adds enforced goal completion, Kanban-based multi-agent orchestration, and more extensibility for custom LLM providers and gateway channels. Early user reports also show Hermes browser skills cutting one Hacker News workflow from 102 seconds to 35 seconds and its cost from $1.46 to $0.28.
Nous Research added a Kanban workflow where specialized agents claim linked tasks, share files, and persist progress in SQLite-backed workspaces. The update moves Hermes from a single-agent loop to coordinated queues with human comments, heartbeats, and crash recovery.
World2Agent published a protocol for sensors that turn outside events into structured agent signals, plus a quickstart for a GitHub-based sensor. The release gives teams a standard way to pipe repos, logs, and feeds into workflows before a prompt arrives.
Builders shared concrete Symphony, create_agent, and MCP setup guides after arguing that model switching is easy but harness switching is not. The playbooks matter because they make harness engineering more repeatable, so teams can copy tested tooling and integration patterns.
Conductor 0.50 shipped steering controls, automatic repo-and-remote initialization, and direct `:line_number` jumps from agent output into diffs. Those changes reduce manual setup and make it easier to supervise long-running coding-agent sessions from inside the orchestration layer.
Stripe Projects added agent-friendly provisioning commands for OpenRouter, Daytona, Vercel, Render, and related tools. That lets agents buy model access, sandboxes, and hosting from the terminal instead of through dashboard-driven setup.
Mistral Studio added a Workflows orchestration layer that tracks state, retries, branches, and human approvals in public preview. That lets long-running agent flows resume after failures instead of restarting from scratch.
Helmor launched as a local-first, open-source alternative to Conductor with one-click import and GitLab-ready model configuration. That gives teams a GUI for orchestrating coding agents without routing work through a hosted cloud.
OpenAI released Symphony, an orchestration layer that turns issue trackers into Codex agent queues for PR generation and review. Early users say it can move many tickets in parallel, but token burn rises quickly when agents fan out.
Sakana AI opened beta access to Fugu, a multi-agent orchestration system that routes work across multiple frontier models via an OpenAI-compatible API. The launch packages model selection and role assignment as an external runtime, but access is still gated by beta signup.
Cursor 3.2 added /multitask async subagents, improved worktrees, and multi-root workspaces, then paired the release with a GPT-5.5 rollout at 72.8% on CursorBench. The update makes background agent orchestration a first-class IDE workflow instead of a blocking queue.
Nous Research shipped Hermes Agent v0.9.0 with a local web dashboard, new monitoring APIs, and broader platform updates. Teams using multi-agent workflows should test the new controls for profile cloning and long-running dashboard-managed sessions.
Nous said Hermes became the top coding app on OpenRouter while shipping an OpenClaw migration patch, Telegram agent-to-agent messaging, and new memory controls. If you run long-lived agents, watch the migration path and memory settings before moving chats or skills hubs.
LangChain launched Deep Agents Deploy in beta as a production path for open, model-agnostic agent harnesses configured with AGENTS.md, skills, and mcp.json. Deployments run on LangSmith and can expose MCP, A2A, and agent protocol while teams choose models and sandbox providers.
Hermes Agent v0.8.0 added remote code-execution backends, Browser Use cloud browsing, prompt caching, shared sessions, and CLI workflow upgrades like `hermes -w`. Try the new browser-backed and parallel execution paths if you need more persistent, multi-provider agent runs.
Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.
OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, session branch and restore, and webhook-driven TaskFlows. Composio also shipped a CLI for secure app authentication, so users can expand OpenClaw from a local coding harness into a broader agent runtime.
Builders shipped a direct Claude Code harness and a ClawHub marketplace skill for OpenClaw workflows. Use these routes to wire agent tooling into OpenClaw, but watch Claude API limits and token burn costs.
Imbue published a walkthrough for mngr showing how it turns tutorial scripts into pytest cases, runs many agents in parallel, and merges fixes back into one branch. The case study offers a repeatable pattern for evaluating agent tools, so teams can borrow the tmux capture, artifact dashboards, and local-to-Modal handoff.
Cursor 3 introduced a separate agent-first workspace that can run agents locally, in worktrees, over SSH, and in the cloud while keeping the editor available. The release gives teams a path to multi-agent orchestration without giving up the traditional IDE surface.
Hermes Agent v0.5.0 adds 400+ models via Nous Portal, Hugging Face access, Exa support, GPT-5.4 behavior tweaks, and a published changelog. The release broadens provider coverage and hardens the runtime without changing the terminal-first workflow.
Hankweave added short aliases that route the same prompt and code job into Anthropic's Agents SDK, Codex, or Gemini-style harnesses with unified logs and control. The release treats harness choice as a first-class variable instead of forcing teams to rebuild orchestration for each model stack.
CopilotKit published a walkthrough of AG-UI, an event-driven protocol that standardizes how agent frameworks stream text, tool calls, lifecycle events, and state to applications. The protocol aims to let teams swap agent backends without rewriting the UI contract.
OpenCode shipped terminal, desktop, and `opencode serve` workflows for an open-source coding agent with LSP support, plugins, and more than 75 providers. Users should look at the multi-backend web sessions, IPC plugins, and sandboxed local setup as the main differentiators.
Every opened Plus One, a hosted OpenClaw that lives in Slack, comes preloaded with internal skills, and works with a ChatGPT subscription or other API keys. It lowers the ops burden for deployed coworkers, so teams can test packaged agents before building their own stack.