Orchestration
Control flow, handoffs, workflows, and agent runtime design.
Stories
Filter storiesLangChain unveiled SmithDB, LangSmith Engine, Managed Deep Agents, and GA sandboxes at Interrupt. The stack gives agent teams a purpose-built trace database, autonomous failure triage, and managed execution environments for production workflows.
Cline open-sourced the runtime behind its extension and CLI as the Cline SDK, then rebuilt the CLI on top with agent teams, cron jobs, connectors, and example apps. The harness score gives teams a new reference point if they want to compare agent tooling on Terminal-Bench 2.0.
Hermes Agent 0.13.0 adds enforced goal completion, Kanban-based multi-agent orchestration, and more extensibility for custom LLM providers and gateway channels. Early user reports also show Hermes browser skills cutting one Hacker News workflow from 102 seconds to 35 and cost from $1.46 to $0.28.
Nous Research added a Kanban workflow where specialized agents claim linked tasks, share files, and persist progress in SQLite-backed workspaces. The update moves Hermes from a single-agent loop to coordinated queues with human comments, heartbeats, and crash recovery.
World2Agent published a protocol for sensors that turn outside events into structured agent signals, plus a quickstart for a GitHub-based sensor. The release gives teams a standard way to pipe repos, logs, and feeds into workflows before a prompt arrives.
Builders shared concrete Symphony, create_agent, and MCP setup guides after arguing that model switching is easy but harness switching is not. The playbooks matter because they make harness engineering more repeatable, so teams can copy tested tooling and integration patterns.
Conductor 0.50 shipped steering controls, automatic repo-and-remote initialization, and direct `:line_number` jumps from agent output into diffs. Those changes reduce manual setup and make it easier to supervise long-running coding-agent sessions from inside the orchestration layer.
Stripe Projects added agent-friendly provisioning commands for OpenRouter, Daytona, Vercel, Render, and related tools. That lets agents buy model access, sandboxes, and hosting from the terminal instead of dashboard-driven setup.
Mistral Studio added a Workflows orchestration layer that tracks state, retries, branches, and human approvals in public preview. That lets long-running agent flows resume after failures instead of restarting from scratch.
Helmor launched as a local-first, open-source alternative to Conductor with one-click import and GitLab-ready model configuration. That gives teams a GUI for orchestrating coding agents without routing work through a hosted cloud.
OpenAI released Symphony, an orchestration layer that turns issue trackers into Codex agent queues for PR generation and review. Early users say it can move many tickets in parallel, but token burn rises quickly when agents fan out.
Sakana AI opened beta access to Fugu, a multi-agent orchestration system that routes work across multiple frontier models via an OpenAI-compatible API. The launch packages model selection and role assignment as an external runtime, but access is still gated by beta signup.
Cursor 3.2 added /multitask async subagents, improved worktrees, and multi-root workspaces, then paired the release with GPT-5.5 rollout at 72.8% on CursorBench. The update makes background agent orchestration a first-class IDE workflow instead of a blocking queue.
Nous Research shipped Hermes Agent v0.9.0 with a local web dashboard, new monitoring APIs, and broader platform updates. Teams using multi-agent workflows should test the new controls for profile cloning and long-running dashboard-managed sessions.
Nous said Hermes became the top coding app on OpenRouter while shipping an OpenClaw migration patch, Telegram agent-to-agent messaging, and new memory controls. If you run long-lived agents, watch the migration path and memory settings before moving chats or skills hubs.
LangChain launched Deep Agents Deploy in beta as a production path for open, model-agnostic agent harnesses configured with AGENTS.md, skills, and mcp.json. Deployments run on LangSmith and can expose MCP, A2A, and agent protocol while teams choose models and sandbox providers.
Hermes Agent v0.8.0 added remote code-execution backends, Browser Use cloud browsing, prompt caching, shared sessions, and CLI workflow upgrades like `hermes -w`. Try the new browser-backed and parallel execution paths if you need more persistent, multi-provider agent runs.
OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, session branch and restore, and webhook-driven TaskFlows. Composio also shipped a CLI for secure app authentication, so users can expand OpenClaw from a local coding harness into a broader agent runtime.
Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.
Builders shipped a direct Claude Code harness and a ClawHub marketplace skill for OpenClaw workflows. Use these routes to wire agent tooling into OpenClaw, but watch Claude API limits and token burn costs.
Imbue published a walkthrough for mngr showing how it turns tutorial scripts into pytest cases, runs many agents in parallel, and merges fixes back into one branch. The case study offers a repeatable pattern for evaluating agent tools, so teams can borrow the tmux capture, artifact dashboards, and local-to-Modal handoff.
Cursor 3 introduced a separate agent-first workspace that can run agents locally, in worktrees, over SSH, and in the cloud while keeping the editor available. The release gives teams a path to multi-agent orchestration without giving up the traditional IDE surface.
Hermes Agent v0.5.0 adds 400+ models via Nous Portal, Hugging Face access, Exa support, GPT-5.4 behavior tweaks, and a published changelog. The release broadens provider coverage and hardens the runtime without changing the terminal-first workflow.
Hankweave added short aliases that route the same prompt and code job into Anthropic's Agents SDK, Codex, or Gemini-style harnesses with unified logs and control. The release treats harness choice as a first-class variable instead of forcing teams to rebuild orchestration for each model stack.
CopilotKit published a walkthrough of AG-UI, an event-driven protocol that standardizes how agent frameworks stream text, tool calls, lifecycle events, and state to applications. The protocol aims to let teams swap agent backends without rewriting the UI contract.
OpenCode shipped terminal, desktop, and `opencode serve` workflows for an open-source coding agent with LSP support, plugins, and more than 75 providers. Users should look at the multi-backend web sessions, IPC plugins, and sandboxed local setup as the main differentiators.
Cline launched Kanban, a local multi-agent board that runs Claude, Codex, and Cline CLI tasks in isolated worktrees with dependency chains and diffs. Teams can use it as a visual control layer for parallel coding agents on repo chores that split cleanly.
Every opened Plus One, a hosted OpenClaw that lives in Slack, comes preloaded with internal skills, and works with a ChatGPT subscription or other API keys. It lowers the ops burden for deployed coworkers, so teams can test packaged agents before building their own stack.
OpenClaw 2026.3.24 adds native Microsoft Teams, OpenWebUI sub-agent access, Slack reply buttons, and a control surface for skills and tools. The release expands where the runtime can plug into enterprise workflows, while also increasing the surface area teams need to secure.
CopilotKit shipped hooks that let agents inspect app state and call frontend actions, then paired them with Shadify for ShadCN-based UI composition. It gives embedded agents a cleaner path from chat to in-app behavior.
Agent Flywheel lays out a planning-first workflow built on beads, agent mail, swarms, and TUI inspection for very large coding runs. It is useful because the guide exposes coordination primitives and review loops, not just benchmark screenshots.
LangSmith Fleet introduces shared agents with edit and run permissions, agent identity, human approvals, and tracing. That matters because enterprise agent rollout is shifting from single-user demos to governed, auditable deployment surfaces.
CopilotKit published a generative-ui repo plus an MCP server for bringing agent-built interfaces into existing applications. It gives teams concrete patterns for controlled, declarative, and open-ended UI surfaces instead of ad hoc demos.
JetBrains introduced Air, an agentic development environment where multiple agents can edit alongside you on host, in Docker, or on remote VMs. Its shared out-of-process state and checkout-free Git-server filesystem make multi-agent workflows easier to inspect and control.
LangChain rebranded Agent Builder to Fleet and added agent identity, memory, sharing controls, and LangSmith tracing for multi-user agent operations. It gives teams a governed way to deploy Slack- and GitHub-connected agents without stitching auth and auditing together by hand.
Cognition updated Devin so one session can break down large work and delegate subtasks to worker Devins running in separate VMs. It matters for audits, migrations, and QA runs where one long-context agent is slower than explicit parallelism.
Google now lets Gemini chain built-in tools like Search, Maps, File Search, and URL Context with custom functions inside a single API call. This removes orchestration glue for agent builders and brings Maps grounding into AI Studio for faster prototyping.
Manus added Google Workspace CLI support so one prompt can reply to Doc comments, edit individual Sheet cells, rename Slides, and reorganize Drive folders. Test it if your agent workflows already depend on Google Workspace data and approvals.
Conductor 0.39 adds instant chat summaries, PR actions in the command palette, in-file search, and a faster experimental sidebar. Teams running multi-session coding workflows can use it to cut UI friction and reduce context switching during review and triage.
CopilotKit open-sourced a generative UI template that renders agent-created HTML and SVG in a sandboxed iframe, with examples for charts, diagrams, algorithms, and 3D components. Use it to build interactive chat outputs without waiting for vendor-specific platform support.
Perplexity rolled Computer out to Pro subscribers and added Slack workflows, app connectors, custom skills, and credit-based usage for enterprise teams. Try multi-model agent workflows on real apps, but watch credit usage and local execution tradeoffs.
Nous Research shipped Hermes Agent v0.2.0 after 216 merged PRs, adding native MCP support, editor integrations, worktree isolation, rollback, and a larger skills ecosystem. Try it in real repos if you want broader tool support, official Claude support, and lighter installs.
CopilotKit added AG-UI and A2UI support so agents defined with open specs can stream interactive UI directly to the frontend. Use it to reduce custom glue between backend agent runtimes and user-facing interfaces across stacks.
ByteDance released DeerFlow 2.0 as an open-source multi-agent system with a browser workspace, parallel tasking, and OpenAI-compatible model support. Try it if you want a reusable repo for autonomous research-and-build workflows instead of a demo stack.
Ollama added scheduled /loop prompts for Claude Code, enabling recurring research, reminders, bug triage, and PR checks. Use it to automate background routines in local or self-hosted agent setups without adding a separate scheduler first.
Microsoft introduced Copilot Cowork for Microsoft 365, turning requests into plans that execute across apps and files within enterprise security and governance boundaries. Teams evaluating office agents should watch approvals, app coverage, and durable execution across its multi-model setup.
Hyperbrowser released HyperPlex, an open-source research agent that splits a goal into subtasks, runs browser workers in parallel, and returns cited reports. Teams building deep-research products can study the repo for orchestration, live browsing, and report synthesis patterns.