releaseMarch 30, 2026

Hermes Agent 0.6.0 adds multi-agent profiles and OpenWebUI streaming

Nous Research shipped Hermes Agent 0.6.0 with multi-agent profiles, release docs, and OpenWebUI tool-call streaming through its OpenAI-compatible endpoint. One install can now host separate agents with isolated memory, skills, and gateway connections.

Hermes Agent Coding Agents Multi-Agent Systems Developer Experience

3 min read

Hermes Agent 0.6.0 adds multi-agent profiles and OpenWebUI streaming

TL;DR

Nous Research’s launch post shipped Hermes Agent 0.6.0, and the linked full changelog turns the biggest change into a deployment feature: one install can now run multiple isolated agent profiles with separate config, memory, sessions, skills, and gateway state.
Teknium’s profiles post says those “independent bots” are available after hermes update, while the release notes describe strict token-locked isolation plus create, switch, export, and import flows for profile management.
Hermes 0.6.0 also expands its integration surface: the release notes add MCP server mode for MCP clients, an official Dockerfile, and an ordered provider fallback chain so inference can fail over when a primary backend goes down.
A follow-up streaming demo shows tool-call streaming now working in OpenWebUI through Hermes Agent’s OpenAI-compatible endpoint, and the linked API server docs say that endpoint supports streaming chat completions and stateful /v1/responses flows with tool use.

What shipped in Hermes Agent 0.6.0?

Nous Research

@NousResearch

·Follow

The Hermes Agent update you've been waiting for is here.

Watch on X

6:43 PM · Mar 30, 2026

3.9K

Read 268 replies

The headline feature in Hermes 0.6.0 is profiles. According to the release notes, each profile is a fully isolated Hermes instance with its own configuration, memory, sessions, skills, gateway service, and state, so a single installation can host multiple agents without credential or context collisions. Teknium’s profiles post summarized it as “as many independent bots” with separate “memory, gateway connections, skills, chat history, everything,” and pointed users to update in place.

The release widens deployment options beyond profile isolation. The full changelog adds an MCP server mode that exposes Hermes conversations and sessions to MCP-compatible clients including Claude Desktop, Cursor, and VS Code over stdio or HTTP. The same release also adds an official Dockerfile and an ordered fallback provider chain, which the changelog says can automatically fail over when the first inference provider fails.

Nous also used 0.6.0 to deepen chat-platform support. In the release notes, the new and expanded adapters include Feishu/Lark, WeCom, Slack multi-workspace OAuth, and Telegram webhook support, pushing Hermes further toward multi-channel agent operations rather than a single local CLI workflow.

How does the new OpenWebUI workflow work?

Teknium (e/λ)

@Teknium

·Follow

Got tool call streaming working in OpenWebUI with our openai endpoint for Hermes Agent! So badass! Get the bleeding edge feature update with a simple `hermes update` in your console!

2:12 AM · Mar 31, 2026

154

Read 10 replies

The first post-release workflow demo is OpenWebUI integration with streamed tool calls. Teknium’s streaming demo says Hermes now has “tool call streaming” working in OpenWebUI through its OpenAI endpoint, which matters because it exposes intermediate tool activity instead of making tool-using runs feel like opaque long-polls.

Teknium (e/λ)

@Teknium

·Follow

Replying to @Teknium

FYI for docs on setting this up: hermes-agent.nousresearch.com/docs/user-guid…

3:22 AM · Mar 31, 2026

Read 1 reply

The API docs show that Hermes exposes an OpenAI-compatible HTTP server for frontends such as Open WebUI, LobeChat, LibreChat, NextChat, and ChatBox. That server supports streaming on /v1/chat/completions, including “real-time tool progress indicators,” and also offers /v1/responses for server-side conversation state via previous_response_id. In practice, that means Hermes can sit behind common OpenAI-format UIs while still surfacing its own tool stack for terminal commands, file operations, web search, memory, and skills.

Adoption was already spiking around the release. Teknium’s usage screenshot called it Hermes Agent’s “biggest day ever,” and the attached analytics image

shows OpenRouter usage climbing toward roughly 28B tokens on the latest day displayed, alongside “302B Total tokens” and “192 Models used.”

🧾 More sources

TL;DR2 tweets

Top-line summary of the 0.6.0 release, focused on multi-agent profiles, deployment features, and the new OpenWebUI streaming workflow.

What shipped in Hermes Agent 0.6.0?2 tweets

Core release details from the launch thread and changelog, emphasizing profiles, MCP support, Docker, failover, and platform adapters.

How does the new OpenWebUI workflow work?1 tweets

Post-release implementation details showing streamed tool calls through the OpenAI-compatible API server and early usage context.