breakingMarch 9, 2026

Microsoft launches Copilot Cowork for Microsoft 365 to run multi-step tasks across apps

Microsoft introduced Copilot Cowork for Microsoft 365, turning requests into plans that execute across apps and files within enterprise security and governance boundaries. Teams evaluating office agents should watch approvals, app coverage, and durable execution across its multi-model setup.

3 min read

Microsoft launches Copilot Cowork for Microsoft 365 to run multi-step tasks across apps

TL;DR

Microsoft's launch demo introduces Copilot Cowork for Microsoft 365 as an agentic handoff: you give it a task, it turns that request into a plan, and then executes across apps and files.
According to Microsoft wording, execution stays "grounded in your work data" and inside Microsoft 365 security and governance boundaries rather than acting as a separate consumer-style agent.
The most consequential implementation detail in the product text is a multi-model setup: Microsoft says it worked with Anthropic to integrate the technology behind Claude Cowork, and that Copilot can choose "the right model for the job."
Early reaction from the follow-up thread highlights the open engineering questions: how much model choice users actually get, whether the best models are exposed consistently, and how far Cowork can go beyond Microsoft app outputs.

What shipped

Copilot Cowork is Microsoft 365's move from chat-style assistance to delegated execution. In Microsoft's own description via the launch demo and the announcement wording, the system takes a request, generates a plan, and works across the user's apps and files instead of staying inside a single document or chat pane.

The enterprise control story is central to the launch. The product text captured in the product screenshot says identity, permissions, and compliance policies "apply by default," actions and outputs are auditable, and Cowork runs in a "protected, sandboxed cloud environment." That same text says durable execution is meant to let tasks keep progressing safely across devices, which matters more than the flashy demo language because it defines whether these agents can run as background workflows instead of interactive copilots.

A concrete example from the walkthrough post shows the intended workflow: prep for a meeting by scanning email, drafting a briefing doc, building a PowerPoint, and blocking calendar time. That post also says Cowork "builds a plan, asks for your approval," which suggests Microsoft is putting an approval gate between planning and execution rather than letting the agent act completely autonomously.

How Microsoft is differentiating it — and what remains unclear

Microsoft's clearest differentiator is model orchestration, not just app integration. The text in the product screenshot says Microsoft worked closely with Anthropic to integrate the technology behind Claude Cowork into Microsoft 365 Copilot, and adds that users are "not limited by one brand of models." The supporting reaction in the model-flexibility post makes the same point more bluntly: the claimed advantage is that Cowork is not tied to a single model family.

That matters only if the routing is transparent and the capability ceiling stays current. In his thread, Ethan Mollick points to the risk that Microsoft could use weaker models without making that obvious, and he argues recent frontier-model deltas are large enough that the choice materially changes agent performance. His follow-up in the follow-up thread raises a second implementation question: whether Copilot Cowork can improvise outputs with code, or whether it is mostly confined to artifacts that fit neatly into Microsoft app workflows.

For engineers evaluating office agents, those are the unresolved technical questions behind the launch: approval semantics, actual app and file coverage, how durable the background execution really is, and how observable the model-selection layer will be in production.

TL;DR

What shipped

How Microsoft is differentiating it — and what remains unclear

Discussion across the web