OpenAI says Responses API requests can now reuse warm containers for skills, shell, and code interpreter, making container startup about 10x faster. Lower startup latency matters more now that Codex is reaching free users, students, and subagent-heavy workflows.

OpenAI says it "added a container pool to the Responses API," which lets requests reuse warm infrastructure instead of paying the full container-creation cost on every session. In the same post, OpenAI says container startup for skills, shell, and code interpreter is now "about 10x faster" launch details.
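Because the pool is server-side, existing Responses API calls should pick up the speedup without code changes. Below is a minimal sketch of the kind of request that benefits, using the OpenAI Python SDK's code interpreter tool; the model name is a placeholder, and the "auto" container mode simply asks the platform to provision (and now, presumably, reuse) a container.

```python
# Minimal sketch: one Responses API request that needs an execution
# container. Nothing here is new client-side; the warm-start speedup
# applies to the container the platform attaches to this request.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.responses.create(
    model="gpt-4.1",  # placeholder; use whichever model you run
    tools=[{
        "type": "code_interpreter",
        # "auto" asks the platform to provision a container for this
        # request; with the pool, that container may now come up warm.
        "container": {"type": "auto"},
    }],
    input="Run Python to compute the first 10 Fibonacci numbers.",
)

print(response.output_text)
```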
That is an infrastructure change, not a new agent primitive. The practical shift is lower cold-start overhead for tool-using agents that need an execution environment before they can run code or shell steps, and OpenAI framed it as making "agent workflows" faster rather than changing model behavior repost.
The timing lines up with broader Codex distribution. A developer post says "You can use Codex in your free ChatGPT account" free-tier note, and a separate OpenAI promotion, surfaced in the repost, offers college students in the U.S. and Canada $100 in Codex credits.
That wider access increases the odds that more users will hit execution latency in real workflows, especially when one request fans out into several workers. The shared screenshot shows a concrete example: one run starts by "Spawning 5 agents" and then creates two "explorer" agents and three "worker" agents, which is exactly the kind of subagent-heavy pattern where warm container reuse can shave visible setup time.
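To make the fan-out concrete, here is a hedged sketch of that pattern as plain parallel Responses API calls: two explorers and three workers launched concurrently, each needing its own execution container. The roles, prompts, and model name are hypothetical illustrations, not Codex's internal subagent mechanism, but they show where per-worker container startup used to stack up from cold.

```python
# Illustrative sketch of the "Spawning 5 agents" fan-out: five concurrent
# Responses API requests, each attaching a code interpreter container.
# With a warm pool, the container setup cost is no longer paid five
# times from scratch. Agent roles and tasks below are hypothetical.
import asyncio
from openai import AsyncOpenAI

client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment

async def run_agent(role: str, task: str) -> str:
    response = await client.responses.create(
        model="gpt-4.1",  # placeholder model name
        tools=[{"type": "code_interpreter", "container": {"type": "auto"}}],
        input=f"You are a {role} agent. {task}",
    )
    return response.output_text

async def main() -> None:
    # Two explorers and three workers, mirroring the screenshot's run.
    results = await asyncio.gather(
        run_agent("explorer", "Survey the repository layout."),
        run_agent("explorer", "List the test entry points."),
        run_agent("worker", "Run the unit tests and summarize failures."),
        run_agent("worker", "Lint the changed files."),
        run_agent("worker", "Build the docs and report warnings."),
    )
    for result in results:
        print(result[:200])  # print a preview of each agent's output

asyncio.run(main())
```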