Codex
OpenAI coding agent product and harness layer.
Stories
Filter storiesOpenAI said Codex accounts were seeing faster usage draining than intended because abuse and fraud checks were overflagging some sessions, then issued a usage reset for all users. It matters because paid Codex workflows were losing quota unexpectedly mid-run, directly affecting reliability and cost.
OpenAI published usage data showing Codex now generates 99.8% of its internal AI output tokens, with sharp growth in legal, support, recruiting, and finance. The report measures agent adoption as delegated parallel work, not just chat inside engineering.
Codex can now hand off an in-progress thread between local and remote machines and bring it back later. It matters because the handoff carries Git history, branches, and uncommitted changes while leaving the destination checkout untouched.
OpenAI added Record & Replay to Codex so users can demonstrate a repetitive computer task once and save it as a reusable skill. The first rollout is Mac-only and unavailable in the EEA, UK, and Switzerland, so teams should check access before planning rollout.
Codex workflows can now run against open-weight models served through compatible Responses API endpoints, with Ollama and vLLM publishing direct paths for GLM-5.2 and Kimi K2.7 Code. That matters because teams can keep the Codex interface while swapping to self-hosted or lower-cost inference backends.
OpenAI expanded Codex in Europe with Computer Use, the Chrome extension, Memory, and Chronicle. The rollout broadens browser and desktop automation outside the U.S., though some memory features remain opt-in or preview-only.
Codex users are having the agent write its own `/goal` and sub-agent goals, with OpenAI-side commentary describing that as a built-in meta-prompting pattern. The workflow turns long autonomous runs into a tighter control loop, but users still review goals first so a bad objective does not burn tokens for hours.
OpenAI shipped a docs agent that can hand off guides to Codex, and users published Appshots, browser-control, parallel PR, and multi-tree workflows. Watch the examples for ways to structure Codex around orchestrated tasks, while code-review and plugin gaps remain.
AI SDK canary added HarnessAgent, a unified abstraction that runs Claude Code, Codex, and Pi in sandboxed sessions with AI SDK-compatible streams. One integration can now target multiple agent harnesses without separate model-specific plumbing.
OpenAI said it will acquire Ona and fold its secure cloud execution and orchestration stack into Codex. The change targets agent jobs that need to keep running for hours or days after the original laptop session ends.
OpenAI started rolling out bankable Codex resets to Go, Plus, Pro, and Business users, plus a two-week referral program that can add more resets. That lets users save capacity for heavier Browser use and longer Codex sessions instead of losing resets on a fixed clock.
Codex usage moved further into phone-first workflows, with iOS dictation loops, background voice capture, and app updates like searchable settings and restored state. The comparisons still flag rough spots in multi-thread UX, Windows support, and cases where CLI tabs or cloud agents are easier to manage.
Helmor released an open-source mobile client that exposes Claude Code, Codex, OpenCode, and custom model backends behind a phone-first UI plus one-click Cloudflare Tunnel setup. The launch targets remote coding sessions from a handset instead of a laptop-only agent workflow.
A community workflow broke long-running Codex goals into six required fields, then added an eight-item preflight checklist and helper tools. The structure is meant to reduce runs that drift, stop early, or claim completion without an objective verification step.
MagicPath launched as an official Codex plugin, adding a shared canvas for interactive UI work, repo imports, design-system context, and image generation inside Codex. It matters because Codex now has a native surface for design-and-build loops instead of limiting collaboration to chat and code diffs.
A day after Codex users reported outages and caps, OpenAI said the service had three separate incidents and later disclosed a bug that undercounted tokens for some Plus and Pro accounts, while users reported paid-plan quotas reset. The update matters because Codex operators saw both service instability and account-limit changes in the same 24-hour window.
OpenAI expanded the Build iOS Apps plugin so Codex can test apps in an in-app browser, open SwiftUI previews, and hot-reload edits without leaving Codex. It matters because more of the iOS iteration loop stays inside the coding agent instead of bouncing through external simulators and manual preview steps.
Users reported outages, tighter 5-hour caps, and token availability problems a day after OpenAI launched Codex Sites and plugins. OpenAI reset Codex usage limits after three incidents, so teams should watch quotas and backend reliability as agent workflows ramp up.
OpenAI rolled out Codex Sites, annotations, and role-specific plugins, while weekly users topped 5 million. The release pushes Codex beyond coding into hosted workspace and app workflows for enterprise teams.
OpenAI made GPT-5.4, GPT-5.5, and Codex generally available through Amazon Bedrock. AWS shops can now use OpenAI models inside existing IAM, compliance, and procurement workflows instead of adopting a separate vendor stack.
OpenAI shipped a Python SDK and app-server support for Codex with thread creation, streamed turns, session resume, image inputs, and sandbox controls. That gives teams a supported way to embed Codex inside internal tools and automation instead of driving it only through the CLI or desktop app.
OpenAI restored Codex weekly and hourly quotas across paid ChatGPT plans after Tibo Sottiaux said the product hit 5 million users. Watch for long-running QA loops, migration PRs, and remote desktop sessions that can still burn through quotas fast.
Independent users compared GPT-5.5/Codex with Opus 4.8/Claude Code using DeepSWE cost charts, GBA Eval runs, and long coding sessions. The split matters because engineers choosing a daily coding stack now have external quality-versus-cost evidence instead of only vendor launch claims.
Builders added /dynamic orchestration, custom-model routing, and repo runbooks around Codex as users exposed new session lifecycle controls in the app. That makes Codex a better fit for long-running, multi-context coding work.
OpenAI added computer use to Codex on Windows and lets ChatGPT mobile steer tasks running on Windows PCs. The update extends Codex to existing Windows dev machines and adds remote review and debugging from mobile.
Codex on iOS now supports side conversations, end-of-turn diff summaries, archived remote threads, model switching, and Spotlight or Shortcuts hooks. The update brings more desktop-style task steering and change review to mobile sessions.
OpenAI and Thrive described Tax AI, a self-improving tax-prep system used across 30+ firms that processed 7,000 returns and reached up to 97% accuracy. The loop turns accountant corrections into eval targets and narrow Codex fixes, showing a concrete path to vertical agents that improve after deployment.
OpenAI said ChatGPT-linked Codex will drop GPT-5.2 and GPT-5.3-Codex on June 2, with GPT-5.5 becoming the default frontier model for free users. The API versions stay available, but the in-product model surface is being reduced for compute-fleet management.
Microsoft Research released SkillOpt, which optimizes external skill files instead of fine-tuning model weights and reports best-or-tied results across 52 evaluation cells. The method matters because it improved Codex and Claude Code accuracy without extra inference-time calls.
Practitioners published tests-first coding-agent workflows built around red-green TDD, Hurl suites, GitHub label actions, and Codex-based execution checks. The pattern matters because verification remains the main bottleneck once generation is fast, especially in longer multi-file sessions.
Practitioners published reusable Codex workflows for project audits, memory-driven skill packaging, mobile delegation, and remote computer use. Try the prompt-and-steps patterns if you want to adapt Codex across repos and devices.
Independent Codex users published Obsidian memory setups, reusable skill prompts, auto-triage flows, and Cloudflare-backed runners for longer jobs. That matters because Codex is being wrapped into persistent workspaces and operator-defined subagents instead of one-shot chats.
OpenAI said a recent Codex optimization lowered cache-hit rates in long-running sessions, drained limits faster, rolled it back, and reset all accounts. That matters because compaction and cache behavior directly determine quota burn and session reliability.
Two days after Codex added locked-Mac control and Appshots, users posted end-to-end iPhone simulator debugging, Safari form-filling, and remote-control workflows. That matters because the feature is moving from launch copy into concrete computer-use tasks that can replace manual QA and repetitive UI work.
Developers say Codex v0.133.0 improved compaction, remote-control workflows, and Chrome-driven Colab runs after `/goal` became default. The same update window also brought easier skill discovery and new diff options, though some users saw approval-pause regressions in full-access mode.
OpenAI shipped a Codex update that lets the mobile app control a locked Mac, adds Appshots for screen context, and graduates /goal. It also adds browser annotation tools, team plugin sharing, and expanded analytics for business users.
OpenAI said a metering bug put many Codex subscribers at the wrong usage level for about two hours, then restored balances and waived usage from that window. This matters because the incident interrupted active sessions and showed how subscription sync failures can halt agent runs mid-task.
OpenAI shipped shortcut customization, restored Git controls, cleaned up panels, and sped up large-repo operations in Codex. Paid-plan usage caps were also reset, though some accounts saw delayed propagation.
Days after `/goal` workflows first surfaced, users showed the command also works in the Codex app and shared runs for SSH setup, mech-interp scripts, and recurring work that lasted hours or days. The evidence points to Codex being used as a long-running research and ops agent, though the app still lacks explicit `/goal` UI.
OpenAI documented Codex remote connections, letting the ChatGPT app point at a separate Codex host such as a Mac mini or rented VPS. Try it for long runs that need to stay alive off-device or for phone-first coding sessions.
OpenAI said Codex’s GPT-5.5 degradation over the prior 48 hours came from two issues and it will reset usage limits after the fix. Users had reported looping runs, higher cache burn, and unstable sessions in active coding workflows.
Zed users can now sign in with a ChatGPT subscription and use the same OpenAI limits they get in Codex, alongside ACP, Codex CLI, or API-key flows. It removes a separate billing step for teams switching between editor-native and Codex-native workflows.
MagicPath 2.0 can now run as a native canvas inside Codex through a CLI skill, letting users design and refine app UI without leaving the coding session. It matters because visual iteration and implementation handoff now happen inside one agent workflow instead of bouncing between separate design and code tools.
OpenAI rolled out Codex in the ChatGPT mobile app, letting users start work, review outputs, approve steps, and steer remote sessions from iPhone or Android. The preview keeps execution on a laptop, Mac mini, devbox, or SSH target while syncing screenshots, diffs, and terminal state back to mobile.
OpenAI launched a 30-day migration offer that grants eligible enterprise customers two free months of Codex usage for new users. The promotion is meant to pull coding teams onto Codex as rival agent workflows get more expensive.
OpenAI detailed the Windows sandbox behind Codex, using local user accounts, ACLs, firewall rules, and DPAPI-protected secrets instead of a generic VM wrapper. The design gives Windows developers safer file and network controls without making coding-agent workflows unusable.
OpenAI showed Codex working across apps in the background without taking over the Mac, and early users applied it to Telegram BotFather setup and front-end testing. That matters because Codex is moving from repo-only work into authenticated desktop workflows and UI-driven task loops.
OpenAI staff said /goal is now available in the Codex app, and users posted long-running runs that fixed React Doctor scores, built iOS features, and queued weekend tasks. The update moves Codex from CLI-only planning to persistent, steerable work sessions.
Engineers shared fresh measurements on GPT-5.5 cache reuse, /fast pricing, and bug-finding budgets after comparison posts for GPT-5.5 and Opus 4.7 led the coding round-up. The reports suggest Codex cost and quality now swing on cache behavior and effort settings as much as on list prices.
Posts and screenshots from TestingCatalog, Kolt Regaskes, and others say Codex remote access is being prepared inside the ChatGPT app, but OpenAI has not confirmed a release. If real, the feature would extend the recent remote-control push from desktop sessions to phones.