OpenAI's coding agent for software engineering tasks such as generating code, fixing bugs, answering codebase questions, and reviewing changes.

Recent stories

50 linked stories

newsPRIMARY2026-05-13

OpenAI offers 2 free months of Codex to enterprise switchers

OpenAI launched a 30-day migration offer that grants eligible enterprise customers two free months of Codex usage for new users. The promotion is meant to pull coding teams onto Codex as rival agent workflows get more expensive.

newsPRIMARY2026-05-13

Codex introduces Windows sandbox with firewall rules and write-restricted tokens

OpenAI detailed the Windows sandbox behind Codex, using local user accounts, ACLs, firewall rules, and DPAPI-protected secrets instead of a generic VM wrapper. The design gives Windows developers safer file and network controls without making coding-agent workflows unusable.

newsPRIMARY2026-05-12

OpenAI Codex supports background computer use with Mac app control and Telegram BotFather setup

OpenAI showed Codex working across apps in the background without taking over the Mac, and early users applied it to Telegram BotFather setup and front-end testing. That matters because Codex is moving from repo-only work into authenticated desktop workflows and UI-driven task loops.

newsSECONDARY2026-05-11

Artificial Analysis launches Coding Agent Index: Cursor plus Opus 4.7 scores 61, Codex plus GPT-5.5 60

Artificial Analysis launched a Coding Agent Index for model-and-harness pairs, while OpenHands refreshed its model leaderboard. The results show harness choice matters, with cost varying over 30x and task time over 7x across stacks.

releaseSECONDARY2026-05-11

OpenAI launches Daybreak with GPT-5.5-Cyber, Codex workflows, and repo scanning

OpenAI launched Daybreak, combining GPT-5.5, Codex workflows, repo scanning, threat modeling, and patch generation for cyber-defense teams. It packages frontier models into a continuous secure-software workflow, so teams can test whether it fits their response pipeline.

workflowSECONDARY2026-05-11

Developers launch Agent FM, Mate, and ntm for multi-session Claude Code and Codex control

Independent developers shipped new control-plane tools for long-running coding agents, including Agent FM audio monitoring, Mate phone-first remote control, and ntm for provider-agnostic multi-agent workflows. It matters because teams running many Claude Code and Codex sessions still need better visibility, handoff, and checkpointing than a single built-in session list provides.

releaseSECONDARY2026-05-10

Crabbox 0.11.0 adds Google Cloud provider and repo-local job workflows

Crabbox 0.11.0 shipped a Google Cloud provider, repo-local job workflows, AWS Windows WSL2 hydration, and a Blacksmith sync-stall guard. Recent Codex and OpenClaw posts show Crabbox already being used for reproducible bug repro and recorded QA before-and-after runs.

workflowPRIMARY2026-05-10

Codex app adds /goal for long-running React Doctor and iOS runs

OpenAI staff said /goal is now available in the Codex app, and users posted long-running runs that fixed React Doctor scores, built iOS features, and queued weekend tasks. The update moves Codex from CLI-only planning to persistent, steerable work sessions.

newsSECONDARY2026-05-10

GPT-5.5 users report 3.3M cached tokens and 2.5x /fast credits

Engineers shared fresh measurements on GPT-5.5 cache reuse, /fast pricing, and bug-finding budgets after comparison posts for GPT-5.5 and Opus 4.7 led the coding round-up. The reports suggest Codex cost and quality now swing on cache behavior and effort settings as much as on list prices.

newsPRIMARY2026-05-10

Codex reportedly leaks mobile remote access in ChatGPT app screenshots

Posts and screenshots from TestingCatalog, Kolt Regaskes, and others say Codex remote access is being prepared inside the ChatGPT app, but OpenAI has not confirmed a release. If real, the feature would extend the recent remote-control push from desktop sessions to phones.

newsSECONDARY2026-05-09

GPT-5.5 vs Opus 4.7: users compare plan mode, frontend output, and 120K-context use

User posts and HN threads compared GPT-5.5 and Opus 4.7 across plan mode, frontend work, and 120K-context sessions. The split results mean token burn and instruction discipline matter as much as raw benchmark scores.

releasePRIMARY2026-05-09

Codex 0.130.0 adds `codex remote-control` and migration support for Code and Cowork

A day after `/goal` and remote-control preview surfaced, Codex 0.130.0 shipped a simpler headless entrypoint while the app’s migration tool added Code and Cowork support. Users also showed Codex handling bug repro, long-running `/goal` sessions, and plugin-driven expense filing, which broadens its role from chat-first coding to delegated workflows.

newsPRIMARY2026-05-08

Codex adds /goal mode for long-running tasks with remote control preview

OpenAI reports Codex can now keep pursuing a goal until an end state and is adding remote control plus a usage tab. The update matters because Codex sessions can span longer tasks and be managed across devices with less manual babysitting.

releasePRIMARY2026-05-07

OpenAI launches Codex Chrome extension for background tabs and logged-in sites

OpenAI shipped a Chrome extension for Codex on macOS and Windows that can work across logged-in sites and multiple background tabs. It should speed up testing, data entry, and other web app tasks by letting Codex run more parallel browser work.

workflowPRIMARY2026-05-03

Codex users report `/goal` sessions with 70-minute Stripe fixes and a 4,000-prompt cap

Users posted long-running Codex `/goal` sessions with auto-continuations, `pause`/`resume`, and file-backed goals. Watch the 4,000-prompt startup cap and early-stop drift if you plan to run longer agent loops.

releasePRIMARY2026-05-03

Codex updates Auto-Review to default with ~200x fewer approvals

OpenAI said Auto-Review is now the default inside Codex after an internal rollout cut needed approvals by about 200x. The shift moves more coding-agent work into guarded review loops with policy and egress controls.

releaseSECONDARY2026-05-03

Crabbox 0.4.0 launches ephemeral agent machines on Spot instances

Crabbox 0.4.0 adds throwaway machines for agent runs and cross-platform reproduction on macOS, Linux, and Windows. Use it to reproduce bugs and validate fixes without keeping long-lived cloud sessions around.

newsPRIMARY2026-05-03

Codex community ships Security plugin, Plannotator, and `dcg` hooks as third-party tooling forms

Independent builders shipped a Codex security-review pack, planning and annotation integration, and `dcg` safety-hook support in the same window. The burst matters because review, guardrail, and workflow tooling is forming around Codex beyond OpenAI’s own releases.

newsPRIMARY2026-05-02

Codex users report one-shot fixes and 1.7B-token days vs Claude Code

Developers posted side-by-side reports of faster one-shot fixes, 1.7B-token workdays, and fewer limit warnings with GPT-5.5 fast mode after OpenAI added Claude Code import. The comparisons matter because they turn migration talk into a concrete workflow choice.

releasePRIMARY2026-05-02

Codex adds `/hatch` pets, in-pet chat replies, and one-curl Petdex installs

OpenAI and community posts showed a new Codex pet layer built around `/hatch`, sprite-sheet generation, active-chat replies from the pet UI, and public pet galleries like Petdex. The feature matters because it turns Codex skills into a reusable UI-extension surface, not just a chat interface.

newsPRIMARY2026-05-01

OpenAI adds one-click Claude Code migration to Codex

OpenAI added one-click import for settings, plugins, agents, and project config into Codex, and users reported cleaner workflows with visible subagents and in-chat CI status. That reduces setup friction for existing agent stacks, and OpenAI says Codex revenue doubled in under seven days.

newsSECONDARY2026-04-30

OpenAI adds Advanced Account Security with passkeys

OpenAI added an opt-in security mode for ChatGPT and Codex that disables password-based recovery, shortens sessions, and requires passkeys or physical keys. Higher-risk accounts get stronger phishing resistance and automatic exclusion from model training when the mode is enabled.

releasePRIMARY2026-04-30

Codex adds `/goal`, role-based workflows, and 20% faster browser use

OpenAI expanded Codex with role-based work-flows, app connections, in-app previews, and the `/goal` command, while also improving browser use by about 20%. The update lets Codex keep working across docs, slides, spreadsheets, and web actions instead of staying in a single coding thread.

releasePRIMARY2026-04-29

OpenAI adds WebSocket mode to Responses API for 40% faster Codex loops

OpenAI added WebSocket mode to the Responses API and says it cuts repeated work across Codex tool loops, improving end-to-end speed by up to 40%. The change reduces runtime overhead for agent workflows, not just base-model latency.

newsSECONDARY2026-04-28

AWS and OpenAI launch Bedrock Managed Agents with Codex and model access in limited preview

AWS and OpenAI moved their expanded partnership into limited preview, bringing OpenAI models, Codex, and Bedrock Managed Agents onto AWS. That gives teams a direct AWS path for OpenAI-backed agent workflows instead of waiting on the earlier coming-soon timeline.

releasePRIMARY2026-04-28

Codex adds macOS computer use, in-app browser, and artifact previews

Codex gained background macOS control, page inspection, image generation, plugins, artifacts, and follow-up automations. That gives it one agent thread for desktop apps, frontend debugging, and recurring work.

releaseSECONDARY2026-04-27

Symphony launches Codex orchestration for Linear and GitHub issue queues

OpenAI released Symphony, an orchestration layer that turns issue trackers into Codex agent queues for PR generation and review. Early users say it can move many tickets in parallel, but token burn rises quickly when agents fan out.

newsPRIMARY2026-04-27

Codex raises paid-plan limits after GPT-5.5 shipping week

OpenAI reset Codex rate limits across all paid plans after a week of GPT-5.5 shipping. The temporary bump changes immediate capacity for active teams, but it was announced as a celebratory reset rather than a permanent quota change.

newsSECONDARY2026-04-26

Users report GPT-5.5 speeds up coding and cuts over-editing in low-reasoning runs

New evals and day-three user tests show GPT-5.5 performing well at low or medium reasoning, with benchmark gains over GPT-5.4 in coding-heavy use. That matters because stronger results no longer require xhigh runs, though some users still flag sycophancy.

workflowPRIMARY2026-04-26

Codex app-server supports 32-64 parallel jobs and burns limits 3-5x faster

OpenAI docs say Codex image generation counts against general usage and burns included limits 3-5x faster, while users showed app-server runs with 32 or 64 parallel workers. The workflow turns bulk image or research jobs into quota-backed batches, so teams should watch usage spikes closely.

newsSECONDARY2026-04-25

GPT-5.5 users report 4-10x shorter runs and smoother tool calls one day after launch

Users and third-party evals reported shorter runs, stronger long-context scores, and faster rollout into Cursor and other tools a day after GPT-5.5 hit the API. Higher per-token pricing may be partly offset by lower loop time and fewer tool-call stalls, so watch early bench data before changing defaults.

newsSECONDARY2026-04-25

Claude Code users report 30-40% token growth and incomplete long tasks

Users reported higher token use, partial long-document reviews, and rising spend on routine tasks after Claude Code regressions came into focus. Some developers still get strong results in constrained harnesses, but others may want to switch to Codex for long-running work.

workflowSECONDARY2026-04-25

ClawSweeper closes 4,000 OpenClaw issues with 50 Codex agents in one day

Steipete’s maintainer bot ran 50 Codex agents in parallel and closed about 4,000 OpenClaw issues in a day. The cleanup pushed into rate limits, so use the README dashboard and Project Clowfish clustering to track large agent sweeps.

newsPRIMARY2026-04-24

Codex users report one-shot bug fixes, 10-hour runs, and lower token burn a day after GPT-5.5 launch

A day after GPT-5.5 and the new Codex workflows launched, developers reported one-shot bug fixes, longer unattended runs, and lower token use in real coding tasks. The early hands-on comparisons matter because they are already shifting some teams' default agent workflow away from Claude Code.

releaseSECONDARY2026-04-23

Cua Driver opens macOS background app control with multi-cursor support for Claude Code and Codex

Cua Driver open-sourced a macOS driver that lets agents control apps in the background with multi-player and multi-cursor support. It matters because it turns background computer use from an app-specific feature into a reusable primitive that any agent loop can adopt.

releasePRIMARY2026-04-23

OpenAI releases GPT-5.5 with 82.7% Terminal-Bench and Codex browser control

OpenAI rolled out GPT-5.5 and GPT-5.5 Pro in ChatGPT and Codex, with higher scores on terminal, OS, cyber, and math evals than GPT-5.4. Codex also gained browser, document, and computer-use features for longer agent workflows.

newsSECONDARY2026-04-22

OpenAI launches workspace agents in ChatGPT with Slack, Linear, and scheduled actions

OpenAI introduced shared workspace agents in ChatGPT for Business, Enterprise, Edu, and Teachers plans, with Codex-powered background work across tools like Slack and Linear. The launch turns ChatGPT from a single-session assistant into a long-running team workflow surface with approvals, scheduling, and shared context.

newsPRIMARY2026-04-21

Codex reaches 4 million weekly users and resets rate limits

OpenAI said Codex passed 4 million weekly users less than two weeks after clearing 3 million, and then reset usage limits again. The scale jump matters because it points to rapid coding-agent adoption and likely plan and capacity changes.

releaseSECONDARY2026-04-21

OpenAI launches GPT Image 2 with thinking, 2K outputs, and text rendering gains

OpenAI released GPT Image 2 in ChatGPT, Codex, and the API with thinking mode and 2K outputs. Early tests and Arena scores suggest it is usable for slides, UI mockups, and dense infographic layouts.

releasePRIMARY2026-04-20

OpenAI Codex adds Chronicle screen memories in macOS Pro preview

OpenAI added Chronicle, a Codex preview that turns recent screen context into reusable memories for errors, files, docs, and workflows. The macOS Pro-only feature stores local memory unencrypted and can burn rate limits quickly, so watch prompt-injection risk before relying on it.

workflowPRIMARY2026-04-19

Codex users report subagent, MCP, and canary deploy workflows

Practitioners shared repeatable Codex workflows for long-lived threads, background subagents, computer-use access through MCP, and canary rollouts. Codex is being used less as a one-shot assistant and more as a persistent automation harness.

workflowPRIMARY2026-04-17

Codex supports hidden-app control on macOS as users report 38-hour computer-use sessions

Fresh hands-on reports show Codex controlling minimized apps via macOS APIs, using a DOM-aware browser comment mode, and running for day-long sessions in the desktop app. That gives OpenAI stronger evidence that computer use is usable for daily development, though the rollout remains macOS-first and brittle around working-state changes.

releasePRIMARY2026-04-16

Codex adds background computer use on macOS with 90+ plugins and SSH devboxes

OpenAI expanded Codex with background Mac computer use, an in-app browser, image generation, memory preview, automations, and 90+ plugins. The release moves Codex from terminal coding toward long-running UI and ops workflows, though some features remain macOS-first or alpha.

releaseSECONDARY2026-04-16

GPT-Rosalind introduces life sciences reasoning in trusted-access preview

OpenAI launched GPT-Rosalind for biology, drug discovery, and translational medicine, plus a life sciences plugin for Codex. Access starts as a trusted preview for qualified customers, so near-term use is limited to partner and enterprise workflows.

newsSECONDARY2026-04-12

Claude Code reports Opus 4.6 quality drop as BridgeBench retest falls to 68.3%

Fresh retests and issue threads point to worse Claude Code behavior, with Opus 4.6 falling to 68.3% on BridgeBench and users surfacing buried reasoning-effort controls. Track quota burn, hidden effort settings, and rollback reports before assigning more coding-agent work.

releasePRIMARY2026-04-11

Codex 0.120 adds per-project memory extensions and Realtime V2 streaming

Codex 0.120 introduced per-project memory extension files and Realtime V2 progress streaming for background agents. Separate app findings also showed an unreleased Scratchpad view that can start parallel Codex chats from a task list, which may change how teams queue work.

newsPRIMARY2026-04-10

OpenAI rotates macOS app certificates after Axios signing workflow risk

OpenAI said a compromised third-party developer tool affected its macOS app-signing workflow and is rotating certificates for ChatGPT Desktop, the Codex app, Codex CLI, and Atlas. The company said it found no evidence of user-data access or software tampering, and older macOS app versions will stop working after the update window.

newsSECONDARY2026-04-09

OpenAI launches $100 ChatGPT Pro tier with 5x more Codex usage

OpenAI added a $100 ChatGPT Pro tier with 5x more Codex usage than Plus and kept the $200 tier as the highest-capacity option. The new tier resets Codex limits again and temporarily doubles Pro usage through May 31.

newsPRIMARY2026-04-07

OpenAI resets Codex usage limits after 3 million weekly users

OpenAI said Codex reached 3 million weekly users and reset usage limits, with another reset planned for each additional million users up to 10 million. ChatGPT-sign-in Codex will also retire the gpt-5.2 and gpt-5.1-era lineup on April 14, so teams should watch for model-default changes.

newsPRIMARY2026-04-02

Codex adds $0 usage-based seats for ChatGPT Business and Enterprise

OpenAI rolled out Codex-only seats with pay-as-you-go pricing for ChatGPT Business and Enterprise instead of fixed bundled access. The change lowers pilot friction for teams and ties spend directly to coding usage rather than a full ChatGPT seat.