TOPIC 50 stories

Coding Agents

Stories about coding-agent products and patterns, including Claude Code, Codex, Cursor, and similar tools.

Stories

OpenAI renames Codex Superapp to ChatGPT Work in desktop app

Users reported that the former Codex Superapp now appears as ChatGPT Work, with Codex inside the ChatGPT desktop flow. Early posts flagged broken Sites support and confusing sidebar states.

WORKFLOW 9th July

Claude Code streams cloud iOS simulator into browser via serve-sim

Levelsio used Claude Code on a VPS to SSH into a cloud Mac Mini, build an iOS app headlessly, and stream serve-sim into a browser. Follow-up posts documented the tunnel and serve-sim setup, making the workflow reproducible for remote iOS testing.

RELEASE 8th July

xAI releases Grok 4.5 for coding agents at $2/$6 per million tokens

xAI announced Grok 4.5 as a coding-and-agent model available in the SpaceXAI console, Grok Build, and Cursor. Posts cite $2 per million input tokens, $6 per million output tokens, and delayed EU availability.
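As a rough illustration of what the quoted rates imply, the sketch below estimates the cost of one agent session. Only the $2 and $6 per-million-token rates come from the posts; the function name and the example token counts are hypothetical.

```python
# Rates quoted in posts for Grok 4.5 (USD per million tokens).
INPUT_RATE_PER_MILLION = 2.00
OUTPUT_RATE_PER_MILLION = 6.00


def session_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one session from its token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_RATE_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_RATE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical session: 3M input tokens and 500K output tokens.
    print(f"${session_cost(3_000_000, 500_000):.2f}")  # prints $9.00
```

Output tokens cost three times as much as input tokens at these rates, so long generated diffs move the total more than large context reads of the same size.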

WORKFLOW 7th July

Kitze tests codex-lb factory with 4 Codex accounts and 30 isolated tasks

Kitze described using a $60 Hetzner server, four $200 Codex accounts behind codex-lb, self-hosted Paperclip workspaces, and a GPT-5.5 manager to run about 30 isolated tasks. The setup routes parallel coding work across isolated workspaces and paid Codex accounts.
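The posts do not describe how codex-lb chooses an account for each task, so the sketch below is only a guess at the general idea: a plain round-robin spread of about 30 tasks over four accounts. The function name, task names, and account names are all hypothetical and are not part of codex-lb.

```python
from collections import defaultdict
from itertools import cycle


def assign_round_robin(tasks, accounts):
    """Assign each task to the next account in turn (assumed policy)."""
    assignments = defaultdict(list)
    # zip stops once the tasks run out; cycle repeats the accounts forever.
    for task, account in zip(tasks, cycle(accounts)):
        assignments[account].append(task)
    return dict(assignments)


if __name__ == "__main__":
    tasks = [f"task-{i}" for i in range(1, 31)]       # 30 hypothetical tasks
    accounts = [f"account-{i}" for i in range(1, 5)]  # 4 hypothetical accounts
    for account, assigned in assign_round_robin(tasks, accounts).items():
        print(account, len(assigned))
```

With 30 tasks and four accounts, two accounts receive eight tasks and two receive seven. A real balancer would more likely weight assignments by each account's remaining rate limit than use a fixed rotation.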

WORKFLOW 6th July

Taste rebuilds landing pages from screenshots with Codex or Claude Code

Meng To showed Taste with Codex or Claude Code rebuilding landing pages from screenshots and videos. Voila also launched a tool that inspects UI and turns edits into code, copy, prompts, tasks, or agent handoffs.

DEAL 3rd July

Posts report included Claude Fable 5 access ends July 7, ahead of $10/$50 token pricing

Posts report that Fable 5 remains included in Claude only through July 7, after which it moves to credits priced at $10/$50 per million tokens. Watch usage closely: tests show roughly twice the consumption of Opus in subagent and one-shot app workflows.

RELEASE 1w ago

OpenClaw releases iOS and Android apps with secure pairing and push alerts

OpenClaw's maintainers say the community-built iOS and Android apps are live, with secure pairing and push notifications. The rollout also exposed staffing, funding and plugin-support limits, so the project is still looking for contributors.

DEAL 3w ago

Claude Code raises 5-hour and weekly usage limits for the weekend

ClaudeDevs said it raised 5-hour and weekly usage caps for the weekend across every plan. The change lands as users report token burn, overage charges, and heavier Claude Code sessions driven by long agent runs and goal-checked workflows.

NEWS 3w ago

Codex tests report self-service web signups and folder overwrite failures

On June 14, users documented Codex handling self-service signups and repo-maintenance loops, along with folder overwrite failures. Watch the wrapper update closely, since it also added rate-limit reset banking and browser dev mode around the same workflow.

NEWS 4w ago

Anthropic removes Fable 5 and Mythos 5 access after US directive

Anthropic said a US government directive forced it to disable Fable 5 and Mythos 5 across Claude products and APIs. The change also pushed Build Day and downstream tooling to Opus 4.8, breaking active Fable sessions and triggering fallbacks in tools like Linear Agent.

NEWS 4w ago

Claude Fable 5 builds RC car in Autodesk Fusion after 3 prompts

New creator demos pushed Claude Fable 5 into CAD, landing pages, and web game ports, including an Autodesk Fusion RC car built in three prompts. Watch for longer runs to trip safeguards and fall back to Opus 4.8.

RELEASE 4w ago

Grok Build launches Plugin Marketplace beta with MongoDB, Vercel, and Sentry

Grok Build opened a beta plugin marketplace and added official MongoDB, Vercel, and Sentry integrations alongside Chrome DevTools and Cloudflare support. The beta turns deployment, database, debugging, and observability tools into installable terminal actions.

WORKFLOW 4w ago

Creators claim Claude Fable 5 built a GTA 2 clone in 2 hours with Tripo 3D assets

Creators pushed Claude Fable 5 into browser platformers, a GTA 2 clone, a SNES port, CAD models, and a webcam fruit-slicing game. The demos show playable prototypes can now come from prompts plus a few follow-up fixes, so creators can move faster from idea to test build.

RELEASE 4w ago

Claude Fable 5 launches with Opus 4.8 fallback and 2.1.170 support

Anthropic opened Claude Fable 5 across Claude Code, Desktop, Cowork, and API, with always-on reasoning and Opus 4.8 fallback on some flagged requests. Early demand triggered model-picker friction and quota pressure, so Anthropic reset the 5-hour and weekly limits the same day.

WORKFLOW 4w ago

Creators claim Claude Fable 5 generated an 8,000-line Pokémon game from one prompt

Community demos showed Claude Fable 5 generating playable games and simulations from short prompts, image refs, and goal-based instructions, from Pokémon and F-Zero to city sims and FPS clones. The demos make the model’s creative ceiling clearer, but builders still needed follow-up prompts for speed, style, or polish.

NEWS 4w ago

Anthropic staff describe Claude Code workflow shifts one year after GA: auto mode, routines, /usage

Anthropic staff said Claude Code usage has shifted toward auto mode, routines, and phone-based coding one year after GA, and they pointed users to /usage for token breakdowns. The thread matters because it shows Anthropic’s intended daily-driver workflow as community comparisons with Codex intensify.

WORKFLOW 4w ago

Practitioner threads report loop-based agent workflows for coding, PR, and sales proposals

Practitioner posts describe loop-based agent systems for coding, PR, sales proposals, and app building, including Kun Chen’s 40-PR-a-day setup, a nine-part vertical-agent framework, and Netlify agent runner builds. Builders can use these patterns to move from single prompts to orchestrated systems with planning, memory, evals, and human checkpoints.

WORKFLOW 4w ago

Codex builds a 2D RPG from one prompt in creator tests

Creators showed Codex building a retro 2D RPG, handling browser-based social posting, and replacing parts of the PRD workflow with working prototypes. Users also reported thread-sorting and design limits in the app, so watch where it still breaks down.

RELEASE 1mo ago

Anthropic releases ant CLI for Claude Platform and Managed Agents tracing

Anthropic released the ant CLI so Claude Platform APIs, file uploads, and Managed Agents sessions can run from the terminal, then updated Claude Code so /fork starts a background agent with the same context and prompt cache. Teams can use it to script agent runs, inspect traces, and hand work between Claude Code and the platform.

RELEASE 1mo ago

OpenAI introduces Codex Sites for shareable interactive apps

OpenAI rolled out Codex Sites so plans, docs, dashboards, and creative concepts can become interactive apps with shareable URLs. The launch also added role-specific plugins, so teams can use the same workspace for thumbnail pulls, transcription, and other nontechnical tasks.

NEWS 1mo ago

Claude Code fixes parallel subagent overload and resets 5-hour limits

Anthropic reset 5-hour and weekly usage caps for Pro and Max users after a request-handling bug spawned too many parallel tool calls. If you hit the issue, retry now and expect the Opus 4.8 request path to behave more normally.

WORKFLOW 1mo ago

Codex supports 56-hour tasks as builders report passkey and browser failures

Codex users shared 56-hour task runs, PM-to-PR workflows, and a new black-box session recorder for tracking drift, token use, and incomplete responses. The longer autonomous sessions matter because browser auth gaps, passkey failures, and tool-selection bugs become real blockers once Codex is used beyond quick code generation.

RELEASE 1mo ago

Claude Opus 4.8 adds mid-conversation system messages with prompt caching

Anthropic documented mid-conversation system messages and automatic cache preservation in Opus 4.8, while Claude Code and Cowork gained /effort controls. Try the new workflow controls if you rely on long sessions, since they may matter more operationally than the raw model bump.

RELEASE 1mo ago

grok-build-0.1 launches public beta at $1 input and $2 output per million

grok-build-0.1 moved from subscriber beta into xAI's public API at $1 input and $2 output per million tokens, with integrations across Cursor, OpenRouter, and several agent clients. Watch for app and UI builds now that access is no longer gated to X premium users.

RELEASE 1mo ago

Anthropic releases Claude Opus 4.8 with Claude Code dynamic workflows

Anthropic rolled out Claude Opus 4.8 across Claude Code, the API, and partner surfaces, plus a research-preview workflow mode that coordinates large subagent fleets. It keeps 4.7 pricing, but early tests suggest workflow runs can burn very large token budgets, so teams should watch usage closely.

DEAL 1mo ago

Cursor cuts Composer 2.5 SDK price 90% for the weekend

Cursor says Composer 2.5 is 90% off in the SDK this weekend. Same-day posts also split between fast one-shot SDK builds and complaints about dropped connections, project controls, and agent follow-through compared with Codex, so test reliability before committing.

RELEASE 1mo ago

Anthropic previews /usage token breakdown for Claude Code Skills, Agents, MCPs, and Plugins

Anthropic previewed a Claude Code update that shows token usage by Skills, Agents, MCPs, and Plugins in the CLI, with desktop support planned later. The command targets skills-heavy workflows where token costs are hard to attribute across multiple moving parts.

RELEASE 1mo ago

Claude Code updates /fast to Opus 4.7 at ~2.5× response speed

Anthropic made /fast default to Opus 4.7, added prompt cache diagnostics in Claude Console, published scale guidance for large codebases, and renamed extra usage to usage credits. The bundle gives teams clearer controls over latency, cache misses, and paid usage while running long Claude Code sessions.

RELEASE 1mo ago

Skilled launches CLI and TUI audits for Claude Code, Codex and Grok Build

Skilled launched a CLI and TUI that scans installed skills across Claude Code, Codex, Droid, OpenCode and Grok Build. It surfaces dead skills, single-project dependencies and usage by agent or project, so teams can clean up skill sprawl.

NEWS 1mo ago

Codex updates paid-plan limits with a weekend reset

Posts said Codex usage limits were reset across paid plans as users shared Mac app feedback, browser control, and repo-review results. The examples show Codex being used as a daily driver for debugging and code audit work, so watch the limits if you rely on it for regular use.

NEWS 1mo ago

Claude Code resets 5-hour and weekly limits after slowdown complaints

Anthropic's ClaudeDevs account said it reset everyone's 5-hour and weekly rate limits. The reset landed alongside paid-user complaints about slow sessions and visible switching pressure toward Codex, without a root-cause or permanent policy change attached.

RELEASE 1mo ago

ChatGPT adds Codex mobile preview for diffs, approvals, and long-running agent work

ChatGPT opened a Codex mobile preview that lets users review outputs, approve commands, inspect diffs, and steer long-running agent work from a phone. It matters because Codex jobs no longer stay desktop-bound, though early users say the flow can still depend on a battery-draining host machine and a clunky app UI.

RELEASE 1mo ago

Grok Build opens beta with /loop, /imagine, and best-of-n agent workflows

xAI opened an early Grok Build beta for SuperGrok Heavy users, and early testers surfaced /loop, /imagine, best-of-n, self-checking, and memory features. It matters because xAI is moving from chat into a coding-and-automation CLI surface aimed at shipping apps and workflows.

RELEASE 1mo ago

Claude Code adds /goal checks for long-running tasks

Anthropic added /goal to Claude Code for completion-checked runs, alongside /loop, /schedule, stop hooks, auto mode guidance, and an Opus 4.7 fast mode preview. Use /goal when a session needs to keep working until a defined condition is met; fast mode is opt-in now and becomes the default on Thursday.

RELEASE 2mo ago

Claude Code introduces Agent View for parallel sessions and skills dispatch

Anthropic opened Agent View as a research preview, giving Claude Code one control pane for parallel sessions, skills dispatch, and quick replies. The change makes multi-session supervision a native workflow instead of a terminal-tab workaround.

WORKFLOW 2mo ago

Hermes Agent adds 5-pillar personal-OS setups with 136K compaction rules

New Hermes Agent and Claude Code playbooks mapped memory, skills, soul, crons, and nightly GitHub sync into repeatable personal-OS setups. The guides push agent workflows into daily content and admin tasks while surfacing security and stale-memory failure modes.

WORKFLOW 2mo ago

UI-TARS opens desktop control with accessibility API workflows

UI-TARS resurfaced as an open-source desktop-control stack while Opendesk described using accessibility APIs and marked elements instead of raw pixel guesses. The approach makes computer-use workflows more repeatable, but it still depends on human-oriented interfaces.

NEWS 2mo ago

Anthropic promises Claude Code debug logs after reports of 3-minute hangs

Creators reported Claude Code sessions hanging for minutes with no status feedback, and an Anthropic engineer said responsiveness improvements and self-serve debug logs are on the way. Users also say Claude Desktop now shows context-window usage, giving long sessions a clearer limit indicator.

RELEASE 2mo ago

Claude Code fixes 60-plus bugs in 1M-context sessions

Anthropic says this week's Claude Code release fixes long-running session bugs across 1M-context prompts, caching, auth fallbacks, MCP retries, and terminal rendering. The update targets the stability problems that surface in longer agent runs and heavier IDE workflows.

NEWS 2mo ago

Builders demo 21-agent Claude Code pipelines at Code with Claude San Francisco

At Code with Claude San Francisco, builders showed Claude Code running 21-agent app pipelines across Figma, Jira, Confluence, and TestFlight. Users should watch for reliability strain as posts and conference recaps tie recent slowdowns to Anthropic's reported 80x growth.

RELEASE 2mo ago

OpenAI Codex updates CLI v0.129.0 with Vim mode and hook controls

OpenAI Codex CLI v0.129.0 adds Vim mode, redesigned resume flows, stronger plugin management, and hook controls, while GOALS also reached the Linux app. The update makes long-running refactors and persistent task loops more structured across CLI and app use.

WORKFLOW 2mo ago

Claude Code supports 21-agent app builds with 72-minute App Store demos

Builder demos show Claude Code being run as a structured team of specialist agents, including a 21-agent setup that Aakash Gupta says shipped from idea to App Store submission in 72 minutes. The workflow shifts the bottleneck from typing code to specs, review, and product judgment, making Claude Code look more like a product-build system than a code assistant.

RELEASE 2mo ago

SubQ launches 12M-token SSA model with SubQ Code early access

SubQ launched a sub-quadratic sparse-attention model with a 12 million token context window and opened early access alongside SubQ Code. The company claims 52x faster 1M-token performance than FlashAttention and under 5% of Opus cost, putting long-context coding workflows into a new price and latency band.

NEWS 2mo ago

Users report OpenAI Codex raises limits 10x on May 5 reset

Users report OpenAI increased Codex limits about 10x on the May 5 reset, with much longer /goal sessions and more computer-use demos. That should extend unattended runs for app migrations and visual prototyping.

RELEASE 2mo ago

Claude Code adds phone push notifications and 67% faster /resume

Claude Code added phone push notifications for long-running tasks and published 50-plus stability fixes across recent CLI releases, including up to 67% faster /resume on large sessions. The update matters because mobile alerts, auth fixes and lower-memory startup remove several workflow interruptions from daily coding use.

NEWS 2mo ago

Claude Code limits Pro plans to Sonnet 4.5 and moves Opus behind extra usage

Anthropic support docs now say Claude Pro users in Claude Code need extra usage to access Opus, with Sonnet 4.5 as the default. Separate user posts report mismatched receipts and an unverified $200 overage case, making spend harder to predict.

RELEASE 2mo ago

OpenClaw adds voice personas with 43ms first output benchmarks

OpenClaw contributors posted a voice-persona feature and fresh performance numbers that cut first output from 1s to 43ms. Separate posts describe 300-user sandboxed deployments and stronger PR, CI, and testing workflows, pointing to team-scale use beyond hobby demos.

RELEASE 2mo ago

Claude Code adds web-mobile session handoff with claude --teleport

Anthropic refreshed Claude Code on web and mobile with a sessions sidebar, routines view, faster responses, and claude --teleport handoff to the CLI. Use it to start work on web or mobile and continue in a terminal with branch state intact.

RELEASE 2mo ago

Codex App Server adds Fedora RPM support for Linux installs

Codex App Server added a Fedora RPM package for Linux installs as users pushed Codex into browser control, 3D-print setup, and rapid game prototypes. Watch for more repeatable desktop workflows as Codex moves beyond chat-only experiments.

NEWS 2mo ago

Claude Code fixes regressions in v2.1.116 and resets usage limits

Anthropic published a post-mortem on Claude Code regressions, said the problems lived in the harness rather than the models or API, and reset subscriber usage limits after fixes. The update matters for long agent sessions because recent complaints centered on reliability, wasted effort, and broken trust.