Agent Infrastructure
Backend primitives and platform services designed for autonomous agents as the primary consumer — agent-native storage, sandboxes, queues, and runtime infra.
Stories
Filter storiesCodex users shared 56-hour task runs, PM-to-PR workflows, and a new black-box session recorder for tracking drift, token use, and incomplete responses. The longer autonomous sessions matter because browser auth gaps, passkey failures, and tool-selection bugs become real blockers once Codex is used beyond quick code generation.
Google said AI Studio can now build Android apps directly, add managed agents, and export projects into Antigravity with one click. Paired with Antigravity’s new 2.0 app, CLI, and SDK, the stack moves Google’s prompt-to-product workflow beyond chat demos into runnable apps and multi-agent builds.
Skilled launched a CLI and TUI that scans installed skills across Claude Code, Codex, Droid, OpenCode and Grok Build. It surfaces dead skills, single-project dependencies and usage by agent or project, so teams can clean up skill sprawl.
Anthropic said paid Claude plans will get a dedicated monthly budget for Agent SDK, claude -p, GitHub Actions, and Agent SDK apps starting June 15. Keep chat use and programmatic use separate, and note the temporary 50% weekly Claude Code increase through July 13.
Holaboss launched holaOS Beta 0.1 with permanent workspaces, sub-agents, and a dashboard for recurring tasks and review loops. Use it if you want persistent project memory instead of reset-every-run agent chats, though the evidence is mostly launch-thread documentation.
Anthropic opened Agent View as a research preview, giving Claude Code one control pane for parallel sessions, skills dispatch, and quick replies. The change makes multi-session supervision a native workflow instead of a terminal-tab workaround.
Anthropic said a SpaceX partnership will add compute capacity, and it doubled Claude Code 5-hour limits for paid plans. It also removed peak-hour reductions for Pro and Max and raised Opus API limits; the change should reduce throttling for heavy users.
Anthropic introduced dreaming as a research preview in Claude Managed Agents alongside multiagent orchestration, rubric-based self-improvement, and webhook updates. Sub-agents now share a container and filesystem, so teams can coordinate longer-running work and manage memory across sessions.
SubQ launched a sub-quadratic sparse-attention model with a 12 million token context window and opened early access alongside SubQ Code. The company claims 52x faster 1M-token performance than FlashAttention and under 5% of Opus cost, putting long-context coding workflows into a new price and latency band.
Weekend builder posts showed OpenAI Codex using /goal to keep working across turns, with Linux clients and ephemeral runner tools extending longer sessions. It matters for vibe-coders packaging Codex into unattended loops, but usage limits and community wrappers still vary by plan and platform.
Cross-author demos showed Hermes using self-rewriting skill files, timeboxed subagents, and recurring brief workflows that improved over repeated runs. It matters because creators and vibe-coders can compound agent behavior across sessions, though the evidence still comes from user-run setups rather than a full official product brief.
Multiple practitioners showed Codex reviewing every main-branch commit, spawning fix loops, and opening browser sessions when APIs or web apps blocked the normal path. The workflow matters because Codex is being used as a browser-native coworker for coding, writing, analytics, and media plugins, but the pattern is emerging from user experiments rather than a formal OpenAI release.
OpenClaw contributors posted a voice-persona feature and fresh performance numbers that cut first output from 1s to 43ms. Separate posts describe 300-user sandboxed deployments and stronger PR, CI, and testing workflows, pointing to team-scale use beyond hobby demos.
Anthropic published a post-mortem on Claude Code regressions, said the problems lived in the harness rather than the models or API, and reset subscriber usage limits after fixes. The update matters for long agent sessions because recent complaints centered on reliability, wasted effort, and broken trust.
Elon Musk said X API access is now available through OpenClaw, while users posted travel-assistant and always-on marketing setups built around the tool. The new access broadens what OpenClaw agents can automate, but most concrete examples still come from operator threads rather than product docs.