Skip to content
AI Primer
release

Claude Opus 4.8 releases fast mode at 2.5x speed and dynamic workflows

Anthropic released Claude Opus 4.8 with effort control, a fast mode, and Claude Code dynamic workflows for parallel subagents. Hacker News testers reported stronger layout output, but some also saw higher costs and weaker results than Opus 4.7 on specific tasks.

5 min read
Claude Opus 4.8 releases fast mode at 2.5x speed and dynamic workflows
Claude Opus 4.8 releases fast mode at 2.5x speed and dynamic workflows

TL;DR

  • Anthropic shipped Claude Opus 4.8 at the same base API price as 4.7, added user-facing effort control in claude.ai, and introduced a research-preview fast mode that runs at 2.5x speed, according to the main HN thread and the official launch post.
  • Claude Code's new dynamic workflows feature lets the model write orchestration scripts that fan work out across tens to hundreds of parallel subagents, as trq212 pointed out and Anthropic detailed in its dynamic workflows post.
  • One quiet API change may matter just as much for agent builders: the HN discussion roundup highlighted Opus 4.8's new mid-conversation system messages, which the Claude API docs say preserve prompt-cache hits while updating instructions mid-run.
  • Early hands-on reactions split quickly. the HN discussion roundup included one tester who said Opus 4.8 nailed a one-file RTS game and another who got a better crossword layout than previous models, while the same roundup also captured a user who said it performed worse than 4.7 on a data-extraction test.
  • The Claude Code workflow layer is already creating its own micro-conventions, with petergyang complaining about agents.md versus Claude.md, then petergyang and bentossell swapping workarounds for shared repo instructions.

You can read Anthropic's launch post, skim the what's new in Claude Opus 4.8 doc for the buried API changes, and browse the dynamic workflows announcement plus Anthropic's follow-up on workflow harnesses. The HN thread also surfaced two unusually specific smoke tests, a one-file RTS game and a crossword layout check, which is a better read on creative usefulness than another vendor chart.

What shipped

Anthropic Announces Claude Opus 4.8 with New Features and Capabilities

Anthropic has released Claude Opus 4.8, an upgrade to its predecessor that offers benchmark improvements and enhanced collaboration capabilities. The model introduces several key features: an effort control setting in claude.ai that allows users to adjust the depth of the model's processing, a dynamic workflows feature for Claude Code that enables handling large-scale tasks using parallel subagents, and a fast mode for Opus 4.8 that operates at 2.5× speed and is three times cheaper than previous versions. Regular usage pricing remains at $5 per million input tokens and $25 per million output tokens, while fast mode is priced at $10 per million input tokens and $50 per million output tokens. The model is accessible via the Claude API as claude-opus-4-8.

Anthropic framed Opus 4.8 as a same-price upgrade over 4.7, with benchmark gains plus three concrete product changes: effort control in claude.ai, dynamic workflows in Claude Code, and fast mode for Opus 4.8 in research preview, per the official announcement.

The launch pieces split across surfaces like this:

  • Model ID: claude-opus-4-8, per the Claude API docs
  • Base pricing: $5 per million input tokens, $25 per million output tokens, unchanged from 4.7, per the launch post
  • Fast mode pricing: $10 input, $50 output per million tokens, with 2.5x speed, per the launch post
  • Context window: 1M tokens by default on Anthropic API, Bedrock, and Vertex AI, with 200k on Microsoft Foundry, per the what's new doc
  • Max output: 128k tokens, per the what's new doc

Anthropic also slipped in a lower 1,024-token minimum cacheable prompt length in the API docs. That is the kind of small plumbing change that matters more in long agent runs than another marketing adjective.

Dynamic workflows

Anthropic's dynamic workflows announcement says Claude now writes JavaScript orchestration scripts on the fly, then uses them to coordinate tens to hundreds of parallel subagents in one session. The GitHub Claude Code v2.1.154 release describes the same feature more plainly: ask Claude to create a workflow, and view runs with /workflows.

The official material keeps returning to the same three mechanics:

  • Parallelism: workflows spread tasks across many subagents instead of a single long turn loop
  • Verification: Anthropic says Claude checks work before returning it
  • Background execution: the work runs outside the main chat flow, which keeps the session free for other tasks

Anthropic's follow-up post on workflow harnesses adds a useful detail: workflows can choose different models for different agents and can run subagents in isolated worktrees. That pushes Claude Code closer to an orchestration layer than a single assistant.

Prompt cache plumbing

Discussion around Claude Opus 4.8

Thread discussion highlights: - simonw on mid-conversation system messages: “Claude Opus 4.8 accepts role: "system" messages immediately after a user turn… This lets you append updated instructions later in a long-running conversation without restating the full system prompt, which preserves prompt cache hits…” - senko on coding benchmark test: “My fav coding benchmark for frontier models is to build a simple RTS game in one file… Claude Code with Opus 4.8 in ultracode mode nailed it…” - jkxyz on creative output test: “My smoke test for new models is to get it to generate a crossword, and this is the first time it's done a good job on the layout…”

One of the least flashy launch notes was the most immediately practical. As the HN discussion roundup flagged, Opus 4.8 now accepts role: "system" messages immediately after a user turn.

According to the Claude API docs, that means developers can update instructions mid-task without re-sending the whole top-level system prompt and blowing away cache hits on earlier turns. Simon Willison made the same point in his Substack write-up, calling it a way to steer long-running agents without paying the full token tax again.

For people building multi-step creative or coding agents, this pairs neatly with dynamic workflows: one feature spreads work out, the other makes it cheaper to keep changing the rules mid-run.

Smoke tests, not just charts

Discussion around Claude Opus 4.8

Thread discussion highlights: - simonw on mid-conversation system messages: “Claude Opus 4.8 accepts role: "system" messages immediately after a user turn… This lets you append updated instructions later in a long-running conversation without restating the full system prompt, which preserves prompt cache hits…” - senko on coding benchmark test: “My fav coding benchmark for frontier models is to build a simple RTS game in one file… Claude Code with Opus 4.8 in ultracode mode nailed it…” - jkxyz on creative output test: “My smoke test for new models is to get it to generate a crossword, and this is the first time it's done a good job on the layout…”

Anthropic's own copy talks about improved collaboration and "creative mastery," but the most interesting evidence came from small, weird tests in public. In the HN discussion roundup, one commenter said Claude Code with Opus 4.8 in ultracode mode nailed a simple RTS game built in one file, while another said it was the first model to generate a crossword with a good layout.

Claude Opus 4.8

The creative angle is that Anthropic is explicitly talking about ‘creative mastery,’ and commenters are using image/layout-style smoke tests like RTS game generation and crossword generation. The discussion suggests the model may be better at structured creative output and layout than prior versions, but people are still skeptical of benchmark claims and want more evidence from real tasks.

Those are narrow tests, but they are concrete. They suggest the upgrade may be stronger on structured output where layout, consistency, and multi-part coordination all matter at once.

The pushback showed up just as fast. the HN discussion roundup also quoted a user who said 4.8 did worse than 4.7 on a data-extraction task and felt almost 2x as expensive in their own usage. Meanwhile thekitze posted a much less measured verdict.

Claude.md versus AGENTS.md

A separate mini-story broke out around the instruction files people are using to steer coding agents. petergyang complained that Codex uses agents.md while Claude Code uses Claude.md, and the post drew thousands of likes because everyone has already run into the naming mess.

The workarounds arrived immediately. petergyang said he now points Claude.md at a shared AGENTS.md, while bentossell said he keeps a root agents file that tells subagents to reuse that structure. Even danshipper's repost of a Claude Code plugin for hands-off QA hints at the next layer here: once teams start stacking custom commands, shared skills, and workflow files, the file convention becomes part of the product.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 3 threads
TL;DR1 post
Smoke tests, not just charts1 post
Claude.md versus AGENTS.md2 posts
Share on X