Skip to content
AI Primer
update

Claude users report silent fallback and 30-day retention after Fable 5 launch

Anthropic said flagged frontier-LLM requests will visibly fall back to Opus 4.8 after complaints about hidden downgrades and 30-day retention. If you run Claude in production, watch for fallback behavior and verify retention settings before deployment.

5 min read
Claude users report silent fallback and 30-day retention after Fable 5 launch
Claude users report silent fallback and 30-day retention after Fable 5 launch

TL;DR

You can read Anthropic's launch post, inspect the AWS Bedrock model card, and then compare that paperwork with Hangsiin's system card excerpt, ClaudeDevs' reversal, and the Hacker News launch thread. The weird part was not that Fable had safeguards. It was that Anthropic launched a flagship model with hidden fallback behavior, mandatory retention, and a retry-credit document waiting in the wings.

Fable 5 and Mythos 5

Anthropic framed Fable 5 as the generally available version of its new Mythos-class model. According to claudeai's launch post, it was the most capable model the company had made broadly available.

The split mattered immediately:

Anthropic's own setup guide added a few model-level changes that help explain why users noticed the release so fast. ClaudeDevs' getting-started thread said thinking is always on, effort controls are still available, and older Claude prompts or skills may now be too prescriptive.

Hidden fallback

The launch post disclosed fallback for cyber, bio, chemistry, and distillation topics. The sharper detail came from the system card excerpt that Hangsiin pulled out: frontier-LLM-development requests could be silently limited through prompt modification, steering vectors, and PEFT, and Anthropic estimated that path would touch about 0.03% of traffic.

That 0.03% estimate collided with a much broader-looking field reaction. GergelyOrosz's SemiAnalysis example claimed SemiAnalysis had already triggered the safeguard, while eliebakouch's critique argued the public evidence for the classifier threshold was thin and the scope of affected work was unclear.

The credibility problem was simple. Users were paying for Fable 5 without an obvious way to tell whether a given response came from the full model, a steered version, or Opus 4.8 instead.

Visible rollback

Anthropic reversed course less than two days later. In ClaudeDevs' apology, the company said invisible safeguards were the wrong tradeoff and that flagged requests would visibly fall back to Opus 4.8, matching its existing cyber and bio behavior.

That change landed after a pile of concrete complaints:

Anthropic's official line now matches the support flow: ClaudeDevs' apology said API refusals will return a reason, and the appeals article gives separate reporting paths for Claude Code, Claude surfaces, and API requests.

30-day retention

The other launch-day shock was contractual, not model-behavioral. The official launch post says Anthropic now requires 30-day retention for Fable 5, Mythos 5, and future models with similar or higher capability levels across both first- and third-party surfaces, while also saying the data will not be used to train new Claude models.

That clause spilled directly into infrastructure docs. The AWS Bedrock model card says Fable 5 has a 1 million token context window, 128K max output, always-on adaptive thinking, and a required provider_data_share retention mode. It also says there was no console UI for that setting at launch.

Community reaction focused on the compatibility break. scaling01's retention post argued zero-retention customers were effectively locked out, and GergelyOrosz's open-models post said the combination of stricter usage controls, less transparency, and stored prompts pushed him toward open models and local inference.

Fallback credit

The cleanest sign that Anthropic expected refusal-and-retry loops may be the AWS paperwork, not the tweets. The AWS fallback credit page documents a beta token, fallback-credit-2026-06-09, that prevents users from paying cache-write charges twice when a refused Fable 5 request is retried on another model such as Opus 4.8.

That page adds mechanics the launch thread never mentioned:

  • Fable 5 refusal responses can issue a fallback_credit_token.
  • The retry has to match the original system, messages, and tools, or append the refused partial output as a continuation.
  • The token expires after five minutes.
  • Without the beta flag on both calls, the token is rejected.

AWS also surfaced a related community complaint fast enough to spawn its own discussion thread. The Hacker News Bedrock thread centered on the requirement to share data with Anthropic for Mythos-class models, which put the retention policy in front of infrastructure teams that might never read a 300-page system card.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 5 threads
TL;DR2 posts
Fable 5 and Mythos 54 posts
Hidden fallback2 posts
Visible rollback3 posts
30-day retention2 posts
Share on X