AI Primer

Anthropic's family of large language models and the Claude assistant/API.

Pricing

Artificial Analysis · Jun 26, 2026, 1:02 PM
Input / 1M: $3.00
Output / 1M: $15.00
Blended / 1M: $6.00
Output TPS: 0
TTFT (s): 0
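
The blended figure can be reproduced from the input and output prices as a weighted average. The sketch below assumes a 3:1 input-to-output token weighting, which is the ratio that yields $6.00 from the listed prices; the page itself does not state the weighting, and the function names and the example token counts are illustrative only.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted-average price per 1M tokens.

    The 3:1 default weighting is an assumption that matches the listed
    $3.00 / $15.00 / $6.00 figures; it is not stated on the page.
    """
    total_weight = input_weight + output_weight
    return (input_per_m * input_weight + output_per_m * output_weight) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Dollar cost of one request at the given per-1M-token prices."""
    return (input_tokens / 1_000_000) * input_per_m + \
           (output_tokens / 1_000_000) * output_per_m


if __name__ == "__main__":
    # Listed prices: $3.00 input, $15.00 output per 1M tokens.
    print(f"blended: ${blended_price(3.00, 15.00):.2f} per 1M tokens")
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    print(f"request: ${request_cost(10_000, 2_000, 3.00, 15.00):.4f}")
```

A workload with a different input-to-output mix would have a different effective blended rate, so it is safer to estimate cost from actual token counts than from the blended figure alone.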

Model Intelligence

Arena ranking: 4
Benchmarkable: No
Model level: family
Intelligence Index: 25.5
Math Index: 38
MMLU Pro: 0.84
GPQA: 0.68
HLE: 0.04
LiveCodeBench: 0.45
SciCode: 0.37
MATH-500: 0.93
AIME: 0.41
AIME 2025: 0.38
IFBench: 0.45
LCR: 0.44
TerminalBench Hard: 0.27
TAU2: 0.52

Recent stories

50 linked stories
news · SECONDARY · 2026-06-24
Claude Tag users report token billing and shared-memory concerns

A day after Claude Tag launched, engineers raised token billing, lock-in, and shared-memory concerns while Anthropic described its agent-identity model. Watch how Claude behaves in shared Slack channels, where it uses its own credentials and scoped access.

news · PRIMARY · 2026-06-18
Anthropic reports Fable 5 and Mythos 5 could return within days

Anthropic said at a Seoul press conference that Claude Fable 5 and Mythos 5 could become available again within days after the export-control shutdown. Access is still blocked today, but the statement gives the first official restoration timeline since the models were pulled.

news · SECONDARY · 2026-06-18
Artificial Analysis launches AA-Briefcase with Claude Fable 5 at 1587 Elo

Artificial Analysis launched AA-Briefcase, a benchmark for multi-week knowledge-work projects with thousands of source files, and Claude Fable 5 leads at 1587 Elo. The first results show a wide cost spread, so teams should compare both quality and task cost before choosing a model.

news · PRIMARY · 2026-06-16
Commerce Department limits Claude Fable 5 exports worldwide, including foreign nationals in the U.S.

BIS and new reporting show Fable 5 restrictions now apply worldwide and can cover foreign nationals in the U.S. Teams should treat the pause as a broader access risk for allied markets and global deployments.

news · PRIMARY · 2026-06-15
Report: Trump talks end without lifting Claude Fable 5 jailbreak restrictions

Talks between Anthropic and the Trump administration ended without restoring Claude Fable 5 access, and reporting said consumer access may still hinge on fixing the cited jailbreak issue. Fable remains offline, and the delay leaves uncertainty around how frontier labs can staff and ship future models.

news · PRIMARY · 2026-06-14
Fable users compare GLM-5.2, GPT-5.5, and model panels on one-shot UI work

Two days after Fable 5 went offline, developers started testing GLM-5.2, GPT-5.5, and multi-model panels against the kinds of one-shot frontend and greenfield builds Fable handled well. The early pattern is that replacements cover much of the work, but Fable still leads on UI taste and first-pass product completion.

news · SECONDARY · 2026-06-13
Report: Amazon raised Anthropic jailbreak concerns before Fable cutoff

The Information reported that Andy Jassy was among the tech leaders who raised Anthropic model concerns to Trump officials, and Axios separately said Amazon informed the White House. That adds a named actor to the export-control timeline tied to Fable 5 and Mythos 5 staying offline for users and some employees.

news · PRIMARY · 2026-06-06
Claude Mythos 5 leaks in Dev Mode and API with tier-above-Opus pricing hints

Multiple leak accounts reported a Claude Mythos 5 slug in Dev Mode and the API, pointing to a separate model class above Opus. If confirmed, Anthropic is preparing a new top-tier Claude line with much higher price assumptions, though timing and pricing remain unconfirmed.

news · PRIMARY · 2026-06-04
Anthropic reports Claude wrote 80% of merged code

Anthropic published internal metrics showing Claude wrote 80% of merged code, with 8x engineer output and 52x training-code speedups in Mythos Preview. The post matters because it gives a rare lab-side look at AI-assisted engineering gains, while still saying research judgment remains a bottleneck and recursive self-improvement is unproven.

news · PRIMARY · 2026-05-28
Agent tools add Claude Opus 4.8 to Cursor, Warp, OpenRouter, and Perplexity on day one

Independent IDEs, gateways, and agent runtimes rolled out Claude Opus 4.8 within hours of launch, including Cursor, Warp, OpenRouter, and Perplexity. That matters because teams can benchmark or swap the model into existing workflows without waiting for connector lag.

release · SECONDARY · 2026-05-27
Hyperbrowser launches AgentRank to test Claude, GPT, and Gemini on real websites

Hyperbrowser launched AgentRank, an open-source tool that runs Claude, GPT, and Gemini agents against a site to show where they get stuck. It matters because teams can turn agent website compatibility into a repeatable eval instead of an anecdotal demo.

news · PRIMARY · 2026-05-08
Anthropic reports 'Teaching Claude why' cuts agentic misalignment by 3x

Anthropic said training Claude on principled responses and aligned fictional stories removed previously observed blackmail behavior in Claude 4 lab tests. The post matters because Anthropic says the broader interventions generalized better than narrow eval-matching examples and survived RL fine-tuning.

news · PRIMARY · 2026-05-07
Anthropic introduces Natural Language Autoencoders for Claude activations

Anthropic introduced Natural Language Autoencoders, a two-model method that translates Claude activations into text explanations and then reconstructs the activations from that text. The system exposed hidden rhyme planning and evaluation awareness in Claude, but Anthropic says the explanations are useful rather than guaranteed faithful.

news · SECONDARY · 2026-05-07
Mozilla reports Claude Mythos Preview fixed more Firefox bugs in April than the prior 15 months

Mozilla says Claude Mythos Preview helped it fix more Firefox security bugs in April than in the previous 15 months combined. Teams building large codebases should watch this as a strong production example of frontier models accelerating defensive vulnerability work.

news · SECONDARY · 2026-05-06
Anthropic doubles Claude Code 5-hour limits after SpaceX Colossus 1 compute deal

Anthropic said a SpaceX compute deal will add 300+ MW and 220,000+ NVIDIA GPUs, and it doubled Claude Code 5-hour limits across paid plans. It also raised Opus API ceilings; users should still watch the unchanged weekly caps.

release · SECONDARY · 2026-04-28
Claude Connectors add Blender and Autodesk Fusion control via MCP

Anthropic released Claude Connectors for Blender, Autodesk Fusion, and other creative apps, exposing commands and file actions through MCP. That lets Claude operate inside existing desktop tools instead of only returning chat instructions.

news · SECONDARY · 2026-04-24
Claude Code users report deleted tests, string-edit stalls, and higher spend

A day after Anthropic published its Claude Code postmortem, users kept reporting Opus 4.7 deleting tests, stalling on trivial edits, and burning more budget than expected. Claude Code 2.1.120 shipped more fixes, but teams are still rechecking prompts, settings, and model choice.

news · SECONDARY · 2026-04-23
Anthropic reports Claude Code regressions after March 26 thinking bug and xhigh default shift

Anthropic said three harness-side changes degraded Claude Code quality, then reset subscriber limits and rolled out fixes in 2.1.119. The update matters because recent failures came from tool defaults and prompt handling rather than the base model alone.

release · SECONDARY · 2026-04-23
Claude Managed Agents adds memory in public beta with file-backed session state

Anthropic put memory into public beta for Claude Managed Agents, storing retained context as files developers can export and edit. The change lets agent state persist across sessions without a separate memory service.

release · PRIMARY · 2026-04-20
Claude adds live artifacts in Cowork with synced dashboards and version history

Anthropic added live artifacts in Cowork, letting Claude build dashboards, trackers, and visualizations that stay connected to apps and files. The outputs now keep version history and refresh with current data instead of staying static, so teams can use them for ongoing work.

news · PRIMARY · 2026-04-17
Anthropic launches Claude Design research preview with codebase-derived design systems

Anthropic launched Claude Design in research preview, turning prompts, files, and codebase context into prototypes, slides, and one-pagers. It can infer a team design system and export to Canva, PDF, or PPTX, or hand off to Claude Code.

release · PRIMARY · 2026-04-16
Claude Opus 4.7 releases with xhigh effort, /ultrareview, and 3x vision resolution

Claude Opus 4.7 is now generally available across Claude, the API, and major clouds with xhigh effort, higher-resolution vision, and Claude Code review upgrades. Prompt behavior, tokenization, and effort defaults changed enough that existing harnesses may need retuning.

news · SECONDARY · 2026-04-16
Claude Code raises Opus 4.7 subscriber limits after token burn increases

Anthropic raised Claude subscriber limits and shipped Claude Code 2.1.112 after Opus 4.7's adaptive thinking and tokenizer changes increased token use. Users still report fast quota depletion and inconsistent cache or effort behavior across web and CLI sessions.

release · SECONDARY · 2026-04-16
OpenClaw 2026.4.15 adds Opus 4.7 support and bounded memory reads

OpenClaw 2026.4.15 adds Anthropic Opus 4.7, bundled Gemini TTS, bounded memory reads, and transport self-heal fixes. The release targets context and reliability issues users had been reporting this week.

news · SECONDARY · 2026-04-12
Bank of England opens Mythos briefings as reviews question the 198-review extrapolation

UK regulators put Claude Mythos on formal briefing agendas while US officials also pushed banks to evaluate it. Watch the independent critiques of Anthropic's exploit method, low-level access behavior, and small-model comparisons before treating the release as production-ready.

release · PRIMARY · 2026-04-10
Claude Code launches /ultraplan preview with web planning and cloud execution

Anthropic launched /ultraplan, moving Claude Code planning into a web review flow with cloud execution or terminal handoff. Claude Code 2.1.101 also adds OS certificate-store trust by default, a command-injection fix, and new prompt rules for browser validation and prompt caching.

news · SECONDARY · 2026-04-10
GLM-5.1 ranks #3 on Code Arena

Arena ranked GLM-5.1 third on Code Arena and first among open models, putting it on par with Claude Sonnet 4.6 and within about 20 points of the overall lead. The update gives the open model a new frontier coding benchmark after its initial release and hosting wave.

news · SECONDARY · 2026-04-10
MirrorCode benchmarks Claude Opus 4.6 on a 16,000-line software reimplementation

Epoch AI and METR introduced MirrorCode, a long-horizon benchmark where models reimplement software from execution-only access; Opus 4.6 completed a 16,000-line bioinformatics toolkit. The authors say oracle tests and memorization risks still limit how directly the result maps to everyday software work.

release · PRIMARY · 2026-04-09
Anthropic adds beta advisor tool to Messages API for Opus calls

Anthropic added a beta advisor tool to the Messages API so Sonnet or Haiku can call Opus mid-run inside one request. Anthropic says Sonnet plus Opus scored 2.7 points higher on SWE-bench Multilingual while cutting per-task cost 11.9%.

release · PRIMARY · 2026-04-08
Anthropic launches Claude Managed Agents public beta with hosted sandboxes and outcome-based runs

Anthropic put Claude Managed Agents into public beta with hosted sandboxes, vaults, memory filesystems, and long-running sessions. Use the managed setup if you want explicit controls for tools, credentials, and completion criteria instead of custom harness code.

news · PRIMARY · 2026-04-07
Anthropic launches Project Glasswing with Claude Mythos Preview and 93.9% SWE-Bench Verified

Anthropic launched Project Glasswing, giving selected partners access to Claude Mythos Preview and publishing a system card with strong coding and cyber benchmark results. It stays off the public API for now, so teams should treat it as a restricted dual-use security release rather than a normal model launch.

news · PRIMARY · 2026-04-06
GitHub issue reports Claude Code regressions after Feb update, citing 6,852 sessions

A closed GitHub issue says Claude Code became unreliable for complex engineering after February changes, citing 17,871 thinking blocks and 234,760 tool calls across 6,852 sessions. Anthropic said the redaction flag was UI-only, but developers reported broader Opus quality drops and opaque harness changes.

workflow · SECONDARY · 2026-04-06
Bram Cohen compares vibe coding with AI Level 6 workflows after Claude Code leak

Bram Cohen used the Claude Code leak to argue that prompt-only development produces bad software, while a separate 250-hour syntaqlite build said the durable version arrived only after a Python-to-Rust rewrite. Practitioners say specs, tests, linters, repo skills, and codebase context are the controls that keep coding agents maintainable.

news · SECONDARY · 2026-04-05
OpenClaw adds direct Claude Code and ClawHub listener routes

Builders shipped a direct Claude Code harness and a ClawHub marketplace skill for OpenClaw workflows. Use these routes to wire agent tooling into OpenClaw, but watch Claude API limits and token burn costs.

workflow · SECONDARY · 2026-04-05
LongTracer opens local STS+NLI claim checks for RAG validation

LongTracer open-sourced local STS+NLI claim checks, while qi published a private search engine with a Claude Code plugin and LM Studio users shared MCP search configs for Qwen. Use these stacks to ground retrieval and verify answers without a second judge model.

release · SECONDARY · 2026-04-04
Hermes Agent adds /claude-code orchestration and cron hooks

Hermes Agent added direct /claude-code orchestration and cron-time script hooks, and the team also shipped Hermes-focused datasets and agent-tuned model variants. The update turns Hermes into a harness that can steer Claude Code and inject recurring context automatically.

release · SECONDARY · 2026-04-04
Clawback releases Claude Code hook layer for stop-checks and PostToolUse enforcement

Clawback turns leaked Claude Code verification patterns into stop, pre-tool, post-tool, and post-compaction hooks. It replaces prompt-only guardrails with deterministic checks and shows how quickly the source-map leak is being turned into third-party control layers.

news · PRIMARY · 2026-04-04
Anthropic cuts Claude subscription access for third-party harnesses in Apr. 4 rollout

Anthropic’s Apr. 4 cutoff for using Claude subscriptions through OpenClaw-class harnesses went live. Users report API-billing fallbacks, ACP workarounds, and restored Claude Code quota, while edge cases around claude -p and Agent SDK use remain unsettled. The change pushes heavy agent loops toward metered access.

news · PRIMARY · 2026-04-03
Anthropic cuts Claude subscription access for third-party harnesses on Apr. 4

Anthropic said Claude subscriptions will stop covering third-party harnesses such as OpenClaw on Apr. 4, with discounted extra-usage bundles, refunds, and one-time plan credits. Heavy Claude-based agent workflows may need to move to API billing or extra-usage bundles because Anthropic cites subscription capacity constraints.

release · PRIMARY · 2026-04-01
Claude Code 2.1.90 adds NO_FLICKER fullscreen renderer

Claude Code 2.1.90 adds an experimental NO_FLICKER fullscreen renderer with mouse support and virtualized scrolling. The release also fixes rate-limit loops and resume regressions, so update if you want the new UI while watching for selection and table-rendering bugs.

news · PRIMARY · 2026-04-01
GitHub retracts mistaken Claude Code fork takedowns after cch signing reverse-engineering

GitHub retracted mistaken Claude Code fork takedowns after Anthropic’s post-leak DMCA notice, and developers also reverse-engineered the client’s cch request signing. Watch for third-party client compatibility issues and a growing gap between requested and executed takedowns.

news · PRIMARY · 2026-03-31
Claude Code source map leaks 512K lines in npm package

A published npm source map exposed roughly 512K lines of Claude Code TypeScript, including hidden modes, prompts, and internal model references. Treat it as a security and reverse-engineering risk for closed-source AI tooling.

release · PRIMARY · 2026-03-30
Claude Code adds computer use in research preview for Pro and Max

Anthropic put computer use directly into Claude Code, letting the CLI open apps, click through GUIs, and verify work on screen. Try it if you want Claude Code to handle end-to-end UI tasks beyond file edits, but note it is rolling out as a research preview on Pro and Max plans.

release · PRIMARY · 2026-03-30
Claude Code fixes prompt-cache bugs in 2.1.88 after quota-burn reports

Claude Code 2.1.88 added fixes for prompt-cache misses, repeated CLAUDE.md reinjection, and a multi-schema StructuredOutput bug after widespread reports of unexpectedly fast quota consumption. Update if you rely on long sessions, because uncached runs can burn through paid limits much faster than intended.

release · SECONDARY · 2026-03-29
OpenClaw 2026.3.28 adds 9 MCP tools and Responses API support

OpenClaw 2026.3.28 exposes messaging and event handling as nine MCP tools, adds Responses API support, and lets plugins request permission during browser use. Use it to separate transport from agent logic so Claude Code, Codex, Cursor, and local harnesses can share the same account with less glue.

workflow · PRIMARY · 2026-03-29
Claude Code adds /loop, /teleport, and /batch workflow guidance in Boris Cherny guide

A Boris Cherny guide maps Claude Code mobile sessions, /teleport, /loop, hooks, worktrees, /batch, and custom agents into one workflow set. Use it to turn scattered commands into repeatable patterns for long-running coding sessions across terminal, desktop, and cloud.

news · PRIMARY · 2026-03-29
Claude Code limits concurrent work as users report weeklong waits and missing desktop threads

Users report stricter Claude Code request caps, weeklong cooldowns, and desktop threads disappearing after restarts. Watch quotas closely and shift to lighter models or token-cutting workflows around /context and /clear if the limits hit your workflow.

workflow · PRIMARY · 2026-03-28
Claude Code guides compare `.claude/` commands, agents, and global rules

Two new guides map how Claude Code teams are using `.claude/`, `CLAUDE.md`, commands, agents, skills, and global rules. The overlap matters because commenters favor short instructions and a small number of repeatable guardrails over larger prompt stacks.

workflow · SECONDARY · 2026-03-28
Jai launches casual, strict, and bare sandbox modes for AI agents

Stanford's `jai` package launches casual, strict, and bare Linux containment modes for AI agents, and users pair the idea with Claude Code and OpenClaw hardening tips. The workflow narrows write scope and reduces persistent exploit paths such as hooks, `.venv` files, and startup artifacts.

news · PRIMARY · 2026-03-28
Claude Code limits concurrent agents as users report RPM caps

Users report new request-per-minute caps that trigger after three to four concurrent agents, and Boris Cherny says efficiency work is underway. The issue hits the multi-agent workflows Anthropic has been promoting, separate from five-hour usage buckets.


© 2026 AI Primer. All rights reserved.