MODEL RELEASELANGUAGEreleaseAnthropic

Claude Sonnet 4.6

Hybrid reasoning model with superior intelligence for agents, featuring a 1M context window

Anthropic's current Claude Sonnet language model release, presented as a hybrid reasoning model for coding, agents, and professional workflows with a 1M token context window.

Pricing

Official site · Mar 30, 2026, 7:28 AM

Input / 1M

$3.00

Output / 1M

$15.00

Cached input / 1M

$0.30

Prompt caching write pricing is also published by Anthropic: 5-minute cache writes cost $3.75/M tokens and 1-hour cache writes cost $6/M tokens. Cache hits & refreshes cost $0.30/M tokens.

Anthropic's official pricing page lists Claude Sonnet 4.6 at $3 per million input tokens and $15 per million output tokens. The same page lists cache hits & refreshes at $0.30 per million tokens, with prompt caching writes at $3.75/M (5m) and $6/M (1h). The Sonnet product page also states pricing starts at $3/$15 per million tokens.

View source

Model Intelligence

Context window

1,000,000 tokens

Arena ranking

Benchmarkable

Yes

Model level

release

Intelligence Index

44.4

Coding Index

46.4

GPQA

0.8

HLE

0.13

SciCode

0.47

IFBench

0.41

LCR

0.58

TerminalBench Hard

0.46

TAU2

0.8

Recent stories

2 linked stories

newsSECONDARY2026-03-29

Claude Code limits concurrent workflows as users share 60% token-cut tactics

Claude Code users reported steeper caps and week-long waits while sharing ways to cut usage, including /context audits, /clear, smaller models, and RTK log compression. The posts point to token burn from mounted MCP servers, long chat history, raw logs, and multi-agent concurrency, so teams may need to trim runtime load.

releaseSECONDARY2026-03-13

Anthropic launches 1M-token context for Opus 4.6 and Sonnet 4.6 at flat pricing

Anthropic made 1M-token context generally available for Opus 4.6 and Sonnet 4.6, removed the long-context premium, and raised media limits to 600 images or PDF pages. Use it for retrieval-heavy and codebase-scale workflows that previously needed beta headers or special long-context pricing.