updateApril 3, 2026

OpenRouter reports Qwen3.6-Plus hits 1.4T tokens in a day

OpenRouter said Qwen3.6-Plus became its first model to process more than 1 trillion tokens in a single day, and Alibaba said it reached the platform's top rank. The usage milestone adds adoption signal to the 1M-context launch and makes live code-arena comparisons more relevant.

3 min read

OpenRouter reports Qwen3.6-Plus hits 1.4T tokens in a day

TL;DR

OpenRouter said Qwen3.6-Plus became the first model on its platform to clear 1 trillion tokens in a single day, hitting about 1.4 trillion tokens on April 4 OpenRouter usage milestone.
Alibaba's Qwen account said the model also climbed to the top rank on OpenRouter shortly after launch Qwen on OpenRouter rank.
The underlying launch mattered because Qwen3.6-Plus arrived with a 1 million token context window, immediate API availability, and an agent-heavy pitch around coding and multimodal work Launch benchmark summary.
OpenRouter and Arena both pushed live trials fast: the model has a free OpenRouter endpoint OpenRouter usage milestone and is already in Code Arena for side by side coding battles against frontier models Code Arena listing.

You can read Alibaba's official launch post, skim the fuller Alibaba Cloud mirror, pull up the OpenRouter model card, or check the broader Qwen collection page, which already shows even higher cumulative token volume. There is also a live Code Arena entry for head to head coding tests.

OpenRouter's 1.4T day

OpenRouter framed the story as an adoption event, not just a launch. Its post said Qwen3.6-Plus processed roughly 1.4 trillion tokens in one day, the strongest full-day performance for any new model released this year on the platform.

Alibaba amplified that signal a few minutes later, saying Qwen3.6-Plus had already reached the top spot on OpenRouter. OpenRouter's own Qwen collection page now shows 1.81T cumulative tokens for the free model, which suggests the day-one spike kept compounding after the initial post.

Qwen3.6-Plus specs

Alibaba's launch post described Qwen3.6-Plus as a major step up from Qwen3.5 for real-world agents, with immediate API access and a default 1 million token context window. The OpenRouter model page repeats the same core pitch: hybrid linear attention plus sparse MoE routing, free access, and strength on repository-scale problem solving.

According to Alibaba's launch materials and the benchmark summary circulating alongside them, the headline numbers include:

78.8 on SWE-bench Verified
61.6 on Terminal-Bench 2.0
upgraded multimodal work on 3D scenes, GUIs, video reasoning, and visual-to-code tasks
integrations or optimization targets around agent workflows and coding tools

Code Arena testing

Arena moved quickly to turn the launch into a live comparison target. Its post says Qwen3.6-Plus is available in both Text and Code Arena, and the Code Arena framing is unusually concrete: real-world agentic web development tasks, with HTML or React apps you can share or download.

That makes the early benchmark chatter more interesting than the usual screenshot tour. One community post put Qwen3.6-Plus at 71.5 on the Extended NYT Connection Benchmark, far ahead of the two comparison scores in the same post, although that is a narrower unofficial signal than the vendor benchmarks.

Token efficiency chatter

One of the first community reactions was not about raw capability, but how expensive those capabilities feel in practice. A reply-post comparing Qwen3.5 27B with Gemma 4 31B argued that paper wins matter less if the model burns far more tokens to get there.