releaseJune 1, 2026

Qwen releases Qwen 3.7 Plus with multimodal agent mode and browser demos

Alibaba released Qwen 3.7 Plus as a multimodal agent model for GUI, CLI, coding, and browser tasks. It ships with browser demos and immediate Cline support, giving teams another frontier-style agent model to compare against M3 and closed-source tools.

4 min read

Qwen releases Qwen 3.7 Plus with multimodal agent mode and browser demos

TL;DR

Alibaba_Qwen's launch post says Qwen 3.7 Plus is a new multimodal agent model that combines vision and language for GUI, CLI, coding, productivity, and search-augmented visual QA, and Alibaba's Model Studio catalog lists it with a 1M-token context window.
The launch was demo-heavy: Alibaba_Qwen's hybrid-agent demo shows mixed GUI and CLI operation, while Alibaba_Qwen's browser-agent demo pushes it into browser workflows.
Alibaba is positioning Plus as the cheaper agentic sibling to Max. The Model Studio catalog describes Qwen3.7-Plus as the cost-effective model in the 3.7 line, and teortaxesTex's benchmark note called out a new CritPt score of 6.0, above Qwen 3.6 Plus but far below 3.7 Max.
Day-one availability was immediate outside Alibaba's own stack: cline's integration post made it available in Cline, and vercel_dev's AI Gateway post added both Qwen 3.7 Plus and Max to Vercel AI Gateway with a free window for paid users through June 4.

Alibaba is pitching Qwen 3.7 Plus less like a chat model and more like a single agent loop that can look at a screen, drop into a terminal, and keep moving. You can already see that framing in the browser demo on X, the Vercel model page, and the day-one AI Gateway changelog.

Qwen3.7-Plus

The official pitch is unusually specific. According to Alibaba_Qwen's launch post, Qwen 3.7 Plus is a multimodal agent foundation with four named buckets:

multimodal interactive hybrid agent for GUI and CLI tasks
coding agent and productivity assistant with full-modality input
visual agent for perception, reasoning, grounding, and search-augmented QA
cross-harness generalization across agent frameworks

Alibaba's Model Studio catalog adds two concrete deployment details the launch tweet skips: Qwen3.7-Plus is listed with a 1M-token context window, and the product card frames Plus as the cost-effective model in the Qwen 3.7 family.

Hybrid agent demos

The first two public demos match the launch claim that GUI and CLI are meant to sit in the same loop. Alibaba_Qwen's hybrid-agent demo is explicitly labeled "Multimodal Interactive Hybrid Agent," while Alibaba_Qwen's browser-agent demo shows the same release being pushed into web navigation.

That lines up with how Vercel describes the model on its Qwen 3.7 Plus page: vision-language upgrades are paired with agent-level coding, tool use, and productivity workflows, rather than split across separate model variants.

Benchmark framing

Alibaba's public benchmark card landed fast enough that even early reactions focused as much on the comparison set as the numbers. kimmonismus's reaction post questioned why the chart compares Qwen 3.7 Plus against model names like GPT-5.4 and Opus 4.6, while teortaxesTex's benchmark note highlighted a CritPt score of 6.0 and framed it as a mid-tier step between 3.6 Plus and 3.7 Max.

The useful read here is product segmentation, not just leaderboard placement. The Model Studio catalog and Vercel's model page both present Plus as the cheaper multimodal agent model, with Max still sitting above it on raw capability.

Where it shows up

The quickest external pickup came from tooling, not press coverage. cline's integration post said paid and free users could try Qwen 3.7 Plus in Cline immediately, and vercel_dev's AI Gateway post exposed the model slug for Vercel's AI SDK while offering Qwen 3.7 Plus and Max free to paid AI Gateway users until 12 PM PT on June 4.

Vercel's changelog entry names the deployment slug as alibaba/qwen-3.7-plus, although the same page's code sample uses alibaba/qwen3.7-plus. That mismatch is minor, but it is the kind of day-one integration detail engineers notice first.

TL;DR

Qwen3.7-Plus

Hybrid agent demos

Benchmark framing

Where it shows up

Discussion across the web