MiniMax's multimodal AI model family.

Pricing

Model profile · Current snapshot

Input / 1M: $0.30
Output / 1M: $1.20
Blended / 1M: $0.525
Output TPS: 50.73
TTFT (s): 1.52
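
The pricing and speed figures above can be sanity-checked with a little arithmetic. The sketch below assumes the prices are per 1M tokens and that the blended figure is a 3:1 input-to-output weighted average; that weighting is an assumption (it happens to reproduce $0.525), not something the page states. The function names are illustrative only.

```python
# Figures copied from the snapshot above.
INPUT_PER_M = 0.30    # USD per 1M input tokens (unit assumed)
OUTPUT_PER_M = 1.20   # USD per 1M output tokens (unit assumed)
OUTPUT_TPS = 50.73    # output tokens per second
TTFT_S = 1.52         # time to first token, in seconds


def blended_price(input_per_m, output_per_m, input_parts=3, output_parts=1):
    """Weighted average price; the 3:1 default weighting is an assumption."""
    total_parts = input_parts + output_parts
    return (input_parts * input_per_m + output_parts * output_per_m) / total_parts


def request_cost(input_tokens, output_tokens):
    """Estimated USD cost of one request at the listed prices."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


def request_latency(output_tokens):
    """Rough end-to-end seconds: first-token delay plus steady-state decoding."""
    return TTFT_S + output_tokens / OUTPUT_TPS


if __name__ == "__main__":
    print(f"blended: ${blended_price(INPUT_PER_M, OUTPUT_PER_M):.3f} per 1M")
    print(f"10k in / 1k out: ${request_cost(10_000, 1_000):.4f}")
    print(f"1k output tokens: {request_latency(1_000):.1f} s")
```

Real latency depends on provider, load, and prompt length, so treat the last estimate as a rough lower bound rather than a guarantee.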

Model Intelligence

Arena ranking

Benchmarkable

Model level

family

Intelligence Index: 49.6
Coding Index: 41.9
GPQA: 0.87
HLE: 0.28
SciCode: 0.47
IFBench: 0.76
LCR: 0.69
TerminalBench Hard: 0.39
TAU2: 0.85

Recent stories

13 linked stories

release · SECONDARY · 2026-06-14

Kilo releases Product Week bundle with Agent Manager, Console beta, and M3 plan

Kilo's Product Week bundle added Agent Manager for isolated git worktrees, Kilo Console beta, REVIEWS.md memory hooks, and a balance-based MiniMax M3 plan. The bundle puts parallel agent runs, browser control, and plan provisioning into one shipped release.

release · PRIMARY · 2026-06-12

MiniMax opens M3 weights: 428B total, 23B active, 1M context

MiniMax published M3 weights on Hugging Face with 428B total parameters, 23B active parameters, 1M context, and multimodal support. Unsloth quickly added local GGUF builds, so teams can try 2-bit runs at 138GB RAM or VRAM and 3-bit at 165GB.
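
One way to read the memory figures in this story is as an effective bits-per-weight number. The sketch below assumes 1 GB = 10^9 bytes and that the quoted RAM or VRAM size is dominated by the weights; both are assumptions, and the function name is illustrative only.

```python
TOTAL_PARAMS = 428e9   # total parameters, from the story
ACTIVE_PARAMS = 23e9   # active parameters per token, from the story


def effective_bits_per_weight(size_gb, params=TOTAL_PARAMS):
    """Average bits per parameter implied by a footprint in GB (10^9 bytes assumed)."""
    return size_gb * 1e9 * 8 / params


if __name__ == "__main__":
    for label, size_gb in [("2-bit build", 138), ("3-bit build", 165)]:
        print(f"{label}: {effective_bits_per_weight(size_gb):.2f} bits/weight")
    print(f"active fraction: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")
```

Under these assumptions the "2-bit" build averages somewhat more than 2 bits per weight. That would be consistent with some tensors being kept at higher precision, although the story does not say how the builds are quantized.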

news · PRIMARY · 2026-06-06

Kilo Code benchmarks MiniMax M3 vs Claude Opus 4.8: 13/17 bugs at $0.07 vs $1.30

A seeded code-audit benchmark found MiniMax M3 and the cheapest Claude Opus 4.8 run each caught 13 of 17 planted bugs, but at sharply different cost. The results also showed models found different bugs, and higher reasoning settings did not reliably improve cost efficiency.
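
The cost gap is easier to compare per bug found. The sketch below assumes, from the headline's ordering, that $0.07 is the MiniMax M3 run and $1.30 is the cheapest Claude Opus 4.8 run; the function name is illustrative only.

```python
BUGS_FOUND = 13    # bugs caught by each run, from the story
BUGS_PLANTED = 17  # bugs seeded in the benchmark, from the story

# Run costs in USD; the model-to-cost mapping is assumed from the headline order.
RUN_COST = {"MiniMax M3": 0.07, "Claude Opus 4.8 (cheapest run)": 1.30}


def cost_per_bug(run_cost, bugs_found):
    """USD spent per planted bug that the run actually caught."""
    return run_cost / bugs_found


if __name__ == "__main__":
    print(f"recall for both runs: {BUGS_FOUND / BUGS_PLANTED:.1%}")
    for model, cost in RUN_COST.items():
        print(f"{model}: ${cost_per_bug(cost, BUGS_FOUND):.4f} per bug")
    ratio = RUN_COST["Claude Opus 4.8 (cheapest run)"] / RUN_COST["MiniMax M3"]
    print(f"cost ratio: {ratio:.1f}x")
```

Because the story notes that the two models found different bugs, equal recall does not mean the runs are interchangeable.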

release · SECONDARY · 2026-06-03

OpenClaw 2026.6.1 adds native Windows node host, Skill Workshop, and Workboard orchestration

OpenClaw 2026.6.1 added a native Windows node host, a Skill Workshop for reviewable agent-learned skills, and Workboard orchestration. The update extends OpenClaw beyond Unix-heavy setups and moves more agent management into built-in tools.

news · PRIMARY · 2026-06-01

MiniMax M3 adds OpenCode, Hermes Agent, Atomic Chat, and Vercel AI Gateway support

A day after MiniMax M3 launched, OpenCode, Hermes Agent, Flowith, Atomic Chat, Kilo Code, Cloudflare AI Gateway, and Vercel AI Gateway shipped support. That breadth shows M3 plugged into agent harnesses and routing layers immediately, not just its own API.

news · PRIMARY · 2026-06-01

MiniMax M3 users report slow runs and broken code after launch

A day after MiniMax M3 launched, independent testers posted mixed results: cheap demos and design tasks worked, but several coding runs stalled, broke features, or used more tokens than expected. New external numbers added nuance: M3's Context Arena results fell sharply beyond 64k context, and one DeepSWE run passed 15 of 113 tasks.

release · PRIMARY · 2026-05-31

MiniMax M3 launches with 1M context and 59.0 SWE-Bench Pro

MiniMax shipped M3 with a 1M-token context window, native multimodal input, and frontier coding claims across SWE-Bench Pro, Terminal Bench, and MCP Atlas. It also appeared on OpenRouter, Ollama Cloud, Venice, Hermes, Cline, Together, and Arena on day one.

news · PRIMARY · 2026-05-26

MiniMax claims M3 sparse attention cuts 1M-token prefill 9.7x and decode 15.6x

MiniMax started winding down its M2 series while previewing M3 and a new sparse-attention design with large long-context speedup claims. The teaser points to a fresh open-model race around block selection, GQA, and million-token serving efficiency.

release · PRIMARY · 2026-04-12

MiniMax M2.7 supports 128 GB GGUF runs and day-0 cloud hosting

MiniMax M2.7 moved from announcement to deployment, with GGUF guidance for 128 GB local systems and same-day availability on Together, Fireworks, Hugging Face, and ModelScope. Use the local and managed serving options now, but check the non-commercial license before adopting the 230B model.

release · PRIMARY · 2026-04-11

MiniMax releases M2.7 open model with 56.22% SWE-Pro and 57.0% Terminal Bench 2

MiniMax open-sourced M2.7 and published coding and agent benchmark claims including 56.22% SWE-Pro and 57.0% Terminal Bench 2. Day-zero support from SGLang, vLLM, Ollama Cloud, Together AI, and NVIDIA NIM makes it easy to try on common serving stacks.

news · SECONDARY · 2026-04-07

Hermes Agent adds MiniMax M2.7 and MiMo V2 Pro through partner integrations

Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.

news · PRIMARY · 2026-03-23

MiniMax introduces Token Plan for flat-rate text, speech, music, video, and image APIs

MiniMax introduced a flat-rate Token Plan that covers text, speech, music, video, and image APIs under one subscription. It gives teams one predictable bill across modalities and can be used in third-party harnesses, not just MiniMax apps.

release · PRIMARY · 2026-03-22

MiniMax M2.7 reportedly opens weights in about 2 weeks

Skyler Miao said MiniMax M2.7 open weights are due in roughly two weeks, with updates tuned for agent tasks. Separate replies also confirm that M3 will be multimodal, so local-stack builders should watch both the drop and the benchmark setup.