Local Inference LLM Serving Multimodal Coding Agents Open Source

Llama 4 Maverick

Natively multimodal open model release

Meta's specific Llama 4 model release for native multimodal understanding and general assistant use.

Pricing

Model profile · Current snapshot

Input / 1M

$0.35

Output / 1M

$0.85

Blended / 1M

$0.475

Output TPS

122

TTFT (s)

0.62

Model Intelligence

Context window

1,000,000 tokens

Arena ranking

14

Benchmarkable

Yes

Model level

release

Intelligence Index

14.3

Coding Index

16.3

Math Index

19.3

MMLU Pro

0.81

GPQA

0.67

HLE

0.05

LiveCodeBench

0.4

SciCode

0.33

MATH-500

0.89

AIME

0.39

AIME 2025

0.19

IFBench

0.43

LCR

0.46

TerminalBench Hard

0.07

TAU2

0.18

Recent stories

0 linked stories

No linked stories yet.