How much does Llama 4 Maverick cost?

Llama 4 Maverick costs $0.15 per million input tokens and $0.60 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.001 per message.

What benchmarks has Llama 4 Maverick been tested on?

Llama 4 Maverick has been evaluated on 17 benchmarks. Top scores: MATH level 5: 73.0, Lech Mazur Writing: 63.7, GPQA diamond: 56.0.

Is Llama 4 Maverick open source?

Yes, Llama 4 Maverick is open source.

How does Llama 4 Maverick compare to Llama 2 7b Hf?

Llama 4 Maverick has an average score of 22.0 while Llama 2 7b Hf scores 22.2. Llama 2 7b Hf slightly outperforms Llama 4 Maverick overall. See full comparison →

Home/Models/Llama 4 Maverick

Llama 4 Maverick

Name: Llama 4 Maverick
Price: 0.15 USD
Author: Meta

by Meta · Released Apr 2025

Open SourceMultimodal1M Context

22.0

avg score

Rank #203

Compare

Better than 13% of all models

Context

1.0M tokens (~524 books)

Input $/1M

$0.15

Output $/1M

$0.60

Type

multimodal

License

Open Source

Benchmarks

17 tested

Data updated today

About

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

Tested on 17 benchmarks with 28.0% average. Top scores: MATH level 5 (73.0%), Lech Mazur Writing (63.7%), GPQA diamond (56.0%).

Looking for similar performance at lower cost?
Llama 3.2 1B Instruct scores 19.9 (90% as good) at $0.03/1M input · 82% cheaper

Capabilities

coding

20.4

#126 globally

reasoning

5.9

#160 globally

math

31.4

#127 globally

knowledge

43.8

#130 globally

speed

22.9

#52 globally

Benchmark Scores

Compare All

Tested on 17 benchmarks · Ranked across 5 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

24.5—

SWE-Bench Verified (Bash Only)

SWE-bench Verified solved using only bash commands, no specialized frameworks. Tests raw terminal-based problem solving.

21.0—

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

15.6—

reasoningCompare reasoning →

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

13.2—

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

4.4—

ARC-AGI-2

ARC-AGI 2, harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.

0.1—

mathCompare math →

MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

73.0—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

20.5—

FrontierMath-2025-02-28-Private

Original research-level math problems created by professional mathematicians. Problems are unpublished and cannot be memorized.

0.7—

Quick compare:

vs Llama 2 7b Hf

vs Mistral Small 3.1 24B

vs Gemini 1.0 Pro

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Meta Llama 4

Llama 4 MaverickApr 2025

28.0

$0.15/M in1.0Mctx17 benchmarks

Llama 4 ScoutApr 2025

18.9-9.1

$0.08/M in(-0.07)328Kctx(-721K)11 benchmarks

Llama 4 Scout 17B 16E InstructApr 2025

N/AN/Actx1 benchmark

See the full Llama 4 family →

Similar Models

Llama 2 7b Hf

Frequently Asked Questions

Llama 4 Maverick is an open-source multimodal AI model by Meta, released in April 2025. It has an average benchmark score of 22.0. Context window: 1M tokens.

Benchmarks

MATH level 5 Lech Mazur Writing GPQA diamond GeoBench Fiction.LiveBench

Meta · Provider Meta · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

Llama 4 Maverick

Frequently Asked Questions

Related Models

Benchmarks

Related Pages