Meta's latest class of models, Llama 3.1, launched in a variety of sizes and flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...
Tested on 16 benchmarks with a 37.8% average score. Top scores: Chatbot Arena Elo, Overall (1292.8), IFEval (86.7%), MMLU (73.5%).
Phi 4 scores 54.2 (101% of this model's score) at $0.07/1M input tokens · 84% cheaper
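The "101% as good" and "84% cheaper" figures are simple ratios against this model's score and price. A minimal sketch of that arithmetic, where the baseline score (53.7) and baseline price ($0.45/1M input tokens) are hypothetical values chosen only to reproduce the quoted percentages, not published numbers:

```python
def relative_score(candidate: float, baseline: float) -> float:
    """Candidate's score as a percentage of the baseline's score."""
    return 100 * candidate / baseline

def percent_cheaper(candidate_price: float, baseline_price: float) -> float:
    """How much cheaper the candidate is, as a percentage of the baseline price."""
    return 100 * (1 - candidate_price / baseline_price)

# Assumed (hypothetical) baseline: score 53.7, $0.45 per 1M input tokens.
print(round(relative_score(54.2, 53.7)))    # 101
print(round(percent_cheaper(0.07, 0.45)))   # 84
```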
Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.
Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.
HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.
HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.
Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.
Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.
- Type: Text
- Context: 131K tokens (~66 books)
- Released: Jul 2024
- License: Open Source
- Status: Active
- Cost / Message: ~$0.001
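The ~$0.001 cost-per-message figure is a rough estimate: tokens per message multiplied by the per-token price. A minimal sketch, where the message size (750 input + 250 output tokens) and the $1.00/1M-token prices are illustrative assumptions rather than this model's published pricing:

```python
def cost_per_message(input_tokens: int, output_tokens: int,
                     input_price_per_m: float, output_price_per_m: float) -> float:
    """Dollar cost of one message, given per-1M-token prices."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000

# Hypothetical short chat turn: 750 input + 250 output tokens at $1.00/1M each.
print(cost_per_message(750, 250, 1.00, 1.00))  # 0.001
```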