Meta's latest class of model (Llama 3.1) launched in a variety of sizes and flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...
Tested on 16 benchmarks with a 27.4% average. Top scores: Chatbot Arena Elo, Overall (1211), GSM8K (82.4%), PIQA (62.4%).
Gemma 3 27B (free) scores 35.0 (102% as good) at $0.00/1M input tokens · 100% cheaper
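A comparison row like the one above can be reproduced with simple arithmetic. A minimal sketch, assuming the site divides the other model's benchmark average by this model's; the 34.3 baseline and $0.05/1M reference cost are inferred for illustration, not stated on the page:

```python
# Hypothetical reconstruction of the comparison row's figures.
# 34.3 is inferred from "35.0 (102% as good)"; $0.05/1M is an assumed baseline cost.

def relative_quality(other_score: float, base_score: float) -> str:
    """Express another model's benchmark average as a percentage of this one's."""
    return f"{other_score / base_score:.0%} as good"

def cost_savings(other_cost: float, base_cost: float) -> str:
    """Percentage cheaper per 1M input tokens; free models come out 100% cheaper."""
    return f"{1 - other_cost / base_cost:.0%} cheaper"

print(relative_quality(35.0, 34.3))  # Gemma 3 27B vs. an assumed 34.3 baseline
print(cost_savings(0.00, 0.05))      # $0.00 vs. an assumed $0.05/1M input
```

Rounding to whole percentages matches the page's display convention (e.g. "102% as good", "100% cheaper").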
Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.
Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.
HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.
Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.
Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.
HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced multi-step reasoning.
- Type: text
- Context: 16K tokens
- Released: Jul 2024
- License: Open Source
- Status: Active
- Cost / Message: ~$0.000