How much does Llama 2 7b Hf cost?

Llama 2 7b Hf is open source and can be self-hosted.

What benchmarks has Llama 2 7b Hf been tested on?

Llama 2 7b Hf has been evaluated on 11 benchmarks. Top scores: JSQuAD: 79.9, LLM-JP — Overall: 37.2, JNLI: 36.1.

Is Llama 2 7b Hf open source?

Yes, Llama 2 7b Hf is open source.

How does Llama 2 7b Hf compare to Llama 4 Maverick?

Llama 2 7b Hf has an average score of 22.2 while Llama 4 Maverick scores 22.0. Llama 2 7b Hf outperforms Llama 4 Maverick overall. See full comparison →

Home/Models/Llama 2 7b Hf

Llama 2 7b Hf

Name: Llama 2 7b Hf
Author: Meta

by Meta · Released Jul 2023

Open Source

22.2

avg score

Rank #200

Compare

Better than 13% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

11 tested

Data updated today

About

Meta-llama text generation model. 865K downloads on HuggingFace.

Tested on 11 benchmarks with 23.6% average. Top scores: JSQuAD (79.9%), LLM-JP — Overall (37.2%), JNLI (36.1%).

Capabilities

reasoning

3.8

#166 globally

math

1.7

#208 globally

knowledge

5.9

#216 globally

language

38.7

#114 globally

general

10.3

#50 globally

Benchmark Scores

Compare All

Tested on 11 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

3.8—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

1.7—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

9.6—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

2.2—

Quick compare:

vs Llama 4 Maverick

vs Mistral Small 3.1 24B

vs Gemini 1.0 Pro

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Meta Llama 2

Llama 2 7b Chat HfJul 2023

27.3

N/AN/Actx11 benchmarks

Llama 2 7b HfJul 2023

23.6-3.7

N/AN/Actx11 benchmarks

Llama 2-13BJan 2024

42.5+18.9

N/AN/Actx14 benchmarks

See the full Llama 2 family →

Similar Models

Llama 4 Maverick

Frequently Asked Questions

Llama 2 7b Hf is an open-source text-generation AI model by Meta, released in July 2023. It has an average benchmark score of 22.2.

Benchmarks

JSQuAD LLM-JP — Overall JNLI JMMLU JCommonsenseQA

Meta · Provider All Models Compare Models

Llama 2 7b Hf

Frequently Asked Questions

Related Models

Benchmarks

Related Pages