How much does Qwen2 7B Instruct cost?

Qwen2 7B Instruct is open source and can be self-hosted.

What benchmarks has Qwen2 7B Instruct been tested on?

Qwen2 7B Instruct has been evaluated on 25 benchmarks. Top scores: JSQuAD: 89.6, JCommonsenseQA: 89.1, JNLI: 81.3.

Is Qwen2 7B Instruct open source?

Yes, Qwen2 7B Instruct is open source.

How does Qwen2 7B Instruct compare to Muse Spark?

Qwen2 7B Instruct has an average score of 77.7, while Muse Spark scores 77.0, so Qwen2 7B Instruct leads Muse Spark by 0.7 points overall.

Qwen2 7B Instruct

Name: Qwen2 7B Instruct
Author: Alibaba

by Alibaba · Released Jun 2024

Open Source

Avg score: 77.7
Rank: #24
Better than 90% of all models

Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text-generation
License: Open Source
Benchmarks: 25 tested

Data updated today

About

Qwen text generation model. 393K downloads on HuggingFace.

Tested on 25 benchmarks with 50.5% average. Top scores: JSQuAD (89.6%), JCommonsenseQA (89.1%), JNLI (81.3%).

Capabilities

reasoning: 7.4 (#150 globally)
math: 27.6 (#140 globally)
knowledge: 19.0 (#199 globally)
language: 57.6 (#94 globally)
general: 37.8 (#20 globally)

Benchmark Scores

Tested on 25 benchmarks · Ranked across 5 categories

[Chart: Score Distribution (all 231 models); axis from 0 to 100, with a marker for this model's position]

reasoning

MUSR: 7.4
HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

math

MATH Level 5: 27.6
HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

knowledge

MMLU-PRO: 31.6
HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

GPQA: 6.4
HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

Score legend: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
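The score legend can be read as a simple threshold lookup. The sketch below is illustrative only: the function name `score_band` is hypothetical, and the boundary handling is an assumption, since the legend's ranges overlap at 70 and 85 (here each boundary value is assigned to the higher band).

```python
def score_band(score: float) -> str:
    """Map a 0-100 score to a legend label.

    Assumption: boundary values (50, 70, 85) belong to the higher band;
    the page's legend does not say which side they fall on.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Capability scores as listed on this page.
capabilities = {
    "reasoning": 7.4,
    "math": 27.6,
    "knowledge": 19.0,
    "language": 57.6,
    "general": 37.8,
}

# Label each capability with its legend band.
bands = {name: score_band(value) for name, value in capabilities.items()}
```

Under these assumptions, only the language score lands in the Average band, the other capability scores fall in Below, and the 77.7 average score would be labeled Good.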

Model Family · Alibaba Qwen 2

Qwen2 0.5B · May 2024 · 7.2 · N/A · N/A ctx · 6 benchmarks

Qwen2 0.5B Instruct · Jun 2024 · 6.6 (-0.6) · N/A · N/A ctx · 6 benchmarks

Qwen2 1.5B Instruct · Jun 2024 · 14.1 (+7.5) · N/A · N/A ctx · 6 benchmarks

Qwen2 7B Instruct · Jun 2024 · 50.5 (+36.4) · N/A · N/A ctx · 25 benchmarks

Qwen2 VL 2B Instruct · Aug 2024 · N/A · N/A ctx

Qwen2 VL 7B Instruct · Aug 2024 · 47.3 (+47.3) · N/A · N/A ctx · 11 benchmarks

Qwen2 VL 7B Instruct AWQ · Aug 2024 · N/A · N/A ctx
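The signed values next to the family scores appear to be differences from the entry listed immediately before. That reading is an inference from the numbers, not something the page states. A minimal sketch under that assumption, with a hypothetical helper `family_deltas`, and with an unscored previous entry treated as a baseline of 0:

```python
from typing import Dict, List, Optional, Tuple

# Family scores as listed on this page; None means no score is shown.
family: List[Tuple[str, Optional[float]]] = [
    ("Qwen2 0.5B", 7.2),
    ("Qwen2 0.5B Instruct", 6.6),
    ("Qwen2 1.5B Instruct", 14.1),
    ("Qwen2 7B Instruct", 50.5),
    ("Qwen2 VL 2B Instruct", None),
    ("Qwen2 VL 7B Instruct", 47.3),
    ("Qwen2 VL 7B Instruct AWQ", None),
]


def family_deltas(entries: List[Tuple[str, Optional[float]]]) -> Dict[str, float]:
    """Difference between each scored entry and the entry listed before it.

    Assumption: when the previous entry has no score, the baseline is 0.
    """
    deltas: Dict[str, float] = {}
    previous: Optional[float] = None
    for index, (name, score) in enumerate(entries):
        if index > 0 and score is not None:
            baseline = previous if previous is not None else 0.0
            deltas[name] = round(score - baseline, 1)
        previous = score
    return deltas


deltas = family_deltas(family)
```

Under these assumptions the computed differences match the four signed values shown in the family list (-0.6, +7.5, +36.4, +47.3).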

Similar Models

Muse Spark · Unknown · 77.0 · TBD

Gemini 2.5 Pro Preview 05-06

BenchGecko API

qwen-qwen2-7b-instruct

Specifications

Type: text-generation
Context: N/A
Released: Jun 2024
License: Open Source
Status: Active

Available On

Alibaba: TBD

Frequently Asked Questions

Qwen2 7B Instruct is an open-source text-generation AI model by Alibaba, released in June 2024. It has an average benchmark score of 77.7.

Benchmarks

JSQuAD JCommonsenseQA JNLI MMMLU — Chinese MMMLU — French
