Home/Models/Qwen2.5 7B Instruct
Alibaba Qwen logo

Qwen2.5 7B Instruct

by Alibaba Qwen · Released Oct 2024

Open Source
57.4
avg score
Rank #79
Compare
Better than 66% of all models
Context
33K tokens (~16 books)
Input $/1M
$0.04
Output $/1M
$0.10
Type
text
License
Open Source
Benchmarks
6 tested
Data updated today
About

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Tested on 6 benchmarks with 35.2% average. Top scores: IFEval (75.8%), MATH Level 5 (50.0%), MMLU-PRO (36.5%).

Capabilities
reasoning
8.4
#147 globally
math
50.0
#76 globally
knowledge
21.0
#196 globally
language
75.8
#56 globally
general
34.9
#28 globally
Benchmark Scores
Compare All
Tested on 6 benchmarks · Ranked across 5 categories
Score Distribution (all 233 models)
0255075100
▲ You are here
MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

8.4
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

50.0
MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

36.5
GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

5.5
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
qwen-2-5-7b-instruct
Specifications
  • Typetext
  • Context33K tokens (~16 books)
  • ReleasedOct 2024
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.000
Available On
Alibaba Qwen logoAlibaba Qwen$0.04
Share & Export
Tweet
Qwen2.5 7B Instruct is an open-source text AI model by Alibaba Qwen, released in October 2024. It has an average benchmark score of 57.4. Context window: 33K tokens.