
Qwen2.5 14B Instruct

by Alibaba · Released Sep 2024

Open Source · Avg score 71.3 · Rank #36 · Better than 84% of all models
Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text-generation
License: Open Source
Benchmarks: 6 tested
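The "Better than 84% of all models" figure is consistent with Rank #36 if the ranking covers the 231 models mentioned in the score distribution chart. A minimal sketch of that arithmetic follows; the formula and the function name `percent_better_than` are assumptions for illustration, since the page does not say how the percentage is derived.

```python
def percent_better_than(rank: int, total: int) -> float:
    """Percentage of models ranked below the given 1-based rank.

    Assumed formula: (total - rank) / total. The page does not document
    how its "better than" percentage is computed.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank) / total


# Rank #36 out of an assumed field of 231 models.
pct = percent_better_than(rank=36, total=231)
print(f"{pct:.1f}%")
```

Under this assumption the result rounds to the 84% shown on the page.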
About

Qwen2.5 14B Instruct is a text-generation model in Alibaba's Qwen family, with about 1.5M (1503K) downloads on Hugging Face.

Tested on 6 benchmarks, with an unweighted mean of 41.6% across them. Top scores: IFEval (81.4%), MATH Level 5 (55.3%), BBH (Hugging Face) (48.6%).
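The 41.6% figure can be reproduced as the plain mean of the six scores that appear on this page. The sketch below uses only those numbers; the variable names are illustrative. The separate 71.3 "avg score" in the page header is evidently a different aggregate, which the page does not explain.

```python
from statistics import mean

# Scores as listed on this page (percent).
scores = {
    "IFEval": 81.4,
    "MATH Level 5": 55.3,
    "BBH": 48.6,
    "MMLU-Pro": 43.2,
    "MuSR": 10.6,
    "GPQA": 10.5,
}

# Unweighted mean across all six benchmarks.
average = mean(list(scores.values()))

# Three highest-scoring benchmarks, best first.
top3 = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:3]

print(f"average: {average:.1f}%")
for name, value in top3:
    print(f"{name}: {value}%")
```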

Capabilities
reasoning: 10.6 (#132 globally)
math: 55.3 (#62 globally)
knowledge: 26.9 (#181 globally)
language: 81.4 (#47 globally)
general: 48.6 (#13 globally)
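The five category scores match per-category means of the six benchmark scores under one plausible grouping, with knowledge as the mean of MMLU-Pro and GPQA. The grouping in `category_map` and the half-up rounding are assumptions inferred from the numbers, not something the page documents.

```python
from decimal import Decimal, ROUND_HALF_UP

# Benchmark scores from this page, kept as strings so Decimal stays exact.
benchmarks = {
    "MuSR": "10.6",
    "MATH Level 5": "55.3",
    "MMLU-Pro": "43.2",
    "GPQA": "10.5",
    "IFEval": "81.4",
    "BBH": "48.6",
}

# Hypothetical benchmark-to-category grouping (inferred, not documented).
category_map = {
    "reasoning": ["MuSR"],
    "math": ["MATH Level 5"],
    "knowledge": ["MMLU-Pro", "GPQA"],
    "language": ["IFEval"],
    "general": ["BBH"],
}


def category_score(names):
    """Mean of the named benchmarks, rounded half-up to one decimal."""
    values = [Decimal(benchmarks[n]) for n in names]
    avg = sum(values) / len(values)
    return avg.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


category_scores = {cat: category_score(names) for cat, names in category_map.items()}
for cat, value in category_scores.items():
    print(f"{cat}: {value}")
```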
Benchmark Scores
Tested on 6 benchmarks · Ranked across 5 categories
[Chart: Score Distribution (all 231 models), axis ticks 0, 25, 50, 75, 100, with a marker at this model's position]
MuSR: 10.6

Hugging Face evaluation of MuSR (Multistep Soft Reasoning). Tests multi-step reasoning that requires chaining multiple facts together.

MATH Level 5: 55.3

Hugging Face evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

MMLU-Pro: 43.2

Hugging Face evaluation of MMLU-Pro. A harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

GPQA: 10.5

Hugging Face evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.
Score bands: Excellent (85+) · Good (70-85) · Average (50-70) · Below (<50)
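The score bands in the legend can be read as simple thresholds. The sketch below is one way to apply them; treating each lower bound as inclusive (so exactly 70 counts as Good and exactly 50 as Average) is an assumption, because the legend's ranges overlap at their edges. The function name `score_band` is illustrative.

```python
def score_band(score: float) -> str:
    """Map a 0-100 score to the legend's band name.

    Lower bounds are treated as inclusive; the page does not say
    which band a boundary value such as 70 or 85 belongs to.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Benchmark scores shown on this page.
for name, value in [("IFEval", 81.4), ("MATH Level 5", 55.3), ("MuSR", 10.6)]:
    print(f"{name}: {value} -> {score_band(value)}")
```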
BenchGecko API: qwen-qwen25-14b-instruct
Specifications
  • Type: text-generation
  • Context: N/A
  • Released: Sep 2024
  • License: Open Source
  • Status: Active
Available On
Alibaba: TBD
Qwen2.5 14B Instruct is an open-source text-generation AI model by Alibaba, released in September 2024. It has an average benchmark score of 71.3.