Home/Models/Qwen2.5-Coder-3B
Alibaba Qwen logo

Qwen2.5-Coder-3B

by Alibaba Qwen · Released Jan 2024

Open Source
54.6
avg score
Rank #114
Compare
Better than 58% of all models
Context
N/A
Input $/1M
TBD
Output $/1M
TBD
Type
text
License
Open Source
Benchmarks
4 tested
Data updated today
About

Tested on 4 benchmarks with 52.2% average. Top scores: GSM8K (75.7%), HellaSwag (61.2%), ARC AI2 (37.2%).

Capabilities
math
75.7
#31 globally
knowledge
44.4
#154 globally
Benchmark Scores
Compare All
Tested on 4 benchmarks · Ranked across 2 categories
Score Distribution (all 274 models)
0255075100
▲ You are here
GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

75.7
HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

61.2
ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

37.2
Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

34.8
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
qwen2-5-coder-3b
Specifications
  • Typetext
  • ContextN/A
  • ReleasedJan 2024
  • LicenseOpen Source
  • Statusbenchmark-only
Available On
Alibaba Qwen logoAlibaba QwenTBD
Categories
Share & Export
Tweet
Qwen2.5-Coder-3B is an open-source text AI model by Alibaba Qwen, released in January 2024. It has an average benchmark score of 54.6.