StarCoder 2 15B

by Unknown · Released Jan 2024

Open Source
30.5 avg score
Rank #177
Better than 24% of all models
Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text
License: Open Source
Benchmarks: 10 tested
Data updated today
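
The "Better than 24% of all models" figure is consistent with Rank #177 and the 233-model total given in the score distribution on this page. A minimal sketch of that arithmetic follows. It assumes the percentage is the share of models ranked below this one, which the page does not confirm. The function name `percent_better_than` is hypothetical.

```python
def percent_better_than(rank: int, total_models: int) -> float:
    """Share of models ranked strictly below `rank`, as a percentage.

    Assumption (not stated on the page): rank 1 is best and there are no ties.
    """
    if not 1 <= rank <= total_models:
        raise ValueError("rank must be between 1 and total_models")
    return 100.0 * (total_models - rank) / total_models


# Values taken from this page: Rank #177 out of 233 models.
share = percent_better_than(177, 233)
print(f"{share:.1f}%")  # 24.0%
```

Under this assumption, 56 of the 233 models rank lower, which rounds to the 24% shown.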
About

Tested on 10 benchmarks with 24.3% average. Top scores: GSM8K (57.7%), MMLU (52.1%), ARC AI2 (29.6%).
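
The top scores named above can be reproduced from the benchmark entries listed further down this page. This is a small sketch rather than the site's own method, and the helper name `top_scores` is hypothetical. Only six of the ten scores appear on the page, so the 24.3% average cannot be recomputed from them.

```python
# Scores for the six benchmarks listed on this page (four of the ten are not shown).
scores = {
    "MUSR": 2.9,
    "GSM8K": 57.7,
    "MATH Level 5": 6.0,
    "MMLU": 52.1,
    "ARC AI2": 29.6,
    "Winogrande": 28.6,
}


def top_scores(scores: dict[str, float], k: int = 3) -> list[tuple[str, float]]:
    """Return the k highest-scoring benchmarks, best first."""
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)[:k]


for name, value in top_scores(scores):
    print(f"{name} ({value}%)")
```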

Capabilities
reasoning: 2.9 · #174 globally
math: 31.8 · #124 globally
knowledge: 25.7 · #187 globally
language: 27.8 · #127 globally
general: 20.4 · #41 globally
Benchmark Scores
Tested on 10 benchmarks · Ranked across 5 categories
[Chart: Score Distribution (all 233 models); axis from 0 to 100; a marker shows this model's position]
MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

2.9
GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

57.7
MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

6.0
MMLU

Massive Multitask Language Understanding. 57 subjects from STEM, humanities, and social sciences. The most widely cited knowledge benchmark.

52.1
ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

29.6
Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

28.6
Score legend: Excellent (85+) · Good (70-85) · Average (50-70) · Below (<50)
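
The legend's bands can be applied to any score with a simple threshold check. The sketch below assumes each band includes its lower bound (so exactly 85 is Excellent and exactly 70 is Good), because the legend leaves the boundaries ambiguous. The function name `score_tier` is hypothetical.

```python
def score_tier(score: float) -> str:
    """Map a 0-100 benchmark score to the legend's band.

    Assumption: lower bounds are inclusive (85 -> Excellent, 70 -> Good, 50 -> Average).
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Scores listed on this page.
for name, value in [("GSM8K", 57.7), ("MMLU", 52.1), ("ARC AI2", 29.6)]:
    print(f"{name}: {score_tier(value)}")
```

With these thresholds, GSM8K and MMLU fall in the Average band and every other score listed on this page falls in Below.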
Links
BenchGecko API: starcoder-2-15b
Specifications
  • Type: text
  • Context: N/A
  • Released: Jan 2024
  • License: Open Source
  • Status: benchmark-only
Available On
Unknown: TBD
StarCoder 2 15B is an open-source text model released in January 2024; its publisher is listed as Unknown. It has an average benchmark score of 30.5.