测试版
排行榜/Gemma 2B
Google DeepMind logo

Gemma 2B

开源

来自 Google DeepMind · 发布于 2024-01-01

29.1
平均分
N/A
输入价格
N/A
输出价格
N/A
上下文窗口
text
类型

Tested on 16 benchmarks with 29.1% average. Top scores: OpenBookQA (71.5%), HellaSwag (61.9%), PIQA (54.6%).

基准测试类别分数Bar
OpenBookQAknowledge71.5
HellaSwagknowledge61.9
PIQAknowledge54.6
TriviaQAknowledge53.2
Winograndeknowledge30.8
IFEvallanguage26.6
MMLUknowledge23.1
ANLIknowledge23.1
ARC AI2knowledge22.8
MMLU-PROknowledge21.6
BBH (HuggingFace)general21.1
GSM8Kmath17.7
BBHreasoning13.6
MUSRreasoning11.0
MATH Level 5math7.4
GPQAknowledge4.9