测试版
排行榜/INTELLECT-1
U

INTELLECT-1

来自 Unknown · 发布于 2024-01-01

20.2
平均分
N/A
输入价格
N/A
输出价格
N/A
上下文窗口
text
类型

Tested on 12 benchmarks with 20.2% average. Top scores: HellaSwag (61.9%), ARC AI2 (39.4%), GSM8K (38.6%).

基准测试类别分数Bar
HellaSwagknowledge61.9
ARC AI2knowledge39.4
GSM8Kmath38.6
MMLUknowledge33.2
Winograndeknowledge31.6
IFEvallanguage17.6
BBHreasoning13.1
MUSRreasoning4.1
MMLU-PROknowledge1.3
BBH (HuggingFace)general1.0
MATH Level 5math0.0
GPQAknowledge0.0