测试版
排行榜/Qwen2-72B
Alibaba Qwen logo

Qwen2-72B

开源

来自 Alibaba Qwen · 发布于 2024-01-01

41.3
平均分
N/A
输入价格
N/A
输出价格
N/A
上下文窗口
text
类型

Tested on 12 benchmarks with 41.3% average. Top scores: CMMLU (89.7%), MMLU (76.5%), Aider — Code Editing (55.6%).

基准测试类别分数Bar
CMMLUknowledge89.7
MMLUknowledge76.5
Aider — Code Editingcoding55.6
MMLU-PROknowledge52.6
BBH (HuggingFace)general51.9
MATH level 5math39.1
IFEvallanguage38.2
MATH Level 5math31.1
GPQA diamondknowledge21.0
MUSRreasoning19.7
GPQAknowledge19.2
The Agent Companyagentic1.1