测试版
排行榜/DeepSeek-V2 (MoE-236B, May 2024)
DeepSeek logo

DeepSeek-V2 (MoE-236B, May 2024)

开源

来自 DeepSeek · 发布于 2024-01-01

76.5
平均分
N/A
输入价格
N/A
输出价格
N/A
上下文窗口
text
类型

Tested on 7 benchmarks with 76.5% average. Top scores: ARC AI2 (89.6%), HellaSwag (82.8%), TriviaQA (80.0%).

基准测试类别分数Bar
ARC AI2knowledge89.6
HellaSwagknowledge82.8
TriviaQAknowledge80.0
Winograndeknowledge72.6
BBHreasoning71.7
MMLUknowledge71.2
PIQAknowledge67.8