
DeepSeek R1 Distill Qwen 7B

Open source

By DeepSeek · Released 2025-01-20

Average score: 32.7
Input price: N/A
Output price: N/A
Context window: N/A
Type: text-generation

Tested on 11 benchmarks, with an average score of 32.7%. Top scores: JSQuAD (74.2%), JCommonsenseQA (59.8%), JNLI (54.6%).

Benchmark            Category     Score
JSQuAD               language     74.2
JCommonsenseQA       language     59.8
JNLI                 language     54.6
JMMLU                language     42.3
IFEval               language     40.4
LLM-JP — Overall     language     39.3
MATH Level 5         math         19.6
MMLU-PRO             knowledge    14.7
BBH (HuggingFace)    general      7.9
GPQA                 knowledge    3.9
MUSR                 reasoning    3.5
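
The 32.7 average and the three top scores quoted above can be checked against the table. The page does not say how the average is aggregated, so the sketch below assumes a plain unweighted mean over all 11 rows. The variable names are illustrative only and are not part of any leaderboard API.

```python
from statistics import mean

# Scores copied from the benchmark table above.
scores = {
    "JSQuAD": 74.2,
    "JCommonsenseQA": 59.8,
    "JNLI": 54.6,
    "JMMLU": 42.3,
    "IFEval": 40.4,
    "LLM-JP — Overall": 39.3,
    "MATH Level 5": 19.6,
    "MMLU-PRO": 14.7,
    "BBH (HuggingFace)": 7.9,
    "GPQA": 3.9,
    "MUSR": 3.5,
}

# Assumption: the reported average is an unweighted mean of every row.
average = mean(list(scores.values()))

# The three highest-scoring benchmarks, best first.
top3 = sorted(scores, key=scores.get, reverse=True)[:3]

print(f"{len(scores)} benchmarks, average {average:.1f}")
print("Top scores:", ", ".join(f"{name} ({scores[name]}%)" for name in top3))
```

Under that assumption the mean rounds to 32.7, matching the reported figure, and the top three come out as JSQuAD, JCommonsenseQA, and JNLI.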