测试版
排行榜/DeepSeek R1 Distill Qwen 14B
DeepSeek logo

DeepSeek R1 Distill Qwen 14B

开源

来自 DeepSeek · 发布于 2025-01-20

56.0
平均分
N/A
输入价格
N/A
输出价格
N/A
上下文窗口
text-generation
类型

Tested on 11 benchmarks with 56.0% average. Top scores: JCommonsenseQA (93.7%), JSQuAD (89.8%), JNLI (82.4%).

基准测试类别分数Bar
JCommonsenseQAlanguage93.7
JSQuADlanguage89.8
JNLIlanguage82.4
JMMLUlanguage63.4
MATH Level 5math57.0
LLM-JP — Overalllanguage56.8
IFEvallanguage43.8
MMLU-PROknowledge40.7
BBH (HuggingFace)general40.7
MUSRreasoning28.7
GPQAknowledge18.3