测试版
Home/对比/DeepSeek V3 vs Llama 3.1 405B

DeepSeek V3 vs Llama 3.1 405B

并排对比,每项指标,每项基准测试。

DeepSeek
59.0
平均分
7/12
benchmarks
Meta
38.0
平均分
4/12
benchmarks
类型DeepSeek V3Llama 3.1 405B
ProviderDeepSeek logoDeepSeekMeta logoMeta
平均分59.038.0
输入价格$0.32-
输出价格$0.89-
上下文窗口164K tokens (~82 books)-
发布于2024-12-262024-07-16
开源Open SourceOpen Source

12 benchmarks · DeepSeek V3: 7, Llama 3.1 405B: 4

基准测试类别DeepSeek V3Llama 3.1 405B
ARC AI2knowledge93.793.7
BBHreasoning83.377.2
GPQA diamondknowledge42.034.5
HellaSwagknowledge85.285.6
MATH level 5math64.849.8
MMLUknowledge82.979.3
OTIS Mock AIME 2024-2025math15.89.6
PIQAknowledge69.471.8
SimpleBenchreasoning2.77.6
TriviaQAknowledge82.982.7
WeirdMLcoding36.121.4
Winograndeknowledge70.478.4