Beta
Rangliste/DeepSeek V3
DeepSeek logo

DeepSeek V3

Open Source

von DeepSeek · Veroeffentlicht 2024-12-26

59.0
Durchschn. Score
$0.32/1M
Eingabepreis
$0.89/1M
Ausgabepreis
164K tokens (~82 books)
Kontextfenster
text
Typ

Tested on 22 benchmarks with 59.0% average. Top scores: Chatbot Arena Elo — Overall (1358.2%), ARC AI2 (93.7%), HellaSwag (85.2%).

BenchmarkKategorieScoreBar
Chatbot Arena Elo — Overallarena1358.2
ARC AI2knowledge93.7
HellaSwagknowledge85.2
BBHreasoning83.3
HELM — IFEvallanguage83.2
HELM — WildBenchreasoning83.1
MMLUknowledge82.9
TriviaQAknowledge82.9
Lech Mazur Writingknowledge77.0
HELM — MMLU-Proknowledge72.3
Winograndeknowledge70.4
PIQAknowledge69.4
MATH level 5math64.8
HELM — GPQAknowledge53.8
Fiction.LiveBenchknowledge50.0
Aider polyglotcoding48.4
GPQA diamondknowledge42.0
HELM — Omni-MATHmath40.3
WeirdMLcoding36.1
OTIS Mock AIME 2024-2025math15.8
SimpleBenchreasoning2.7
FrontierMath-2025-02-28-Privatemath1.7