
R1 0528

Open source

By DeepSeek · Released 2025-05-28

Average score: 57.9
Input price: $0.50 per 1M tokens
Output price: $2.15 per 1M tokens
Context window: 164K tokens (~82 books)
Type: text
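
As a rough illustration of what the listed prices imply, the sketch below estimates the cost of a single request. It assumes the prices are USD per one million tokens and that billing is linear, with no caching discounts or minimum charges; the function name `estimate_cost_usd` and the example token counts are made up for illustration.

```python
# Listed prices, assumed to be USD per 1,000,000 tokens.
INPUT_PRICE_PER_M = 0.50
OUTPUT_PRICE_PER_M = 2.15
TOKENS_PER_M = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost, assuming simple linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = output_tokens / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 20,000 prompt tokens, 4,000 generated tokens.
    print(f"${estimate_cost_usd(20_000, 4_000):.4f}")  # prints $0.0186
```

Because output tokens cost roughly four times as much as input tokens here, long generations tend to dominate the bill under this assumption.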

Tested on 25 benchmarks, with an average score of 57.9. Top scores: Chatbot Arena Elo — Overall (1421.7, an Elo rating rather than a percentage), MATH level 5 (96.6%), OpenCompass — AIME2025 (89.0%).

| Benchmark | Category | Score |
|---|---|---|
| Chatbot Arena Elo — Overall | arena | 1421.7 (Elo) |
| MATH level 5 | math | 96.6 |
| OpenCompass — AIME2025 | math | 89.0 |
| OpenCompass — MMLU-Pro | knowledge | 83.5 |
| HELM — WildBench | reasoning | 82.8 |
| OpenCompass — GPQA-Diamond | knowledge | 80.6 |
| OpenCompass — IFEval | language | 80.0 |
| HELM — MMLU-Pro | knowledge | 79.3 |
| HELM — IFEval | language | 78.4 |
| Aider polyglot | coding | 71.4 |
| GPQA diamond | knowledge | 68.4 |
| HELM — GPQA | knowledge | 66.6 |
| OTIS Mock AIME 2024-2025 | math | 66.4 |
| OpenCompass — LiveCodeBenchV6 | coding | 61.0 |
| HELM — Omni-MATH | math | 42.4 |
| WeirdML | coding | 41.6 |
| DeepResearch Bench | knowledge | 35.1 |
| SimpleBench | reasoning | 29.0 |
| SimpleQA Verified | knowledge | 27.4 |
| Artificial Analysis — Quality Index | speed | 27.1 |
| Artificial Analysis — Coding Index | speed | 24.0 |
| ARC-AGI | reasoning | 21.2 |
| Artificial Analysis — Agentic Index | speed | 20.8 |
| OpenCompass — HLE | knowledge | 14.4 |
| ARC-AGI-2 | reasoning | 1.1 |
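
To compare areas rather than individual benchmarks, the scores can be grouped by category. The sketch below is one way to do that. It is an illustration only, not the method behind the leaderboard's 57.9 average, which the page does not describe. The Elo entry is skipped because it is on a different scale from the other scores, and `category_means` is a name chosen here.

```python
from collections import defaultdict
from statistics import mean

# (category, score) pairs copied from the benchmark list above, in order.
ROWS = [
    ("arena", 1421.7), ("math", 96.6), ("math", 89.0),
    ("knowledge", 83.5), ("reasoning", 82.8), ("knowledge", 80.6),
    ("language", 80.0), ("knowledge", 79.3), ("language", 78.4),
    ("coding", 71.4), ("knowledge", 68.4), ("knowledge", 66.6),
    ("math", 66.4), ("coding", 61.0), ("math", 42.4),
    ("coding", 41.6), ("knowledge", 35.1), ("reasoning", 29.0),
    ("knowledge", 27.4), ("speed", 27.1), ("speed", 24.0),
    ("reasoning", 21.2), ("speed", 20.8), ("knowledge", 14.4),
    ("reasoning", 1.1),
]


def category_means(rows, skip=("arena",)):
    """Return the unweighted mean score per category, skipping some categories."""
    grouped = defaultdict(list)
    for category, score in rows:
        if category not in skip:
            grouped[category].append(score)
    return {category: mean(scores) for category, scores in grouped.items()}


if __name__ == "__main__":
    # Print categories from strongest to weakest mean score.
    ranked = sorted(category_means(ROWS).items(), key=lambda kv: -kv[1])
    for category, avg in ranked:
        print(f"{category:<10} {avg:5.1f}")
```

Under this unweighted grouping, language and math come out highest and speed and reasoning lowest, though categories with only two or three benchmarks are sensitive to any single score.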