
DeepSeek-R1 (May 2025)

By DeepSeek. Published 2024-01-01

Average score: 48.5
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

Tested on 11 benchmarks, with an average score of 48.5%. Top scores: MATH level 5 (96.6%), Fiction.LiveBench (75.0%), Aider polyglot (71.4%).

Benchmark scores

Benchmark                   Category    Score (%)
MATH level 5                math        96.6
Fiction.LiveBench           knowledge   75.0
Aider polyglot              coding      71.4
GPQA diamond                knowledge   68.4
OTIS Mock AIME 2024-2025    math        66.4
WeirdML                     coding      41.6
DeepResearch Bench          knowledge   35.1
SimpleBench                 reasoning   29.0
SimpleQA Verified           knowledge   27.4
ARC-AGI                     reasoning   21.2
ARC-AGI-2                   reasoning   1.1
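
The page does not say how the 48.5 average is aggregated. A minimal sketch, assuming it is a plain unweighted mean of the 11 scores listed above, reproduces the figure and the three top scores. The dictionary name `scores` and the variable names are illustrative only.

```python
from statistics import mean

# Scores (percent) copied from the benchmark table above.
scores = {
    "MATH level 5": 96.6,
    "Fiction.LiveBench": 75.0,
    "Aider polyglot": 71.4,
    "GPQA diamond": 68.4,
    "OTIS Mock AIME 2024-2025": 66.4,
    "WeirdML": 41.6,
    "DeepResearch Bench": 35.1,
    "SimpleBench": 29.0,
    "SimpleQA Verified": 27.4,
    "ARC-AGI": 21.2,
    "ARC-AGI-2": 1.1,
}

# Assumption: the site reports an unweighted mean rounded to one decimal.
average = round(mean(scores.values()), 1)

# Three highest-scoring benchmarks, best first.
top_three = sorted(scores.items(), key=lambda item: item[1], reverse=True)[:3]

print(f"{len(scores)} benchmarks, average {average}%")
for name, score in top_three:
    print(f"{name}: {score}%")
```

Under that assumption the rounded mean matches the reported 48.5. A weighted or per-category average would give a different number.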
