o1-mini

Name: o1-mini
Author: OpenAI

by OpenAI · Published 2024-01-01

Avg. score: 34.9
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

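Tested on 13 benchmarks with a 34.9% average, which matches the unweighted mean of the 12 non-Elo scores. Top scores: Chatbot Arena Elo — Overall (1336.6, an Elo rating rather than a percentage), MATH level 5 (89.2%), Aider — Code Editing (70.7%).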

Benchmark results

Benchmark	Category	Score
Chatbot Arena Elo — Overall	arena	1336.6
MATH level 5	math	89.2
Aider — Code Editing	coding	70.7
Lech Mazur Writing	knowledge	64.9
GPQA diamond	knowledge	49.8
OTIS Mock AIME 2024-2025	math	46.9
WeirdML	coding	36.3
Aider polyglot	coding	32.9
ARC-AGI	reasoning	14.0
Cybench	coding	10.0
SimpleBench	reasoning	1.7
FrontierMath-2025-02-28-Private	math	1.7
ARC-AGI-2	reasoning	0.8
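
The page does not state how the 34.9 average is derived. The sketch below is one way to check it, assuming an unweighted mean over the table rows. The `average_score` helper and the `ELO_BENCHMARKS` set are illustrative names, not part of the source. Under that assumption, leaving out the Elo rating reproduces the reported figure, while including it does not.

```python
# Scores copied from the benchmark table above.
scores = {
    "Chatbot Arena Elo — Overall": 1336.6,
    "MATH level 5": 89.2,
    "Aider — Code Editing": 70.7,
    "Lech Mazur Writing": 64.9,
    "GPQA diamond": 49.8,
    "OTIS Mock AIME 2024-2025": 46.9,
    "WeirdML": 36.3,
    "Aider polyglot": 32.9,
    "ARC-AGI": 14.0,
    "Cybench": 10.0,
    "SimpleBench": 1.7,
    "FrontierMath-2025-02-28-Private": 1.7,
    "ARC-AGI-2": 0.8,
}

# Assumption: the Elo rating is on a different scale than the other
# scores, so it is excluded from the average.
ELO_BENCHMARKS = {"Chatbot Arena Elo — Overall"}


def average_score(scores, exclude=frozenset()):
    """Unweighted mean of all scores whose benchmark name is not excluded."""
    values = [value for name, value in scores.items() if name not in exclude]
    return sum(values) / len(values)


avg_without_elo = average_score(scores, ELO_BENCHMARKS)
avg_with_elo = average_score(scores)

print(round(avg_without_elo, 1))  # 34.9 (matches the reported average)
print(round(avg_with_elo, 1))     # 135.0 (Elo included; does not match)
```

The match with 34.9 is consistent with the Elo rating being excluded, but it does not confirm how the page actually computes the figure.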