Beta
OpenAI

o3

por OpenAIPublicado el 2025-04-16

51.3
puntuaci贸n promedio
$2.00/1M
Precio de entrada
$8.00/1M
Precio de salida
200K tokens (~100 books)
Ventana de contexto
multimodal
Tipo

Tested on 21 benchmarks with 51.3% average. Top scores: MATH level 5 (97.8%), Fiction.LiveBench (88.9%), Lech Mazur Writing (83.9%).

Puntuaciones de benchmark

BenchmarkCategor铆aPuntuaci贸nBar
MATH level 5math97.8
Fiction.LiveBenchknowledge88.9
Lech Mazur Writingknowledge83.9
OTIS Mock AIME 2024-2025math83.9
Aider polyglotcoding81.3
GPQA diamondknowledge75.8
CadEvalcoding74.0
GeoBenchknowledge74.0
ARC-AGIreasoning60.8
SWE-Bench Verified (Bash Only)coding58.4
SimpleQA Verifiedknowledge53.0
WeirdMLcoding52.4
DeepResearch Benchknowledge46.6
SimpleBenchreasoning43.7
VPCTknowledge28.0
OSWorldagentic23.0
FrontierMath-2025-02-28-Privatemath18.7
HLEknowledge16.3
GSO-Benchcoding8.8
ARC-AGI-2reasoning6.5
FrontierMath-Tier-4-2025-07-01-Privatemath2.1

Modelos similares