GPT-5 Chat

by OpenAI · Released 2025-08-07

53.7

Average score

$1.25/1M

Input price

$10.00/1M

Output price

128K tokens (~64 books)

Context window

multimodal

Type

Tested on 23 benchmarks, with an average score of 53.7%. Top scores: MATH level 5 (98.1%), Fiction.LiveBench (97.2%), OTIS Mock AIME 2024-2025 (91.4%).

Benchmark scores

Benchmark	Category	Score (%)
MATH level 5	math	98.1
Fiction.LiveBench	knowledge	97.2
OTIS Mock AIME 2024-2025	math	91.4
Aider polyglot	coding	88.0
Lech Mazur Writing	knowledge	87.2
GPQA diamond	knowledge	81.6
GeoBench	knowledge	81.0
ARC-AGI	reasoning	65.7
SWE-Bench Verified (Bash Only)	coding	65.0
WeirdML	coding	60.7
DeepResearch Bench	knowledge	51.0
SimpleQA Verified	knowledge	50.6
Terminal Bench	coding	49.6
VPCT	knowledge	49.0
SimpleBench	reasoning	48.0
Chess Puzzles	knowledge	37.0
Balrog	knowledge	32.8
FrontierMath-2025-02-28-Private	math	32.4
HLE	knowledge	21.6
APEX-Agents	agentic	18.3
FrontierMath-Tier-4-2025-07-01-Private	math	12.5
ARC-AGI-2	reasoning	9.9
GSO-Bench	coding	6.9
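
The headline figures can be checked against the table with a few lines of Python. This is a minimal sketch, assuming the average is a plain unweighted mean of the 23 listed scores (the page does not state how it is computed); the names `scores`, `average`, and `top_three` are illustrative and not part of any published tooling.

```python
from statistics import mean

# Scores copied from the benchmark table (percent).
scores = {
    "MATH level 5": 98.1,
    "Fiction.LiveBench": 97.2,
    "OTIS Mock AIME 2024-2025": 91.4,
    "Aider polyglot": 88.0,
    "Lech Mazur Writing": 87.2,
    "GPQA diamond": 81.6,
    "GeoBench": 81.0,
    "ARC-AGI": 65.7,
    "SWE-Bench Verified (Bash Only)": 65.0,
    "WeirdML": 60.7,
    "DeepResearch Bench": 51.0,
    "SimpleQA Verified": 50.6,
    "Terminal Bench": 49.6,
    "VPCT": 49.0,
    "SimpleBench": 48.0,
    "Chess Puzzles": 37.0,
    "Balrog": 32.8,
    "FrontierMath-2025-02-28-Private": 32.4,
    "HLE": 21.6,
    "APEX-Agents": 18.3,
    "FrontierMath-Tier-4-2025-07-01-Private": 12.5,
    "ARC-AGI-2": 9.9,
    "GSO-Bench": 6.9,
}

# Assumption: unweighted mean across all benchmarks, rounded to one decimal.
average = round(mean(scores.values()), 1)

# Three highest-scoring benchmarks, best first.
top_three = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:3]

print(f"{len(scores)} benchmarks, average {average}%")
for name, score in top_three:
    print(f"{name}: {score}%")
```

Under that assumption the script reproduces the 53.7% average and the same three top scores given in the summary.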