GPT-4.1

por OpenAI · Lançado em 2025-04-14

34.0

pontuação média

$2.00/1M

Preço de entrada

$8.00/1M

Preço de saída

1.0M tokens (~524 books)

Janela de contexto

multimodal

Tipo

Tested on 15 benchmarks with 34.0% average. Top scores: MATH level 5 (83.0%), GeoBench (72.0%), Fiction.LiveBench (63.9%).

Pontuações de benchmark

Benchmark	Categoria	Pontuação
MATH level 5	math	83.0
GeoBench	knowledge	72.0
Fiction.LiveBench	knowledge	63.9
GPQA diamond	knowledge	55.9
Aider polyglot	coding	52.4
CadEval	coding	42.0
SWE-Bench Verified (Bash Only)	coding	39.6
WeirdML	coding	39.0
OTIS Mock AIME 2024-2025	math	38.3
SimpleBench	reasoning	12.4
FrontierMath-2025-02-28-Private	math	5.5
ARC-AGI	reasoning	5.5
HLE	knowledge	0.6
ARC-AGI-2	reasoning	0.4
FrontierMath-Tier-4-2025-07-01-Private	math	0.1