Beta
Classificação/GPT-5.1 Chat
OpenAI

GPT-5.1 Chat

por OpenAI · Lançado em 2025-11-13

43.4
pontuação média
$1.25/1M
Preço de entrada
$10.00/1M
Preço de saída
128K tokens (~64 books)
Janela de contexto
multimodal
Tipo

Tested on 16 benchmarks with 43.4% average. Top scores: OTIS Mock AIME 2024-2025 (88.6%), GPQA diamond (83.5%), ARC-AGI (72.8%).

Pontuações de benchmark

BenchmarkCategoriaPontuaçãoBar
OTIS Mock AIME 2024-2025math88.6
GPQA diamondknowledge83.5
ARC-AGIreasoning72.8
SWE-Bench Verified (Bash Only)coding66.0
WeirdMLcoding60.8
SimpleQA Verifiedknowledge48.9
Terminal Benchcoding47.6
SimpleBenchreasoning43.8
VPCTknowledge38.0
Chess Puzzlesknowledge32.0
FrontierMath-2025-02-28-Privatemath31.0
HLEknowledge19.8
ARC-AGI-2reasoning17.6
APEX-Agentsagentic17.5
GSO-Benchcoding13.7
FrontierMath-Tier-4-2025-07-01-Privatemath12.5

Modelos similares