Beta
Classifica/Gemini 3 Pro
Google DeepMind logo

Gemini 3 Pro

di Google DeepMind · Rilascio 2024-01-01

60.5
punteggio medio
N/A
Prezzo Input
N/A
Prezzo Output
N/A
Finestra di Contesto
text
Tipo

Tested on 28 benchmarks with 60.5% average. Top scores: Chatbot Arena Elo — Overall (1486.2%), Chatbot Arena Elo — Coding (1437.6%), OTIS Mock AIME 2024-2025 (91.4%).

BenchmarkCategoriaPunteggioBar
Chatbot Arena Elo — Overallarena1486.2
Chatbot Arena Elo — Codingarena1437.6
OTIS Mock AIME 2024-2025math91.4
HELM — MMLU-Proknowledge90.3
GPQA diamondknowledge90.2
HELM — IFEvallanguage87.6
VPCTknowledge86.5
HELM — WildBenchreasoning85.9
GeoBenchknowledge84.0
HELM — GPQAknowledge80.3
ARC-AGIreasoning75.0
SWE-Bench verifiedcoding72.9
SimpleQA Verifiedknowledge72.9
SimpleBenchreasoning71.7
WeirdMLcoding69.9
Terminal Benchcoding69.4
HELM — Omni-MATHmath55.6
Artificial Analysis — Agentic Indexspeed45.0
Artificial Analysis — Quality Indexspeed41.3
Artificial Analysis — Coding Indexspeed39.4
FrontierMath-2025-02-28-Privatemath37.6
HLEknowledge34.4
ARC-AGI-2reasoning31.1
Chess Puzzlesknowledge31.0
FrontierMath-Tier-4-2025-07-01-Privatemath18.8
GSO-Benchcoding18.6
APEX-Agentsagentic18.4
PostTrainBenchknowledge18.1