
Gemini 3 Pro

By Google DeepMind · Released 2024-01-01

Average score: 60.5
Input price: N/A
Output price: N/A
Context window: N/A
Type: text

Tested on 28 benchmarks with a 60.5% average. Top scores: Chatbot Arena Elo — Overall (1486.2 Elo), Chatbot Arena Elo — Coding (1437.6 Elo), and OTIS Mock AIME 2024-2025 (91.4%). The two arena entries are Elo ratings, not percentages.

Benchmark                                 Category    Score (% unless marked Elo)
Chatbot Arena Elo — Overall               arena       1486.2 (Elo)
Chatbot Arena Elo — Coding                arena       1437.6 (Elo)
OTIS Mock AIME 2024-2025                  math        91.4
HELM — MMLU-Pro                           knowledge   90.3
GPQA diamond                              knowledge   90.2
HELM — IFEval                             language    87.6
VPCT                                      knowledge   86.5
HELM — WildBench                          reasoning   85.9
GeoBench                                  knowledge   84.0
HELM — GPQA                               knowledge   80.3
ARC-AGI                                   reasoning   75.0
SWE-Bench verified                        coding      72.9
SimpleQA Verified                         knowledge   72.9
SimpleBench                               reasoning   71.7
WeirdML                                   coding      69.9
Terminal Bench                            coding      69.4
HELM — Omni-MATH                          math        55.6
Artificial Analysis — Agentic Index       speed       45.0
Artificial Analysis — Quality Index       speed       41.3
Artificial Analysis — Coding Index        speed       39.4
FrontierMath-2025-02-28-Private           math        37.6
HLE                                       knowledge   34.4
ARC-AGI-2                                 reasoning   31.1
Chess Puzzles                             knowledge   31.0
FrontierMath-Tier-4-2025-07-01-Private    math        18.8
GSO-Bench                                 coding      18.6
APEX-Agents                               agentic     18.4
PostTrainBench                            knowledge   18.1
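
The page does not say how the 60.5 average is derived, and the two arena rows are on an Elo scale rather than a percentage scale. As a rough sanity check, the sketch below averages only the 26 percentage-scale rows and groups scores by category. The function names `percent_scale_mean` and `category_means` are hypothetical, and excluding the arena rows is an assumption, not the site's documented method. Under that assumption the mean comes out near 58.4, so the published figure presumably uses a different normalization or weighting.

```python
from statistics import mean

# (benchmark, category, score) rows copied from the table above.
ROWS = [
    ("Chatbot Arena Elo — Overall", "arena", 1486.2),
    ("Chatbot Arena Elo — Coding", "arena", 1437.6),
    ("OTIS Mock AIME 2024-2025", "math", 91.4),
    ("HELM — MMLU-Pro", "knowledge", 90.3),
    ("GPQA diamond", "knowledge", 90.2),
    ("HELM — IFEval", "language", 87.6),
    ("VPCT", "knowledge", 86.5),
    ("HELM — WildBench", "reasoning", 85.9),
    ("GeoBench", "knowledge", 84.0),
    ("HELM — GPQA", "knowledge", 80.3),
    ("ARC-AGI", "reasoning", 75.0),
    ("SWE-Bench verified", "coding", 72.9),
    ("SimpleQA Verified", "knowledge", 72.9),
    ("SimpleBench", "reasoning", 71.7),
    ("WeirdML", "coding", 69.9),
    ("Terminal Bench", "coding", 69.4),
    ("HELM — Omni-MATH", "math", 55.6),
    ("Artificial Analysis — Agentic Index", "speed", 45.0),
    ("Artificial Analysis — Quality Index", "speed", 41.3),
    ("Artificial Analysis — Coding Index", "speed", 39.4),
    ("FrontierMath-2025-02-28-Private", "math", 37.6),
    ("HLE", "knowledge", 34.4),
    ("ARC-AGI-2", "reasoning", 31.1),
    ("Chess Puzzles", "knowledge", 31.0),
    ("FrontierMath-Tier-4-2025-07-01-Private", "math", 18.8),
    ("GSO-Bench", "coding", 18.6),
    ("APEX-Agents", "agentic", 18.4),
    ("PostTrainBench", "knowledge", 18.1),
]


def percent_scale_mean(rows):
    """Mean over non-arena rows (assumption: Elo ratings are left out)."""
    return mean(score for _, category, score in rows if category != "arena")


def category_means(rows):
    """Mean score per category; arena rows form their own Elo-scale group."""
    groups = {}
    for _, category, score in rows:
        groups.setdefault(category, []).append(score)
    return {category: mean(scores) for category, scores in groups.items()}


if __name__ == "__main__":
    print(f"percent-scale mean: {percent_scale_mean(ROWS):.1f}")
    for category, value in sorted(category_means(ROWS).items()):
        print(f"{category:10s} {value:.1f}")
```

Keeping the arena rows in a separate group avoids mixing Elo ratings with percentages, which would otherwise push a naive 28-row mean well above 100.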