베타
리더보드/Gemini 3.1 Pro Preview
Google DeepMind logo

Gemini 3.1 Pro Preview

제공 Google DeepMind · 출시일 2026-02-19

60.6
평균 점수
$2.00/1M
입력 가격
$12.00/1M
출력 가격
1.0M tokens (~524 books)
컨텍스트 윈도우
multimodal
유형

Tested on 23 benchmarks with 60.6% average. Top scores: Chatbot Arena Elo — Overall (1492.6%), Chatbot Arena Elo — Coding (1455.7%), ARC-AGI (98.0%).

벤치마크카테고리점수Bar
Chatbot Arena Elo — Overallarena1492.6
Chatbot Arena Elo — Codingarena1455.7
ARC-AGIreasoning98.0
OTIS Mock AIME 2024-2025math95.6
GPQA diamondknowledge92.1
Terminal Benchcoding78.4
SimpleQA Verifiedknowledge77.3
ARC-AGI-2reasoning77.1
SWE-Bench verifiedcoding75.6
SimpleBenchreasoning75.5
WeirdMLcoding72.1
MultiChallengeknowledge71.4
MultiNRCknowledge64.7
Artificial Analysis — Agentic Indexspeed59.1
Artificial Analysis — Quality Indexspeed57.2
Artificial Analysis — Coding Indexspeed55.5
Chess Puzzlesknowledge55.0
FrontierMath-2025-02-28-Privatemath36.9
APEX-Agentsagentic33.5
VisualToolBench (VTB)knowledge29.0
PostTrainBenchknowledge21.6
EnigmaEvalknowledge19.8
FrontierMath-Tier-4-2025-07-01-Privatemath16.7