测试版
排行榜/Gemini 3 Flash Preview
Google DeepMind logo

Gemini 3 Flash Preview

来自 Google DeepMind · 发布于 2025-12-17

49.1
平均分
$0.50/1M
输入价格
$3.00/1M
输出价格
1.0M tokens (~524 books)
上下文窗口
multimodal
类型

Tested on 24 benchmarks with 49.1% average. Top scores: Chatbot Arena Elo — Overall (1473.9%), Chatbot Arena Elo — Coding (1436.4%), OTIS Mock AIME 2024-2025 (92.8%).

基准测试类别分数Bar
Chatbot Arena Elo — Overallarena1473.9
Chatbot Arena Elo — Codingarena1436.4
OTIS Mock AIME 2024-2025math92.8
GeoBenchknowledge88.0
GPQA diamondknowledge77.6
SWE-Bench verifiedcoding75.4
SimpleQA Verifiedknowledge67.4
Terminal Benchcoding64.3
WeirdMLcoding61.6
VPCTknowledge58.9
MCP Atlasagentic57.4
SimpleBenchreasoning53.3
Artificial Analysis — Agentic Indexspeed49.7
Balrogknowledge48.1
Artificial Analysis — Quality Indexspeed46.4
Artificial Analysis — Coding Indexspeed42.6
Chess Puzzlesknowledge38.0
FrontierMath-2025-02-28-Privatemath35.6
ARC-AGI-2reasoning33.6
APEX-Agentsagentic24.0
SciPredictknowledge22.2
ARC-AGIreasoning21.5
GSO-Benchcoding9.8
FrontierMath-Tier-4-2025-07-01-Privatemath4.2