测试版
排行榜/Gemini 2.0 Flash
Google DeepMind logo

Gemini 2.0 Flash

来自 Google DeepMind · 发布于 2025-02-05

48.0
平均分
$0.10/1M
输入价格
$0.40/1M
输出价格
1.0M tokens (~524 books)
上下文窗口
multimodal
类型

Tested on 20 benchmarks with 48.0% average. Top scores: Chatbot Arena Elo — Overall (1360.0%), HELM — IFEval (84.1%), MATH level 5 (82.2%).

基准测试类别分数Bar
Chatbot Arena Elo — Overallarena1360.0
HELM — IFEvallanguage84.1
MATH level 5math82.2
HELM — WildBenchreasoning80.0
GeoBenchknowledge77.0
HELM — MMLU-Proknowledge73.7
MMLUknowledge72.9
Lech Mazur Writingknowledge71.5
Fiction.LiveBenchknowledge61.1
HELM — GPQAknowledge55.6
GPQA diamondknowledge52.2
HELM — Omni-MATHmath45.9
Aider polyglotcoding38.2
OTIS Mock AIME 2024-2025math31.0
CadEvalcoding30.0
WeirdMLcoding25.8
SimpleBenchreasoning17.3
The Agent Companyagentic11.4
FrontierMath-2025-02-28-Privatemath1.7
ARC-AGI-2reasoning1.3