베타
리더보드/Gemini 2.0 Flash
Google DeepMind logo

Gemini 2.0 Flash

제공 Google DeepMind · 출시일 2025-02-05

48.0
평균 점수
$0.10/1M
입력 가격
$0.40/1M
출력 가격
1.0M tokens (~524 books)
컨텍스트 윈도우
multimodal
유형

Tested on 20 benchmarks with 48.0% average. Top scores: Chatbot Arena Elo — Overall (1360.0%), HELM — IFEval (84.1%), MATH level 5 (82.2%).

벤치마크카테고리점수Bar
Chatbot Arena Elo — Overallarena1360.0
HELM — IFEvallanguage84.1
MATH level 5math82.2
HELM — WildBenchreasoning80.0
GeoBenchknowledge77.0
HELM — MMLU-Proknowledge73.7
MMLUknowledge72.9
Lech Mazur Writingknowledge71.5
Fiction.LiveBenchknowledge61.1
HELM — GPQAknowledge55.6
GPQA diamondknowledge52.2
HELM — Omni-MATHmath45.9
Aider polyglotcoding38.2
OTIS Mock AIME 2024-2025math31.0
CadEvalcoding30.0
WeirdMLcoding25.8
SimpleBenchreasoning17.3
The Agent Companyagentic11.4
FrontierMath-2025-02-28-Privatemath1.7
ARC-AGI-2reasoning1.3