ベータ
ランキング/Gemini 2.5 Flash
Google DeepMind logo

Gemini 2.5 Flash

開発元 Google DeepMind · リリース日 2025-06-17

40.0
平均スコア
$0.30/1M
入力料金
$2.50/1M
出力料金
1.0M tokens (~524 books)
コンテキストウィンドウ
multimodal
タイプ

Tested on 25 benchmarks with 40.0% average. Top scores: Chatbot Arena Elo — Overall (1411.0%), HELM — IFEval (89.8%), HELM — WildBench (81.7%).

ベンチマークカテゴリスコアBar
Chatbot Arena Elo — Overallarena1411.0
HELM — IFEvallanguage89.8
HELM — WildBenchreasoning81.7
Lech Mazur Writingknowledge76.5
OTIS Mock AIME 2024-2025math73.0
GeoBenchknowledge73.0
HELM — MMLU-Proknowledge63.9
Fiction.LiveBenchknowledge47.2
Aider polyglotcoding47.1
The Agent Companyagentic41.1
WeirdMLcoding41.0
AudioMultiChallengeknowledge40.0
AudioMultiChallenge — Text Outputknowledge40.0
HELM — GPQAknowledge39.0
HELM — Omni-MATHmath38.4
Balrogknowledge33.5
ARC-AGIreasoning32.3
SimpleBenchreasoning29.4
DeepResearch Benchknowledge29.2
Terminal Benchcoding17.1
HLEknowledge7.7
VPCTknowledge7.0
FrontierMath-2025-02-28-Privatemath4.8
FrontierMath-Tier-4-2025-07-01-Privatemath4.2
ARC-AGI-2reasoning2.5