测试版
排行榜/Gemini 2.5 Flash
Google DeepMind logo

Gemini 2.5 Flash

来自 Google DeepMind · 发布于 2025-06-17

40.0
平均分
$0.30/1M
输入价格
$2.50/1M
输出价格
1.0M tokens (~524 books)
上下文窗口
multimodal
类型

Tested on 25 benchmarks with 40.0% average. Top scores: Chatbot Arena Elo — Overall (1411.0%), HELM — IFEval (89.8%), HELM — WildBench (81.7%).

基准测试类别分数Bar
Chatbot Arena Elo — Overallarena1411.0
HELM — IFEvallanguage89.8
HELM — WildBenchreasoning81.7
Lech Mazur Writingknowledge76.5
OTIS Mock AIME 2024-2025math73.0
GeoBenchknowledge73.0
HELM — MMLU-Proknowledge63.9
Fiction.LiveBenchknowledge47.2
Aider polyglotcoding47.1
The Agent Companyagentic41.1
WeirdMLcoding41.0
AudioMultiChallengeknowledge40.0
AudioMultiChallenge — Text Outputknowledge40.0
HELM — GPQAknowledge39.0
HELM — Omni-MATHmath38.4
Balrogknowledge33.5
ARC-AGIreasoning32.3
SimpleBenchreasoning29.4
DeepResearch Benchknowledge29.2
Terminal Benchcoding17.1
HLEknowledge7.7
VPCTknowledge7.0
FrontierMath-2025-02-28-Privatemath4.8
FrontierMath-Tier-4-2025-07-01-Privatemath4.2
ARC-AGI-2reasoning2.5