How much does Gemini 2.5 Flash cost?

Gemini 2.5 Flash costs $0.30 per million input tokens and $2.50 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.003 per message.

What benchmarks has Gemini 2.5 Flash been tested on?

Gemini 2.5 Flash has been evaluated on 25 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1411.0, HELM — IFEval: 89.8, HELM — WildBench: 81.7.

Is Gemini 2.5 Flash open source?

No, Gemini 2.5 Flash is a proprietary model by Google DeepMind.

How does Gemini 2.5 Flash compare to Hermes 2 Pro - Llama-3 8B?

Gemini 2.5 Flash has an average score of 38.1 while Hermes 2 Pro - Llama-3 8B scores 38.2. Hermes 2 Pro - Llama-3 8B slightly outperforms Gemini 2.5 Flash overall. Gemini 2.5 Flash costs $0.30/1M input vs Hermes 2 Pro - Llama-3 8B at $0.14/1M input. See full comparison →

Home/Models/Gemini 2.5 Flash

Gemini 2.5 Flash

Name: Gemini 2.5 Flash
Price: 0.3 USD
Author: Google DeepMind

by Google DeepMind · Released Jun 2025

Multimodal1M Context

38.1

avg score

Rank #153

Compare

Better than 34% of all models

Context

1.0M tokens (~524 books)

Input $/1M

$0.30

Output $/1M

$2.50

Type

multimodal

License

Proprietary

Benchmarks

25 tested

Data updated today

About

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Tested on 25 benchmarks with 40.0% average. Top scores: Chatbot Arena Elo — Overall (1411.0%), HELM — IFEval (89.8%), HELM — WildBench (81.7%).

Looking for similar performance at lower cost?
GLM 4 32B scores 37.8 (99% as good) at $0.10/1M input · 67% cheaper

Capabilities

coding

35.1

#109 globally

reasoning

36.5

#74 globally

math

30.1

#132 globally

knowledge

41.5

#143 globally

agentic

41.1

#6 globally

language

89.8

#14 globally

Benchmark Scores

Compare All

Tested on 25 benchmarks · Ranked across 7 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

47.1—

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

41.0—

Terminal Bench

Complex terminal-based engineering tasks. Models must use command-line tools, navigate filesystems, and debug systems through shell interaction.

17.1—

reasoningCompare reasoning →

HELM — WildBench

Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.

81.7—

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

32.3—

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

29.4—

mathCompare math →

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

73.0—

HELM — Omni-MATH

Stanford HELM evaluation of mathematical reasoning across diverse problem types.

38.4—

FrontierMath-2025-02-28-Private

Original research-level math problems created by professional mathematicians. Problems are unpublished and cannot be memorized.

4.8—

Quick compare:

vs Hermes 2 Pro - Llama-3 8B

vs Command R+ (08-2024)

vs DeepSeek R1 Distill Qwen 7B

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Google DeepMind Gemini 2.5 Flash

Gemini 2.5 FlashJun 2025

40.0

$0.30/M in1.0Mctx25 benchmarks

Gemini 2.5 Flash LiteJul 2025

59.1+19.1

$0.10/M in(-0.20)1.0Mctx8 benchmarks

Gemini 2.5 Flash Lite Preview 09-2025Sep 2025

$0.10/M in1.0Mctx

See the full Gemini 2.5 Flash family →

Recently Happened

Gemini 2.5 Flash output price reduced to $2.50 per 1M tokens

Mar 5, 2026

Similar Models

Hermes 2 Pro - Llama-3 8B

DeepSeek R1 Distill Qwen 7B

DeepSeek

38.3TBD

Links

Info

Google DeepMind Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@Google DeepMind

BenchGecko API

gemini-2-5-flash

Specifications

Typemultimodal
Context1.0M tokens (~524 books)
ReleasedJun 2025
LicenseProprietary
StatusActive
Cost / Message~$0.003

Available On

Google DeepMind$0.30

Frequently Asked Questions

Gemini 2.5 Flash is a proprietary multimodal AI model by Google DeepMind, released in June 2025. It has an average benchmark score of 38.1. Context window: 1M tokens.

Benchmarks

Chatbot Arena Elo — Overall HELM — IFEval HELM — WildBench Lech Mazur Writing OTIS Mock AIME 2024-2025

Google DeepMind · Provider Google DeepMind · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

Gemini 2.5 Flash

Frequently Asked Questions

Related Models

Benchmarks

Related Pages