Gemini 2.0 Flash Thinking (Jan 2025)
by Google DeepMind · Released Jan 2025
32.9
avg score
Rank #170
Better than 27% of all models
Context
N/A
Input $/1M
TBD
Output $/1M
TBD
Type
text
License
Proprietary
Benchmarks
7 tested
About
Tested on 7 benchmarks with 37.7% average. Top scores: Lech Mazur Writing (73.8%), OTIS Mock AIME 2024-2025 (57.7%), Fiction.LiveBench (52.8%).
Capabilities
coding
18.2
#128 globally
reasoning
16.8
#107 globally
math
57.7
#54 globally
knowledge
42.8
#135 globally
Benchmark Scores
Tested on 7 benchmarks · Ranked across 4 categories
Score Distribution (all 233 models; chart axis runs from 0 to 100)
coding
Aider polyglot
18.2: Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.
reasoning
SimpleBench
16.8: Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.
math
OTIS Mock AIME 2024-2025
57.7: Mock AIME (American Invitational Mathematics Examination) problems from OTIS. Tests mathematical competition performance.
Excellent (85+) · Good (70-85) · Average (50-70) · Below (<50)
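The legend bands and the "Better than 27% of all models" figure can be reproduced with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, the treatment of band edges (for example, whether exactly 70 counts as Good) is an assumption the legend does not settle, and the percentile formula is a guess that happens to match the page's number for rank 170 of 233.

```python
def score_band(score: float) -> str:
    """Map a 0-100 benchmark score to the legend's band name."""
    # Assumption: each lower edge is inclusive (85 -> Excellent, 70 -> Good,
    # 50 -> Average). The legend itself does not say which side owns the edge.
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


def percent_better_than(rank: int, total: int) -> int:
    """Share of models ranked below `rank`, rounded to a whole percent."""
    # Assumed formula: (total - rank) / total. Not confirmed by the source.
    return round(100 * (total - rank) / total)


# Scores listed on this page for the three benchmarks shown in detail.
scores = {
    "Aider polyglot": 18.2,
    "SimpleBench": 16.8,
    "OTIS Mock AIME 2024-2025": 57.7,
}
bands = {name: score_band(value) for name, value in scores.items()}
```

Under these assumptions the OTIS Mock AIME score falls in the Average band, while the Aider polyglot and SimpleBench scores fall in Below.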
Links
BenchGecko API
gemini-2-0-flash-thinking-jan-2025
Specifications
- Type: text
- Context: N/A
- Released: Jan 2025
- License: Proprietary
- Status: benchmark-only
Frequently Asked Questions
Gemini 2.0 Flash Thinking (Jan 2025) is a proprietary text model from Google DeepMind, released in January 2025. Its average benchmark score is 32.9.