Gemini 2.5 Pro
by Google DeepMind
53.6
best score
53.6%
Best Score
bash-only
Best Leaderboard
1
Models Used
Yes
Open Source
Score History
| Entry | Score |
|---|---|
| Gemini 3 Pro | 69.6% |
| Gemini 3 Flash (high reasoning) | 75.8% |
| Gemini 3 Flash | 72.7% |
| Gemini 3 Pro | 68.7% |
| Gemini 3 Pro Preview (2025-11-18) | 74.2% |
| Gemini 2.5 Pro (2025-05-06) | 53.6% |
| Gemini 2.5 Flash (2025-04-17) | 28.7% |
| Gemini 2.0 flash | 13.5% |
| mini-SWE-agent + Gemini 2.5 Pro (2025-05-06) | 53.6% |
| TRAE + Claude Sonnet 4 + Opus 4 + Sonnet 3.7 + Gemini 2.5 Pro | 75.2% |