o4-mini
by OpenAI
45.0
best score
45.0%
Best Score
bash-only
Best Leaderboard
1
Models Used
Yes
Open Source
Score History
| Entry | Score |
|---|---|
| GPT-5 Mini | 56.2% |
| GPT-5 mini | 39.7% |
| GPT-5 mini (2025-08-07) (medium reasoning) | 59.8% |
| o4-mini (2025-04-16) | 45.0% |
| mini-SWE-agent + o4-mini (2025-04-16) | 45.0% |
| GPT-4.1-mini (2025-04-14) | 23.9% |
| Refact.ai Agent + Claude 4 Sonnet + o4-mini | 74.4% |
| GUIRepair + o4-mini (2025-04-16) | 33.9% |