GPT-4.1-mini
by OpenAI
23.9
best score
23.9%
Best Score
bash-only
Best Leaderboard
1
Models Used
Yes
Open Source
Score History
| Entry | Score |
|---|---|
| GPT-5-2 Codex | 66.3% |
| GPT 5.2 Codex | 66.3% |
| GPT-5-2 Codex | 72.8% |
| GPT 5.2 Codex | 72.8% |
| GPT-5-2 (high reasoning) | 72.8% |
| GPT-5 Mini | 56.2% |
| GPT-5.2 (high reasoning) | 66.7% |
| GPT-5 mini | 39.7% |
| GPT-5.2 (2025-12-11) (high reasoning) | 71.8% |
| GPT-5.2 (2025-12-11) | 69.0% |
| GPT-5.1-codex (medium reasoning) | 66.0% |
| GPT-5.1 (2025-11-13) (medium reasoning) | 66.0% |
| GPT-5 (2025-08-07) (medium reasoning) | 65.0% |
| GPT-5 mini (2025-08-07) (medium reasoning) | 59.8% |
| GPT-5 nano (2025-08-07) (medium reasoning) | 34.8% |
| gpt-oss-120b | 26.0% |
| o4-mini (2025-04-16) | 45.0% |
| GPT-4.1 (2025-04-14) | 39.6% |
| GPT-4.1-mini (2025-04-14) | 23.9% |
| GPT-4o (2024-11-20) | 21.6% |
| mini-SWE-agent + GPT-4.1-mini (2025-04-14) | 23.9% |