Claude 3.7 Sonnet
by Anthropic
52.8
best score
52.8%
Best Score
bash-only
Best Leaderboard
1
Models Used
Yes
Open Source
Score History
| Entry | Score |
|---|---|
| Claude 4.5 Opus (high reasoning) | 76.8% |
| Claude Opus 4.6 | 75.6% |
| Claude 4.5 Sonnet (high reasoning) | 71.4% |
| Claude 4.5 Haiku (high reasoning) | 66.6% |
| Claude 4.6 Opus | 72.0% |
| Claude 4.5 Opus | 70.7% |
| Claude 4.5 Sonnet | 67.0% |
| Claude 4.5 Haiku | 64.7% |
| Claude 4.5 Opus medium (20251101) | 74.4% |
| Claude 4.5 Sonnet (20250929) | 70.6% |
| Claude 4 Opus (20250514) | 67.6% |
| Claude 4 Sonnet (20250514) | 64.9% |
| Claude 3.7 Sonnet (20250219) | 52.8% |
| mini-SWE-agent + Claude 3.7 Sonnet (20250219) | 52.8% |
| Aime-coder v1 + Anthopic Claude 3.7 Sonnet | 66.4% |
| SWE-agent 1.0 (Claude 3.7 Sonnet) | 33.8% |
| SWE-agent + Claude 3.7 Sonnet | 48.0% |
| SWE-agent + Claude 3.7 Sonnet w/ Review Heavy | 62.4% |
| Tools + Claude 3.7 Sonnet (2025-02-24) | 63.2% |
| Tools + Claude 3.5 Sonnet (2024-10-22) | 49.0% |
| Tools + Claude 3.5 Haiku | 40.6% |