Chess Puzzles
Chess Puzzles β tests strategic and tactical reasoning by having models solve chess puzzle positions, evaluating lookahead and pattern recognition abilities.
29
Models Tested
55.0
Top Score
23.2
Average Score
Rankings
| # | Model | Score | Bar |
|---|---|---|---|
| 1 | 55.0 | ||
| 2 | 49.0 | ||
| 3 | 49.0 | ||
| 4 | 38.0 | ||
| 5 | 37.0 | ||
| 6 | 37.0 | ||
| 7 | 32.0 | ||
| 8 | 32.0 | ||
| 9 | 31.0 | ||
| 10 | 28.0 | ||
| 11 | 26.0 | ||
| 12 | M Kimi K2 Thinkingmoonshotai | 20.0 | |
| 13 | 20.0 | ||
| 14 | 20.0 | ||
| 15 | 20.0 | ||
| 16 | 20.0 | ||
| 17 | 20.0 | ||
| 18 | 17.0 | ||
| 19 | 17.0 | ||
| 20 | 14.0 | ||
| 21 | 13.0 | ||
| 22 | M Kimi K2.5moonshotai | 12.0 | |
| 23 | 12.0 | ||
| 24 | 12.0 | ||
| 25 | 12.0 | ||
| 26 | ZA GLM 5 Turboz-ai | 10.0 | |
| 27 | ZA GLM 5z-ai | 10.0 | |
| 28 | ZA GLM 4.7z-ai | 6.0 | |
| 29 | 4.0 |