API
Benchmarks/Chess Puzzles

Chess Puzzles

Chess Puzzles β€” tests strategic and tactical reasoning by having models solve chess puzzle positions, evaluating lookahead and pattern recognition abilities.

29
Models Tested
55.0
Top Score
23.2
Average Score