API
Benchmarks/Fiction.LiveBench

Fiction.LiveBench

Fiction.LiveBench β€” a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.

53
Models Tested
97.2
Top Score
60.4
Average Score