Fiction.LiveBench
Fiction.LiveBench β a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.
53
Models Tested
97.2
Top Score
60.4
Average Score
Fiction.LiveBench β a continuously updated benchmark using recently published fiction to test reading comprehension and reasoning, preventing data contamination.