Knowledge & QA
Factual knowledge, question answering, and academic reasoning, tested across science, history, medicine, law, and more.
Models Ranked: 170
Top Score: 91.7
Average Score: 49.9
Benchmarks: 20
Benchmarks in This Skill
ARC AI2, HellaSwag, LAMBADA, MMLU, GPQA diamond, Winogrande, Lech Mazur Writing, Fiction.LiveBench, SimpleQA Verified, Chess Puzzles, HLE, TriviaQA, ScienceQA, PIQA, OpenBookQA, Balrog, GeoBench, ANLI, DeepResearch Bench, VPCT