Benchmark · Knowledge · Competitive

SimpleQA Verified

SimpleQA Verified · short factual questions with verified answers, measuring factual accuracy and the tendency to hallucinate or provide incorrect information.

Updated: 2026-03-05
Models tested: 32
Top score: 77.3 (Gemini 3.1 Pro Preview)
Median: 36.8 (min 5.9)
Top-5 spread: σ 4.2 (Competitive)

Best score over time · one chart, every benchmark

[Chart: SimpleQA Verified · 30 models · frontier running max. Y-axis: score, 0 to 100, higher is better. X-axis: release date, Nov 24 to Mar 26. Source: benchgecko.ai/benchmark/simpleqa-verified]
Frontier on SimpleQA Verified rose from 6.7 to 77.3 in 16 months · +70.6 points · latest leader Gemini 3.1 Pro Preview from Google DeepMind.
Pink dots = frontier records · 6 total
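The "frontier running max" and the count of frontier records can be read as a running maximum over scores ordered by release date. The sketch below is one plausible way to compute it, not the site's actual method. The function name `frontier_records` and the sample values are hypothetical placeholders rather than leaderboard data, and treating ties as non-records is an assumption.

```python
from datetime import date


def frontier_records(results):
    """Return the (release_date, score) pairs that set a new running maximum."""
    records = []
    best = float("-inf")
    # Sort by release date so the running max follows chronological order.
    for released, score in sorted(results, key=lambda r: r[0]):
        # Assumption: only a strictly higher score counts as a new record.
        if score > best:
            best = score
            records.append((released, score))
    return records


# Hypothetical placeholder data, NOT actual SimpleQA Verified results.
sample = [
    (date(2025, 1, 1), 10.0),
    (date(2025, 3, 1), 8.0),   # below the current best, not a record
    (date(2025, 6, 1), 25.0),
    (date(2025, 9, 1), 25.0),  # tie, not a record under the strict rule
    (date(2025, 12, 1), 40.0),
]

records = frontier_records(sample)
# Frontier gain: latest record minus the first record.
gain = records[-1][1] - records[0][1]
```

Applied to the real leaderboard, the same calculation would give the number of records (6 in the chart) and the frontier gain (77.3 − 6.7 = 70.6 points).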

Same category · related evaluations