API
Benchmarks/DeepResearch Bench

DeepResearch Bench

DeepResearch Bench β€” evaluates AI on complex multi-step research tasks requiring information gathering, synthesis, and producing comprehensive analyses.

12
Models Tested
52.6
Top Score
46.7
Average Score