Benchmark · Knowledge

CSQA2

Updated 2024-01-25
Models tested
2
Top score
14.0
GPT-3.5 Turbo (older v0613)
Median
7.0
min 0.1
Top-5 spread
σ 7.0
wide open
CSQA2 \u00B7 TOP 20255075100#1GPT-3.5 Turbo (older v0…14.0#2Llama 2-13B0.1benchgecko.ai/benchmark/csqa2

2 models tested · sorted by score

#ModelScore
1OpenAI logoGPT-3.5 Turbo (older v0613)14.0
2Meta logoLlama 2-13B0.1

Same category · related evaluations