Benchmark · Knowledge

PropensityBench

Updated 2024-09-17
Models tested: 1
Top score: 22.9 (Qwen2.5 32B Instruct)
Median: 22.9 (min 22.9)
Top-5 spread: σ 0.0 (Settled)

1 model tested · sorted by score

#   Model                            Score
1   Qwen2.5 32B Instruct (Alibaba)   22.9
