Benchmark · Knowledge

MMMLU · Chinese

Updated 2024-06-04
Models tested
2
Top score
61.8
Qwen2 7B Instruct
Median
56.6
min 51.4
Top-5 spread
σ 5.2
wide open
MMMLU · CHINESE \u00B7 TOP 20255075100#1Qwen2 7B Instruct61.8#2Meta Llama 3 8B Instruct51.4benchgecko.ai/benchmark/mmmlu-zh

2 models tested · sorted by score

Same category · related evaluations