Benchmark · Knowledge

MMMLU · Japanese

Updated 2024-06-04
Models tested
2
Top score
56.6
Qwen2 7B Instruct
Median
49.5
min 42.3
Top-5 spread
σ 7.2
wide open
MMMLU · JAPANESE \u00B7 TOP 20255075100#1Qwen2 7B Instruct56.6#2Meta Llama 3 8B Instruct42.3benchgecko.ai/benchmark/mmmlu-ja

2 models tested · sorted by score

Same category · related evaluations