베타
리더보드/Qwen3 4B Thinking 2507
Alibaba logo

Qwen3 4B Thinking 2507

오픈소스

제공 Alibaba · 출시일 2025-08-05

60.6
평균 점수
N/A
입력 가격
N/A
출력 가격
N/A
컨텍스트 윈도우
text-generation
유형

Tested on 6 benchmarks with 60.6% average. Top scores: OpenCompass — IFEval (88.5%), OpenCompass — AIME2025 (80.0%), OpenCompass — MMLU-Pro (72.8%).

벤치마크카테고리점수Bar
OpenCompass — IFEvallanguage88.5
OpenCompass — AIME2025math80.0
OpenCompass — MMLU-Proknowledge72.8
OpenCompass — GPQA-Diamondknowledge64.7
OpenCompass — LiveCodeBenchV6coding51.6
OpenCompass — HLEknowledge6.0