Qwen3 4B Instruct 2507
Open source by Alibaba · Released 2025-08-05
47.2
Average score
N/A
Input price
N/A
Output price
N/A
Context window
text-generation
Type
Tested on 6 benchmarks, with an average score of 47.2%. Top OpenCompass scores: IFEval (82.4%), MMLU-Pro (63.0%), GPQA-Diamond (52.3%).
Benchmark scores
| Benchmark | Category | Score (%) |
|---|---|---|
| OpenCompass — IFEval | language | 82.4 |
| OpenCompass — MMLU-Pro | knowledge | 63.0 |
| OpenCompass — GPQA-Diamond | knowledge | 52.3 |
| OpenCompass — AIME2025 | math | 46.9 |
| OpenCompass — LiveCodeBenchV6 | coding | 33.5 |
| OpenCompass — HLE | knowledge | 5.1 |
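
The 47.2 average reported for this model can be checked against the six scores in the table. The sketch below assumes the average is a plain unweighted mean rounded to one decimal place; the page does not state how it is computed. The `scores` dictionary and the short benchmark keys are illustrative names, not part of the source.

```python
from statistics import mean

# Scores (percent) copied from the benchmark table for Qwen3 4B Instruct 2507.
# Keys are shortened benchmark names, chosen here for illustration only.
scores = {
    "IFEval": 82.4,
    "MMLU-Pro": 63.0,
    "GPQA-Diamond": 52.3,
    "AIME2025": 46.9,
    "LiveCodeBenchV6": 33.5,
    "HLE": 5.1,
}

# Assumption: the reported average is an unweighted mean over all benchmarks,
# rounded to one decimal place.
average = round(mean(scores.values()), 1)

# The three highest-scoring benchmarks, best first.
top_three = sorted(scores, key=scores.get, reverse=True)[:3]

print(f"Average over {len(scores)} benchmarks: {average}")
print("Top three:", ", ".join(top_three))
```

Under that assumption, the computed mean matches the reported figure, and the three highest entries match the top scores listed in the summary.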
Similar models
Alibaba
47.3
Google DeepMind
47.4
Meta
46.9
OpenAI
46.9