Qwen3 4B Instruct 2507
Open Sourcedi Alibaba · Rilascio 2025-08-05
47.2
punteggio medio
N/A
Prezzo Input
N/A
Prezzo Output
N/A
Finestra di Contesto
text-generation
Tipo
Tested on 6 benchmarks with 47.2% average. Top scores: OpenCompass — IFEval (82.4%), OpenCompass — MMLU-Pro (63.0%), OpenCompass — GPQA-Diamond (52.3%).
Punteggi Benchmark
| Benchmark | Categoria | Punteggio | Bar |
|---|---|---|---|
| OpenCompass — IFEval | language | 82.4 | |
| OpenCompass — MMLU-Pro | knowledge | 63.0 | |
| OpenCompass — GPQA-Diamond | knowledge | 52.3 | |
| OpenCompass — AIME2025 | math | 46.9 | |
| OpenCompass — LiveCodeBenchV6 | coding | 33.5 | |
| OpenCompass — HLE | knowledge | 5.1 |
Modelli Simili
Alibaba
47.3
Google DeepMind
47.4
Meta
46.9
OpenAI
46.9