How much does Qwen2.5 32B Instruct cost?

Qwen2.5 32B Instruct is open source and can be self-hosted.

What benchmarks has Qwen2.5 32B Instruct been tested on?

Qwen2.5 32B Instruct has been evaluated on 7 benchmarks. Top scores: IFEval: 83.5, MATH Level 5: 62.5, BBH (HuggingFace): 56.5.

Is Qwen2.5 32B Instruct open source?

Yes, Qwen2.5 32B Instruct is open source.

How does Qwen2.5 32B Instruct compare to Claude Opus 4.6?

Qwen2.5 32B Instruct has an average score of 81.3 while Claude Opus 4.6 scores 81.1. Qwen2.5 32B Instruct outperforms Claude Opus 4.6 overall. See full comparison →

Home/Models/Qwen2.5 32B Instruct

Qwen2.5 32B Instruct

Name: Qwen2.5 32B Instruct
Author: Alibaba

by Alibaba · Released Sep 2024

Open Source

81.3

avg score

Rank #20

Compare

Better than 91% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

7 tested

Data updated today

About

Qwen text generation model. 3882K downloads on HuggingFace.

Tested on 7 benchmarks with 43.2% average. Top scores: IFEval (83.5%), MATH Level 5 (62.5%), BBH (HuggingFace) (56.5%).

Capabilities

reasoning

13.5

#116 globally

math

62.5

#43 globally

knowledge

31.8

#170 globally

language

83.5

#39 globally

general

56.5

#4 globally

safety

22.9

#3 globally

Benchmark Scores

Compare All

Tested on 7 benchmarks · Ranked across 6 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

13.5—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

62.5—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

51.9—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

11.7—

Quick compare:

vs Claude Opus 4.6

vs MiMo-V2-Flash

vs GPT-5.3-Codex

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen 2.5

Qwen2.5 0.5B InstructSep 2024

10.1

N/AN/Actx6 benchmarks

Qwen2.5 1.5B InstructSep 2024

18.4+8.3

N/AN/Actx6 benchmarks

Qwen2.5 1.5B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 1.5B Instruct GGUFSep 2024

N/AN/Actx

Qwen2.5 14B InstructSep 2024

41.6+41.6

N/AN/Actx6 benchmarks

Qwen2.5 14B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 32B InstructSep 2024

43.2+43.2

N/AN/Actx7 benchmarks

Qwen2.5 32B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 32B Instruct GPTQ Int4Sep 2024

N/AN/Actx

Qwen2.5 3B InstructSep 2024

27.2+27.2

N/AN/Actx6 benchmarks

Qwen2.5 3B Instruct GGUFSep 2024

N/AN/Actx

Qwen2.5 72B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 7B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 Coder 0.5B InstructNov 2024

14.3+14.3

N/AN/Actx1 benchmark

Qwen2.5 Coder 1.5B InstructSep 2024

38.8+24.5

N/AN/Actx6 benchmarks

Qwen2.5 Coder 14B InstructNov 2024

37.4-1.4

N/AN/Actx7 benchmarks

Qwen2.5 Coder 32B Instruct AWQNov 2024

N/AN/Actx

Qwen2.5 Coder 7B Instruct AWQSep 2024

N/AN/Actx

Qwen2.5 Coder 7B Instruct GPTQ Int4Sep 2024

N/AN/Actx

Qwen2.5 Math 1.5BSep 2024

N/AN/Actx

Qwen2.5 VL 3B InstructJan 2025

N/AN/Actx

Qwen2.5 VL 7B InstructJan 2025

N/AN/Actx

Qwen2.5 VL 7B Instruct AWQFeb 2025

N/AN/Actx

See the full Qwen 2.5 family →

Similar Models

Links

Info

Research

Documentation

Community

Source Code

BenchGecko API

qwen-qwen25-32b-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedSep 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Frequently Asked Questions

Qwen2.5 32B Instruct is an open-source text-generation AI model by Alibaba, released in September 2024. It has an average benchmark score of 81.3.

Benchmarks

IFEval MATH Level 5 BBH (HuggingFace)MMLU-PRO PropensityBench

Alibaba · Provider All Models Compare Models

Qwen2.5 32B Instruct

Frequently Asked Questions

Related Models

Benchmarks

Related Pages