How much does Qwen2-72B cost?

Qwen2-72B is open source and can be self-hosted.

What benchmarks has Qwen2-72B been tested on?

Qwen2-72B has been evaluated on 12 benchmarks. Top scores: CMMLU: 89.7, MMLU: 76.5, Aider — Code Editing: 55.6.

How does Qwen2-72B compare to Grok 4?

Qwen2-72B has an average score of 62.3 while Grok 4 scores 62.2. Qwen2-72B outperforms Grok 4 overall. See full comparison →

Home/Models/Qwen2-72B

Qwen2-72B

Name: Qwen2-72B
Author: Alibaba Qwen

by Alibaba Qwen · Released Jan 2024

Open Source

62.3

avg score

Rank #58

Compare

Better than 75% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text

License

Open Source

Benchmarks

12 tested

Data updated today

About

Tested on 12 benchmarks with 41.3% average. Top scores: CMMLU (89.7%), MMLU (76.5%), Aider — Code Editing (55.6%).

Capabilities

coding

55.6

#51 globally

reasoning

19.7

#102 globally

math

35.1

#119 globally

knowledge

51.8

#91 globally

agentic

1.1

#38 globally

language

38.2

#116 globally

general

51.9

#9 globally

Benchmark Scores

Compare All

Tested on 12 benchmarks · Ranked across 7 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

Aider — Code Editing

Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.

55.6—

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

19.7—

mathCompare math →

MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

39.1—

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

31.1—

Quick compare:

vs Grok 4

vs GPT-3.5 Turbo (older v0613)

vs GPT-5 Mini

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Grok 4

xAI

62.2$3.00/1M

GPT-3.5 Turbo (older v0613)

Links

Info

Alibaba Qwen Pricing explorer Developers · API

Research

Documentation

Community

Source Code

BenchGecko API

qwen2-72b

Specifications

Typetext
ContextN/A
ReleasedJan 2024
LicenseOpen Source
Statusbenchmark-only

Available On

Alibaba QwenTBD

Frequently Asked Questions

Qwen2-72B is an open-source text AI model by Alibaba Qwen, released in January 2024. It has an average benchmark score of 62.3.

Benchmarks

CMMLU MMLU Aider — Code Editing MMLU-PRO BBH (HuggingFace)

Alibaba Qwen · Provider Alibaba Qwen · Economy All Models Compare Models Pricing Developers · API

Qwen2-72B

Frequently Asked Questions

Related Models

Benchmarks

Related Pages