How much does Qwen2.5 Coder 7B Instruct cost?

Qwen2.5 Coder 7B Instruct is open source and can be self-hosted.

What benchmarks has Qwen2.5 Coder 7B Instruct been tested on?

Qwen2.5 Coder 7B Instruct has been evaluated on 12 benchmarks. Top scores: GSM8K: 86.7, HellaSwag: 69.1, IFEval: 61.0.

Is Qwen2.5 Coder 7B Instruct open source?

Yes, Qwen2.5 Coder 7B Instruct is open source.

How does Qwen2.5 Coder 7B Instruct compare to Qwen2.5 Coder 7B Instruct?

Qwen2.5 Coder 7B Instruct has an average score of 56.6 while Qwen2.5 Coder 7B Instruct scores 56.6. Qwen2.5 Coder 7B Instruct slightly outperforms Qwen2.5 Coder 7B Instruct overall. See full comparison →

Home/Models/Qwen2.5 Coder 7B Instruct

Qwen2.5 Coder 7B Instruct

Name: Qwen2.5 Coder 7B Instruct
Author: Alibaba

by Alibaba · Released Sep 2024

Open Source

56.6

avg score

Rank #106

Compare

Better than 61% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

12 tested

Data updated today

About

Qwen text generation model. 2605K downloads on HuggingFace.

Tested on 12 benchmarks with 44.4% average. Top scores: GSM8K (86.7%), HellaSwag (69.1%), IFEval (61.0%).

Capabilities

coding

57.9

#62 globally

reasoning

9.5

#166 globally

math

61.9

#57 globally

knowledge

42.0

#169 globally

language

61.0

#99 globally

general

28.9

#33 globally

Benchmark Scores

Compare All

Tested on 12 benchmarks · Ranked across 6 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

Aider — Code Editing

Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.

57.9—

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

9.5—

mathCompare math →

GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

86.7—

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

37.2—

Quick compare:

vs Qwen2.5 Coder 7B Instruct

vs Qwen3 Next 80B A3B Thinking

vs DeepSeek V3.2 Exp

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Qwen2.5 Coder 7B Instruct

Alibaba Qwen

56.6$0.03/1M

Qwen3 Next 80B A3B Thinking

Links

Info

Alibaba Pricing explorer Developers · API

Research

Documentation

Community

Source Code

BenchGecko API

qwen-qwen25-coder-7b-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedSep 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Frequently Asked Questions

Qwen2.5 Coder 7B Instruct is an open-source text-generation AI model by Alibaba, released in September 2024. It has an average benchmark score of 56.6.

Benchmarks

GSM8K HellaSwag IFEval Aider — Code Editing MMLU

Alibaba · Provider Alibaba · Economy All Models Compare Models Pricing Developers · API

Qwen2.5 Coder 7B Instruct

Frequently Asked Questions

Related Models

Benchmarks

Related Pages