LIVETracking 971 AI models from 268 providers.

Models971·Providers268·Benchmarks128·Companies71·Agents165·TopQwen3 VL 235B A22B Instruct · 1415.8%·Updatedjust now·Data Points2,902·MCP Servers4,923

Home/Models/Qwen2.5 1.5B Instruct

Qwen2.5 1.5B Instruct

by Alibaba · Released Sep 2024

Open Source

27.3

avg score

Rank #185

Better than 20% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

6 tested

Data updated today

About

Qwen text generation model. 9532K downloads on HuggingFace.

Tested on 6 benchmarks with 18.4% average. Top scores: IFEval (44.8%), MATH Level 5 (22.1%), MMLU-PRO (20.0%).

Capabilities

reasoning

3.2

#171 globally

math

22.1

#152 globally

knowledge

10.4

#207 globally

language

44.8

#108 globally

general

19.8

#42 globally

Benchmark Scores

Tested on 6 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

3.2—

mathCompare math →

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

22.1—

knowledgeCompare knowledge →

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

20.0—

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

0.8—

Quick compare:

vs Grok-2 (Dec 2024)

vs Mistral Large

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen 2.5

Qwen2.5 0.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 1.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 1.5B Instruct AWQSep 2024

Qwen2.5 1.5B Instruct GGUFSep 2024

Qwen2.5 14B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 14B Instruct AWQSep 2024

Qwen2.5 32B InstructSep 2024

N/AN/Actx7 benchmarks

Qwen2.5 32B Instruct AWQSep 2024

Qwen2.5 32B Instruct GPTQ Int4Sep 2024

Qwen2.5 3B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 3B Instruct GGUFSep 2024

Qwen2.5 72B Instruct AWQSep 2024

Qwen2.5 7B Instruct AWQSep 2024

Qwen2.5 Coder 0.5B InstructNov 2024

N/AN/Actx1 benchmark

Qwen2.5 Coder 1.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 Coder 14B InstructNov 2024

N/AN/Actx7 benchmarks

Qwen2.5 Coder 32B Instruct AWQNov 2024

Qwen2.5 Coder 7B Instruct AWQSep 2024

Qwen2.5 Coder 7B Instruct GPTQ Int4Sep 2024

Qwen2.5 Math 1.5BSep 2024

Qwen2.5 VL 3B InstructJan 2025

Qwen2.5 VL 7B InstructJan 2025

Qwen2.5 VL 7B Instruct AWQFeb 2025

See the full Qwen 2.5 family →

Similar Models

Grok-2 (Dec 2024)

Links

Info

Research

Technical Report

Documentation

API Docs Playground

Community

Source Code

GitHub Hugging Face

BenchGecko API

qwen-qwen25-15b-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedSep 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Categories

reasoning math knowledge language general

Learn More

transformer open-weights tokens

Share & Export

Related Models

Grok-2 (Dec 2024)

DeepSeek Coder 33B

Frequently Asked Questions

Qwen2.5 1.5B Instruct is an open-source text-generation AI model by Alibaba, released in September 2024. It has an average benchmark score of 27.3.

Related Models

Grok-2 (Dec 2024) · xAI MPT-30B · Unknown Mistral Large · Mistral AI DeepSeek Coder 33B · DeepSeek Llama 2 7b Chat Hf · Meta

Benchmarks

IFEval MATH Level 5 MMLU-PRO BBH (HuggingFace)MUSR

Related Pages

Alibaba · Provider All Models Compare Models