LIVETracking 971 AI models from 268 providers.

Models971·Providers268·Benchmarks128·Companies71·Agents165·TopQwen3 VL 235B A22B Instruct · 1415.8%·Updatedjust now·Data Points2,902·MCP Servers4,923

Home/Models/Qwen2.5 14B Instruct

Qwen2.5 14B Instruct

by Alibaba · Released Sep 2024

Open Source

71.3

avg score

Rank #36

Better than 84% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

6 tested

Data updated today

About

Qwen text generation model. 1503K downloads on HuggingFace.

Tested on 6 benchmarks with 41.6% average. Top scores: IFEval (81.4%), MATH Level 5 (55.3%), BBH (HuggingFace) (48.6%).

Capabilities

reasoning

10.6

#132 globally

math

55.3

#62 globally

knowledge

26.9

#181 globally

language

81.4

#47 globally

general

48.6

#13 globally

Benchmark Scores

Tested on 6 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

10.6—

mathCompare math →

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

55.3—

knowledgeCompare knowledge →

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

43.2—

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

10.5—

Quick compare:

vs Qwen2.5 Coder 32B Instruct

vs o4 Mini High

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen 2.5

Qwen2.5 0.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 1.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 1.5B Instruct AWQSep 2024

Qwen2.5 1.5B Instruct GGUFSep 2024

Qwen2.5 14B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 14B Instruct AWQSep 2024

Qwen2.5 32B InstructSep 2024

N/AN/Actx7 benchmarks

Qwen2.5 32B Instruct AWQSep 2024

Qwen2.5 32B Instruct GPTQ Int4Sep 2024

Qwen2.5 3B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 3B Instruct GGUFSep 2024

Qwen2.5 72B Instruct AWQSep 2024

Qwen2.5 7B Instruct AWQSep 2024

Qwen2.5 Coder 0.5B InstructNov 2024

N/AN/Actx1 benchmark

Qwen2.5 Coder 1.5B InstructSep 2024

N/AN/Actx6 benchmarks

Qwen2.5 Coder 14B InstructNov 2024

N/AN/Actx7 benchmarks

Qwen2.5 Coder 32B Instruct AWQNov 2024

Qwen2.5 Coder 7B Instruct AWQSep 2024

Qwen2.5 Coder 7B Instruct GPTQ Int4Sep 2024

Qwen2.5 Math 1.5BSep 2024

Qwen2.5 VL 3B InstructJan 2025

Qwen2.5 VL 7B InstructJan 2025

Qwen2.5 VL 7B Instruct AWQFeb 2025

See the full Qwen 2.5 family →

Similar Models

Qwen2.5 Coder 32B Instruct

Links

Info

Research

Technical Report

Documentation

API Docs Playground

Community

Source Code

GitHub Hugging Face

BenchGecko API

qwen-qwen25-14b-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedSep 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Categories

reasoning math knowledge language general

Learn More

transformer open-weights tokens

Share & Export

Related Models

Qwen2.5 Coder 32B Instruct

Frequently Asked Questions

Qwen2.5 14B Instruct is an open-source text-generation AI model by Alibaba, released in September 2024. It has an average benchmark score of 71.3.

Related Models

Qwen2.5 Coder 32B Instruct · Alibaba Qwen o4 Mini High · OpenAI GLM 4.5 · z-ai MiniMax M2 · minimax MiniMax M2.7 · minimax

Benchmarks

IFEval MATH Level 5 BBH (HuggingFace)MMLU-PRO MUSR

Related Pages

Alibaba · Provider All Models Compare Models