How much does Qwen2 1.5B Instruct cost?

Qwen2 1.5B Instruct is open source and can be self-hosted.

What benchmarks has Qwen2 1.5B Instruct been tested on?

Qwen2 1.5B Instruct has been evaluated on 6 benchmarks. Top scores: IFEval: 33.7, MMLU-PRO: 16.7, BBH (HuggingFace): 13.7.

Is Qwen2 1.5B Instruct open source?

Yes, Qwen2 1.5B Instruct is open source.

How does Qwen2 1.5B Instruct compare to Claude 2.1?

Qwen2 1.5B Instruct has an average score of 24.3 while Claude 2.1 scores 24.0. Qwen2 1.5B Instruct outperforms Claude 2.1 overall. See full comparison →

Home/Models/Qwen2 1.5B Instruct

Qwen2 1.5B Instruct

Name: Qwen2 1.5B Instruct
Author: Alibaba

by Alibaba · Released Jun 2024

Open Source

24.3

avg score

Rank #196

Compare

Better than 15% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

6 tested

Data updated today

About

Qwen text generation model. 3026K downloads on HuggingFace.

Tested on 6 benchmarks with 14.1% average. Top scores: IFEval (33.7%), MMLU-PRO (16.7%), BBH (HuggingFace) (13.7%).

Capabilities

reasoning

12.0

#122 globally

math

7.2

#193 globally

knowledge

9.1

#211 globally

language

33.7

#119 globally

general

13.7

#48 globally

Benchmark Scores

Compare All

Tested on 6 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

12.0—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

7.2—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

16.7—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

1.6—

Quick compare:

vs Claude 2.1

vs Mixtral 8x22B Instruct

vs GPT-5.4 Mini

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen 2

Qwen2 0.5BMay 2024

7.2

N/AN/Actx6 benchmarks

Qwen2 0.5B InstructJun 2024

6.6-0.6

N/AN/Actx6 benchmarks

Qwen2 1.5B InstructJun 2024

14.1+7.5

N/AN/Actx6 benchmarks

Qwen2 7B InstructJun 2024

50.5+36.4

N/AN/Actx25 benchmarks

Qwen2 VL 2B InstructAug 2024

N/AN/Actx

Qwen2 VL 7B InstructAug 2024

47.3+47.3

N/AN/Actx11 benchmarks

Qwen2 VL 7B Instruct AWQAug 2024

N/AN/Actx

See the full Qwen 2 family →

Similar Models

Claude 2.1

Anthropic

24.0TBD

Mixtral 8x22B Instruct

Links

Info

Research

Documentation

Community

Source Code

BenchGecko API

qwen-qwen2-15b-instruct

Specifications

Typetext-generation
ContextN/A
ReleasedJun 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Frequently Asked Questions

Qwen2 1.5B Instruct is an open-source text-generation AI model by Alibaba, released in June 2024. It has an average benchmark score of 24.3.

Benchmarks

IFEval MMLU-PRO BBH (HuggingFace)MUSR MATH Level 5

Alibaba · Provider All Models Compare Models

Qwen2 1.5B Instruct

Frequently Asked Questions

Related Models

Benchmarks

Related Pages