How much does Qwen-1_8B cost?

Qwen-1_8B is open source and can be self-hosted.

What benchmarks has Qwen-1_8B been tested on?

Qwen-1_8B has been evaluated on 6 benchmarks. Top scores: LAMBADA: 58.4, PIQA: 46.6, ARC AI2: 37.6.

How does Qwen-1_8B compare to Qwen2.5 Coder 0.5B Instruct?

Qwen-1_8B has an average score of 15.9 while Qwen2.5 Coder 0.5B Instruct scores 16.1. Qwen2.5 Coder 0.5B Instruct slightly outperforms Qwen-1_8B overall. See full comparison →

Home/Models/Qwen-1_8B

Qwen-1_8B

Name: Qwen-1_8B
Author: Alibaba Qwen

by Alibaba Qwen · Released Jan 2024

Open Source

15.9

avg score

Rank #256

Compare

Better than 7% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text

License

Open Source

Benchmarks

6 tested

Data updated today

About

Tested on 6 benchmarks with 28.7% average. Top scores: LAMBADA (58.4%), PIQA (46.6%), ARC AI2 (37.6%).

Capabilities

reasoning

4.3

#190 globally

math

21.2

#186 globally

knowledge

36.7

#190 globally

Benchmark Scores

Compare All

Tested on 6 benchmarks · Ranked across 3 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

reasoningCompare reasoning →

BBH

BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.

4.3—

mathCompare math →

GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

21.2—

knowledgeCompare knowledge →

LAMBADA

Language modeling benchmark testing ability to predict the last word of passages requiring long-range context understanding.

58.4—

PIQA

Physical Intuition QA. Tests understanding of everyday physical interactions and commonsense physics.

46.6—

ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

37.6—

Quick compare:

vs Qwen2.5 Coder 0.5B Instruct

vs Llama 4 Scout

vs Magistral Small 1.1

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Qwen2.5 Coder 0.5B Instruct

Alibaba

16.1TBD

Llama 4 Scout

Frequently Asked Questions

Qwen-1_8B is an open-source text AI model by Alibaba Qwen, released in January 2024. It has an average benchmark score of 15.9.

Benchmarks

LAMBADA PIQA ARC AI2 GSM8K BBH

Alibaba Qwen · Provider Alibaba Qwen · Economy All Models Compare Models Pricing Developers · API

Qwen-1_8B

Frequently Asked Questions

Related Models

Benchmarks

Related Pages