How much does Qwen2 VL 7B Instruct cost?

Qwen2 VL 7B Instruct is open source and can be self-hosted.

What benchmarks has Qwen2 VL 7B Instruct been tested on?

Qwen2 VL 7B Instruct has been evaluated on 11 benchmarks. Top scores: JSQuAD: 89.9, JCommonsenseQA: 87.8, JNLI: 74.4.

Is Qwen2 VL 7B Instruct open source?

Yes, Qwen2 VL 7B Instruct is open source.

How does Qwen2 VL 7B Instruct compare to GPT-5.1-Codex-Mini?

Qwen2 VL 7B Instruct has an average score of 67.6 while GPT-5.1-Codex-Mini scores 67.4. Qwen2 VL 7B Instruct outperforms GPT-5.1-Codex-Mini overall. See full comparison →

Home/Models/Qwen2 VL 7B Instruct

Qwen2 VL 7B Instruct

Name: Qwen2 VL 7B Instruct
Author: Alibaba

by Alibaba · Released Aug 2024

Open Source

67.6

avg score

Rank #44

Compare

Better than 81% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

image-text-to-text

License

Open Source

Benchmarks

11 tested

Data updated today

About

Qwen image text to text model. 1523K downloads on HuggingFace.

Tested on 11 benchmarks with 47.3% average. Top scores: JSQuAD (89.9%), JCommonsenseQA (87.8%), JNLI (74.4%).

Capabilities

reasoning

13.6

#115 globally

math

19.9

#162 globally

knowledge

21.8

#192 globally

language

67.9

#73 globally

general

35.9

#23 globally

Benchmark Scores

Compare All

Tested on 11 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

13.6—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

19.9—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

34.4—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

9.3—

Quick compare:

vs GPT-5.1-Codex-Mini

vs Grok 3 Beta

vs Gemini 2.5 Pro

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen 2

Qwen2 0.5BMay 2024

7.2

N/AN/Actx6 benchmarks

Qwen2 0.5B InstructJun 2024

6.6-0.6

N/AN/Actx6 benchmarks

Qwen2 1.5B InstructJun 2024

14.1+7.5

N/AN/Actx6 benchmarks

Qwen2 7B InstructJun 2024

50.5+36.4

N/AN/Actx25 benchmarks

Qwen2 VL 2B InstructAug 2024

N/AN/Actx

Qwen2 VL 7B InstructAug 2024

47.3+47.3

N/AN/Actx11 benchmarks

Qwen2 VL 7B Instruct AWQAug 2024

N/AN/Actx

See the full Qwen 2 family →

Similar Models

Links

Info

Research

Documentation

Community

Source Code

BenchGecko API

qwen-qwen2-vl-7b-instruct

Specifications

Typeimage-text-to-text
ContextN/A
ReleasedAug 2024
LicenseOpen Source
StatusActive

Available On

AlibabaTBD

Frequently Asked Questions

Qwen2 VL 7B Instruct is an open-source image-text-to-text AI model by Alibaba, released in August 2024. It has an average benchmark score of 67.6.

Benchmarks

JSQuAD JCommonsenseQA JNLI JMMLU LLM-JP — Overall

Alibaba · Provider All Models Compare Models

Qwen2 VL 7B Instruct

Frequently Asked Questions

Related Models

Benchmarks

Related Pages