实时正在追踪来自268家提供商的976个AI模型。

BenchGecko测试版

模型976·提供商268·基准测试128·公司71·智能体165·榜首Qwen3 VL 235B A22B Instruct · 1415.8%·已更新刚刚·数据点2,902·MCP服务器4,923

排行榜/Qwen3 235B A22B Instruct 2507

Qwen3 235B A22B Instruct 2507

开源

来自 Alibaba Qwen · 发布于 2025-07-21

48.5

平均分

$0.07/1M

输入价格

$0.10/1M

输出价格

262K tokens (~131 books)

上下文窗口

text

类型

Tested on 20 benchmarks with 48.5% average. Top scores: Chatbot Arena Elo — Overall (1422.6%), OpenCompass — IFEval (88.3%), OpenCompass — MMLU-Pro (79.2%).

基准测试分数

基准测试	类别	分数	Bar
Chatbot Arena Elo — Overall	arena	1422.6
OpenCompass — IFEval	language	88.3
OpenCompass — MMLU-Pro	knowledge	79.2
OpenCompass — GPQA-Diamond	knowledge	75.5
LiveBench — Coding	coding	69.6
OpenCompass — AIME2025	math	69.5
LiveBench — Mathematics	math	68.0
LiveBench — Language	language	66.1
Aider polyglot	coding	59.6
LiveBench — Reasoning	reasoning	58.4
Fiction.LiveBench	knowledge	52.9
LiveBench — Overall	knowledge	48.8
LiveBench — Data Analysis	reasoning	44.7
OpenCompass — LiveCodeBenchV6	coding	43.0
WeirdML	coding	38.7
LiveBench — If	language	21.7
LiveBench — Agentic Coding	coding	13.3
OpenCompass — HLE	knowledge	12.3
ARC-AGI	reasoning	11.0
ARC-AGI-2	reasoning	1.3

相似模型

Qwen2.5 72B Instruct Abliterated

Gemini 2.0 Flash

Google DeepMind

Gemini 3 Flash Preview

Google DeepMind

Stable Beluga 2

Alibaba Qwen Qwen 3 时间线

Qwen3 14BApr 2025

$0.06/M in41Kctx

Qwen3 235B A22BApr 2025

$0.46/M in(+0.40)131Kctx(+90K)8 benchmarks

Qwen3 235B A22B Instruct 2507Jul 2025

$0.07/M in(-0.38)262Kctx(+131K)20 benchmarks

Qwen3 235B A22B Thinking 2507Jul 2025

$0.13/M in(+0.06)262Kctx24 benchmarks

Qwen3 30B A3BApr 2025

$0.08/M in(-0.05)41Kctx(-221K)1 benchmark

Qwen3 30B A3B Instruct 2507Jul 2025

$0.09/M in(+0.01)262Kctx(+221K)7 benchmarks

Qwen3 30B A3B Thinking 2507Aug 2025

$0.08/M in(-0.01)131Kctx(-131K)6 benchmarks

Qwen3 32BApr 2025

$0.08/M in41Kctx(-90K)8 benchmarks

Qwen3 4B (free)Apr 2025

$0.00/M in(-0.08)41Kctx

Qwen3 8BApr 2025

$0.05/M in(+0.05)41Kctx6 benchmarks

Qwen3 Coder 30B A3B InstructJul 2025

$0.07/M in(+0.02)160Kctx(+119K)

Qwen3 Coder 480B A35BJul 2025

$0.22/M in(+0.15)262Kctx(+102K)

Qwen3 Coder 480B A35B (free)Jul 2025

$0.00/M in(-0.22)262Kctx(0K)3 benchmarks

Qwen3 Coder FlashSep 2025

$0.20/M in(+0.20)1.0Mctx(+738K)

Qwen3 Coder NextFeb 2026

$0.15/M in(-0.05)262Kctx(-738K)3 benchmarks

Qwen3 Coder PlusSep 2025

$0.65/M in(+0.50)1.0Mctx(+738K)

Qwen3 MaxSep 2025

$0.78/M in(+0.13)262Kctx(-738K)8 benchmarks

Qwen3 Max ThinkingFeb 2026

$0.78/M in262Kctx3 benchmarks

Qwen3 Next 80B A3B InstructSep 2025

$0.09/M in(-0.69)262Kctx18 benchmarks

Qwen3 Next 80B A3B Instruct (free)Sep 2025

$0.00/M in(-0.09)262Kctx3 benchmarks

Qwen3 Next 80B A3B ThinkingSep 2025

$0.10/M in(+0.10)131Kctx(-131K)20 benchmarks

Qwen3 VL 235B A22B InstructSep 2025

$0.20/M in(+0.10)262Kctx(+131K)1 benchmark

Qwen3 VL 235B A22B ThinkingSep 2025

$0.26/M in(+0.06)131Kctx(-131K)1 benchmark

Qwen3 VL 30B A3B InstructOct 2025

$0.13/M in(-0.13)131Kctx

Qwen3 VL 30B A3B ThinkingOct 2025

$0.13/M in131Kctx

Qwen3 VL 32B InstructOct 2025

$0.10/M in(-0.03)131Kctx

Qwen3 VL 8B InstructOct 2025

$0.08/M in(-0.02)131Kctx

Qwen3 VL 8B ThinkingOct 2025

$0.12/M in(+0.04)131Kctx