LIVE268社のプ��バイダーから976のAIモデルを追跡中。

BenchGeckoベータ

モデル976·プロバイダー268·ベンチマーク128·企業71·エージェント165·トップQwen3 VL 235B A22B Instruct · 1415.8%·更新たった今·データポイント2,902·MCPサーバー4,923

ランキング/Qwen3 235B A22B Instruct 2507

Qwen3 235B A22B Instruct 2507

オープンソース

開発元 Alibaba Qwen · リリース日 2025-07-21

48.5

平均スコア

$0.07/1M

入力料金

$0.10/1M

出力料金

262K tokens (~131 books)

コンテキストウィンドウ

text

タイプ

Tested on 20 benchmarks with 48.5% average. Top scores: Chatbot Arena Elo — Overall (1422.6%), OpenCompass — IFEval (88.3%), OpenCompass — MMLU-Pro (79.2%).

ベンチマークスコア

ベンチマーク	カテゴリ	スコア	Bar
Chatbot Arena Elo — Overall	arena	1422.6
OpenCompass — IFEval	language	88.3
OpenCompass — MMLU-Pro	knowledge	79.2
OpenCompass — GPQA-Diamond	knowledge	75.5
LiveBench — Coding	coding	69.6
OpenCompass — AIME2025	math	69.5
LiveBench — Mathematics	math	68.0
LiveBench — Language	language	66.1
Aider polyglot	coding	59.6
LiveBench — Reasoning	reasoning	58.4
Fiction.LiveBench	knowledge	52.9
LiveBench — Overall	knowledge	48.8
LiveBench — Data Analysis	reasoning	44.7
OpenCompass — LiveCodeBenchV6	coding	43.0
WeirdML	coding	38.7
LiveBench — If	language	21.7
LiveBench — Agentic Coding	coding	13.3
OpenCompass — HLE	knowledge	12.3
ARC-AGI	reasoning	11.0
ARC-AGI-2	reasoning	1.3

類似モデル

Qwen2.5 72B Instruct Abliterated

Gemini 2.0 Flash

Google DeepMind

Gemini 3 Flash Preview

Google DeepMind

Stable Beluga 2

Alibaba Qwen Qwen 3 タイムライン

Qwen3 14BApr 2025

$0.06/M in41Kctx

Qwen3 235B A22BApr 2025

$0.46/M in(+0.40)131Kctx(+90K)8 benchmarks

Qwen3 235B A22B Instruct 2507Jul 2025

$0.07/M in(-0.38)262Kctx(+131K)20 benchmarks

Qwen3 235B A22B Thinking 2507Jul 2025

$0.13/M in(+0.06)262Kctx24 benchmarks

Qwen3 30B A3BApr 2025

$0.08/M in(-0.05)41Kctx(-221K)1 benchmark

Qwen3 30B A3B Instruct 2507Jul 2025

$0.09/M in(+0.01)262Kctx(+221K)7 benchmarks

Qwen3 30B A3B Thinking 2507Aug 2025

$0.08/M in(-0.01)131Kctx(-131K)6 benchmarks

Qwen3 32BApr 2025

$0.08/M in41Kctx(-90K)8 benchmarks

Qwen3 4B (free)Apr 2025

$0.00/M in(-0.08)41Kctx

Qwen3 8BApr 2025

$0.05/M in(+0.05)41Kctx6 benchmarks

Qwen3 Coder 30B A3B InstructJul 2025

$0.07/M in(+0.02)160Kctx(+119K)

Qwen3 Coder 480B A35BJul 2025

$0.22/M in(+0.15)262Kctx(+102K)

Qwen3 Coder 480B A35B (free)Jul 2025

$0.00/M in(-0.22)262Kctx(0K)3 benchmarks

Qwen3 Coder FlashSep 2025

$0.20/M in(+0.20)1.0Mctx(+738K)

Qwen3 Coder NextFeb 2026

$0.15/M in(-0.05)262Kctx(-738K)3 benchmarks

Qwen3 Coder PlusSep 2025

$0.65/M in(+0.50)1.0Mctx(+738K)

Qwen3 MaxSep 2025

$0.78/M in(+0.13)262Kctx(-738K)8 benchmarks

Qwen3 Max ThinkingFeb 2026

$0.78/M in262Kctx3 benchmarks

Qwen3 Next 80B A3B InstructSep 2025

$0.09/M in(-0.69)262Kctx18 benchmarks

Qwen3 Next 80B A3B Instruct (free)Sep 2025

$0.00/M in(-0.09)262Kctx3 benchmarks

Qwen3 Next 80B A3B ThinkingSep 2025

$0.10/M in(+0.10)131Kctx(-131K)20 benchmarks

Qwen3 VL 235B A22B InstructSep 2025

$0.20/M in(+0.10)262Kctx(+131K)1 benchmark

Qwen3 VL 235B A22B ThinkingSep 2025

$0.26/M in(+0.06)131Kctx(-131K)1 benchmark

Qwen3 VL 30B A3B InstructOct 2025

$0.13/M in(-0.13)131Kctx

Qwen3 VL 30B A3B ThinkingOct 2025

$0.13/M in131Kctx

Qwen3 VL 32B InstructOct 2025

$0.10/M in(-0.03)131Kctx

Qwen3 VL 8B InstructOct 2025

$0.08/M in(-0.02)131Kctx

Qwen3 VL 8B ThinkingOct 2025

$0.12/M in(+0.04)131Kctx