BenchGecko (beta)

Qwen3 235B A22B Thinking 2507

Open source

Developer: Alibaba Qwen · Release date: 2025-07-25

Average score: 55.9
Input price: $0.13 per 1M tokens
Output price: $0.60 per 1M tokens
Context window: 262K tokens (~131 books)
Type: text
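
To make the listed prices concrete, here is a minimal sketch that estimates the cost of a single request at $0.13 per 1M input tokens and $0.60 per 1M output tokens. The helper name `estimate_cost_usd` and the example token counts are illustrative assumptions, not part of this site or of any provider's API, and real billing may differ between providers.

```python
# Listed prices for Qwen3 235B A22B Thinking 2507 (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.13
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: linear cost estimate from the listed prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Made-up request size: 100,000 prompt tokens, 10,000 generated tokens.
    cost = estimate_cost_usd(100_000, 10_000)
    print(f"${cost:.4f}")  # $0.0190
```

Reasoning tokens produced by a thinking model are often billed as output tokens, so the output side may dominate the cost; check the provider's terms before relying on an estimate like this.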

Tested on 24 benchmarks, with an average score of 55.9. Top scores: Chatbot Arena Elo — Overall (1399.8, an Elo rating rather than a percentage), OpenCompass — AIME2025 (90.9%), OpenCompass — IFEval (87.8%).

Benchmark scores

Benchmark	Category	Score
Chatbot Arena Elo — Overall	arena	1399.8
OpenCompass — AIME2025	math	90.9
OpenCompass — IFEval	language	87.8
OTIS Mock AIME 2024-2025	math	86.7
Lech Mazur Writing	knowledge	85.0
OpenCompass — MMLU-Pro	knowledge	83.5
OpenCompass — GPQA-Diamond	knowledge	79.8
Fiction.LiveBench	knowledge	75.0
GPQA diamond	knowledge	73.4
LiveBench — Mathematics	math	73.4
OpenCompass — LiveCodeBenchV6	coding	70.6
LiveBench — Language	language	69.5
LiveBench — Coding	coding	69.0
LiveBench — Reasoning	reasoning	59.4
LiveBench — Overall	knowledge	53.0
LiveBench — Data Analysis	reasoning	52.2
SimpleQA Verified	knowledge	50.1
WeirdML	coding	41.0
LiveBench — IF	language	40.6
OpenCompass — HLE	knowledge	18.5
Chess Puzzles	knowledge	12.0
FrontierMath-2025-02-28-Private	math	8.5
LiveBench — Agentic Coding	coding	6.7
FrontierMath-Tier-4-2025-07-01-Private	math	0.1
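
The 55.9 average can be checked against the table above. A minimal sketch, assuming the average is an unweighted mean that leaves out the Chatbot Arena Elo rating (which is on a different scale from the other scores): the remaining 23 scores average to about 55.9. This is an inference from the numbers, not a documented description of how the site computes its average.

```python
from statistics import mean

# Scores copied from the table above, excluding the Chatbot Arena Elo row.
ARENA_ELO = 1399.8
scores = [
    90.9, 87.8, 86.7, 85.0, 83.5, 79.8, 75.0, 73.4, 73.4, 70.6, 69.5, 69.0,
    59.4, 53.0, 52.2, 50.1, 41.0, 40.6, 18.5, 12.0, 8.5, 6.7, 0.1,
]

# Unweighted mean of the 23 non-Elo scores (assumed method).
average = mean(scores)
print(round(average, 1))  # 55.9

# Including the Elo rating gives a very different figure, so it was
# presumably left out of the published average.
average_with_elo = mean(scores + [ARENA_ELO])
print(round(average_with_elo, 1))  # 111.9
```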

Similar models

Mistral Small 3.1 24B

DeepSeek R1 Distill Qwen 14B

Alibaba Qwen: Qwen 3 timeline

Each entry lists the model, release month, input price per 1M tokens, context window, and number of benchmarks where shown. Figures in parentheses appear to be changes relative to the preceding entry in the list.

Qwen3 14B · Apr 2025 · $0.06/M in · 41K ctx
Qwen3 235B A22B · Apr 2025 · $0.46/M in (+0.40) · 131K ctx (+90K) · 8 benchmarks
Qwen3 235B A22B Instruct 2507 · Jul 2025 · $0.07/M in (-0.38) · 262K ctx (+131K) · 20 benchmarks
Qwen3 235B A22B Thinking 2507 · Jul 2025 · $0.13/M in (+0.06) · 262K ctx · 24 benchmarks
Qwen3 30B A3B · Apr 2025 · $0.08/M in (-0.05) · 41K ctx (-221K) · 1 benchmark
Qwen3 30B A3B Instruct 2507 · Jul 2025 · $0.09/M in (+0.01) · 262K ctx (+221K) · 7 benchmarks
Qwen3 30B A3B Thinking 2507 · Aug 2025 · $0.08/M in (-0.01) · 131K ctx (-131K) · 6 benchmarks
Qwen3 32B · Apr 2025 · $0.08/M in · 41K ctx (-90K) · 8 benchmarks
Qwen3 4B (free) · Apr 2025 · $0.00/M in (-0.08) · 41K ctx
Qwen3 8B · Apr 2025 · $0.05/M in (+0.05) · 41K ctx · 6 benchmarks
Qwen3 Coder 30B A3B Instruct · Jul 2025 · $0.07/M in (+0.02) · 160K ctx (+119K)
Qwen3 Coder 480B A35B · Jul 2025 · $0.22/M in (+0.15) · 262K ctx (+102K)
Qwen3 Coder 480B A35B (free) · Jul 2025 · $0.00/M in (-0.22) · 262K ctx (0K) · 3 benchmarks
Qwen3 Coder Flash · Sep 2025 · $0.20/M in (+0.20) · 1.0M ctx (+738K)
Qwen3 Coder Next · Feb 2026 · $0.15/M in (-0.05) · 262K ctx (-738K) · 3 benchmarks
Qwen3 Coder Plus · Sep 2025 · $0.65/M in (+0.50) · 1.0M ctx (+738K)
Qwen3 Max · Sep 2025 · $0.78/M in (+0.13) · 262K ctx (-738K) · 8 benchmarks
Qwen3 Max Thinking · Feb 2026 · $0.78/M in · 262K ctx · 3 benchmarks
Qwen3 Next 80B A3B Instruct · Sep 2025 · $0.09/M in (-0.69) · 262K ctx · 18 benchmarks
Qwen3 Next 80B A3B Instruct (free) · Sep 2025 · $0.00/M in (-0.09) · 262K ctx · 3 benchmarks
Qwen3 Next 80B A3B Thinking · Sep 2025 · $0.10/M in (+0.10) · 131K ctx (-131K) · 20 benchmarks
Qwen3 VL 235B A22B Instruct · Sep 2025 · $0.20/M in (+0.10) · 262K ctx (+131K) · 1 benchmark
Qwen3 VL 235B A22B Thinking · Sep 2025 · $0.26/M in (+0.06) · 131K ctx (-131K) · 1 benchmark
Qwen3 VL 30B A3B Instruct · Oct 2025 · $0.13/M in (-0.13) · 131K ctx
Qwen3 VL 30B A3B Thinking · Oct 2025 · $0.13/M in · 131K ctx
Qwen3 VL 32B Instruct · Oct 2025 · $0.10/M in (-0.03) · 131K ctx
Qwen3 VL 8B Instruct · Oct 2025 · $0.08/M in (-0.02) · 131K ctx
Qwen3 VL 8B Thinking · Oct 2025 · $0.12/M in (+0.04) · 131K ctx