测试版
排行榜/Qwen3 235B A22B Instruct 2507
Alibaba Qwen logo

Qwen3 235B A22B Instruct 2507

开源

来自 Alibaba Qwen · 发布于 2025-07-21

48.5
平均分
$0.07/1M
输入价格
$0.10/1M
输出价格
262K tokens (~131 books)
上下文窗口
text
类型

Tested on 20 benchmarks with 48.5% average. Top scores: Chatbot Arena Elo — Overall (1422.6%), OpenCompass — IFEval (88.3%), OpenCompass — MMLU-Pro (79.2%).

基准测试类别分数Bar
Chatbot Arena Elo — Overallarena1422.6
OpenCompass — IFEvallanguage88.3
OpenCompass — MMLU-Proknowledge79.2
OpenCompass — GPQA-Diamondknowledge75.5
LiveBench — Codingcoding69.6
OpenCompass — AIME2025math69.5
LiveBench — Mathematicsmath68.0
LiveBench — Languagelanguage66.1
Aider polyglotcoding59.6
LiveBench — Reasoningreasoning58.4
Fiction.LiveBenchknowledge52.9
LiveBench — Overallknowledge48.8
LiveBench — Data Analysisreasoning44.7
OpenCompass — LiveCodeBenchV6coding43.0
WeirdMLcoding38.7
LiveBench — Iflanguage21.7
LiveBench — Agentic Codingcoding13.3
OpenCompass — HLEknowledge12.3
ARC-AGIreasoning11.0
ARC-AGI-2reasoning1.3