Qwen3 235B A22B Thinking 2507

by Alibaba Qwen · Released Jul 2025

Open Source
Avg score: 59.4
Rank: #69
Better than 70% of all models
Context: 131K tokens (~66 books)
Input $/1M: $0.15
Output $/1M: $1.50
Type: text
License: Open Source
Benchmarks: 24 tested
Data updated today
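The "Better than 70% of all models" figure can be reproduced from the rank. The sketch below assumes the rank of #69 is taken over the same 233 models mentioned in the score distribution; the page does not confirm this, and the function name is hypothetical.

```python
def percent_better_than(rank: int, total_models: int) -> float:
    """Percentage of models ranked strictly below the given rank."""
    if not 1 <= rank <= total_models:
        raise ValueError("rank must be between 1 and total_models")
    return 100.0 * (total_models - rank) / total_models


# Rank #69 out of an assumed 233 ranked models.
share = percent_better_than(69, 233)
print(round(share))  # 70
```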
About

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

Tested on 24 benchmarks, with a 55.9% average. Top scores: Chatbot Arena Elo — Overall (1399.8, an Elo rating rather than a percentage), OpenCompass — AIME2025 (90.9%), OpenCompass — IFEval (87.8%).
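As a rough illustration of the Mixture-of-Experts figures in the description, the share of parameters active per forward pass can be computed from the nominal 22B and 235B values. Both numbers are rounded, so the result is only approximate.

```python
TOTAL_PARAMS_B = 235.0   # total parameters, in billions (from the description)
ACTIVE_PARAMS_B = 22.0   # parameters activated per forward pass, in billions

# Fraction of the model's weights used for any single forward pass.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
print(f"{active_fraction:.1%}")  # 9.4%
```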

Capabilities
coding: 46.8 (#77 globally)
reasoning: 55.8 (#40 globally)
math: 51.9 (#70 globally)
knowledge: 58.9 (#55 globally)
language: 66.0 (#78 globally)
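For reference, the unweighted mean of the five capability scores listed here comes out to the 55.9 quoted in the About text. Whether the site actually derives its average this way is an assumption; the sketch only shows the arithmetic.

```python
from statistics import mean

# Capability scores as listed on this page.
category_scores = {
    "coding": 46.8,
    "reasoning": 55.8,
    "math": 51.9,
    "knowledge": 58.9,
    "language": 66.0,
}

# Simple unweighted mean across the five listed categories.
average = mean(list(category_scores.values()))
print(round(average, 1))  # 55.9
```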
Benchmark Scores
Tested on 24 benchmarks · Ranked across 6 categories
Score Distribution (all 233 models): chart on a 0 to 100 score axis, with this model's position marked.
OpenCompass — LiveCodeBenchV6

OpenCompass Live Code Bench v6. Fresh competitive programming problems to evaluate code generation without memorization.

70.6
LiveBench — Coding

Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

69.0
WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

41.0
LiveBench — Reasoning

Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

59.4
LiveBench — Data Analysis

Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

52.2
OpenCompass — AIME2025

OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.

90.9
OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

86.7
LiveBench — Mathematics

Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.

73.4
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
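The legend above can be applied to the listed scores with a small helper. This is a sketch: the legend leaves the shared boundaries (70 and 50) ambiguous, so treating each lower bound as inclusive is an assumption, and `score_band` is a hypothetical name.

```python
def score_band(score: float) -> str:
    """Map a 0-100 benchmark score to the page's legend bands."""
    if score >= 85:
        return "Excellent"
    if score >= 70:   # assumption: a score of exactly 70 counts as Good
        return "Good"
    if score >= 50:   # assumption: a score of exactly 50 counts as Average
        return "Average"
    return "Below"


# The eight benchmark scores shown above, in page order.
scores = [70.6, 69.0, 41.0, 59.4, 52.2, 90.9, 86.7, 73.4]
bands = [score_band(s) for s in scores]
print(bands)
```

Under these assumptions, the two AIME-style scores (90.9 and 86.7) fall in the Excellent band and WeirdML (41.0) is the only listed score in the Below band.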
Model Family · Alibaba Qwen Qwen 3
BenchGecko API
qwen3-235b-a22b-thinking-2507
Specifications
  • Type: text
  • Context: 131K tokens (~66 books)
  • Released: Jul 2025
  • License: Open Source
  • Status: Active
  • Cost / Message: ~$0.002
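The page does not say what message size the ~$0.002 cost assumes. As a sketch, the listed prices ($0.15 per 1M input tokens, $1.50 per 1M output tokens) give a figure of that order for a hypothetical message with 1,000 input and 1,000 output tokens; those token counts are an assumption chosen for illustration.

```python
INPUT_USD_PER_M = 0.15    # listed input price per 1M tokens
OUTPUT_USD_PER_M = 1.50   # listed output price per 1M tokens


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD for one request at the listed prices."""
    return (input_tokens * INPUT_USD_PER_M
            + output_tokens * OUTPUT_USD_PER_M) / 1_000_000


# Hypothetical message size: 1,000 tokens in, 1,000 tokens out.
cost = message_cost(1_000, 1_000)
print(f"${cost:.5f}")  # $0.00165
```

Because output tokens cost ten times as much as input tokens here, long reasoning traces from a thinking model would dominate the per-message cost.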
Available On
Alibaba Qwen · $0.15
Qwen3 235B A22B Thinking 2507 is an open-source text AI model by Alibaba Qwen, released in July 2025. It has an average benchmark score of 59.4. Context window: 131K tokens.