How much does Qwen3 235B A22B Thinking 2507 cost?

Qwen3 235B A22B Thinking 2507 costs $0.10 per million input tokens and $0.10 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.000 per message.

What benchmarks has Qwen3 235B A22B Thinking 2507 been tested on?

Qwen3 235B A22B Thinking 2507 has been evaluated on 24 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1399.2, OpenCompass — AIME2025: 90.9, OpenCompass — IFEval: 87.8.

Is Qwen3 235B A22B Thinking 2507 open source?

Yes, Qwen3 235B A22B Thinking 2507 is open source.

How does Qwen3 235B A22B Thinking 2507 compare to Qwen2-72B?

Qwen3 235B A22B Thinking 2507 has an average score of 58.4 while Qwen2-72B scores 58.4. Qwen2-72B slightly outperforms Qwen3 235B A22B Thinking 2507 overall. See full comparison →

Home/Models/Qwen3 235B A22B Thinking 2507

Qwen3 235B A22B Thinking 2507

Name: Qwen3 235B A22B Thinking 2507
Price: 0.1 USD
Author: Alibaba Qwen

by Alibaba Qwen · Released Jul 2025

Open Source

58.4

avg score

Rank #95

Compare

Better than 65% of all models

Context

262K tokens (~131 books)

Input $/1M

$0.10

Output $/1M

$0.10

Type

text

License

Open Source

Benchmarks

24 tested

Data updated today

About

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

Tested on 24 benchmarks with 55.9% average. Top scores: Chatbot Arena Elo — Overall (1399.2%), OpenCompass — AIME2025 (90.9%), OpenCompass — IFEval (87.8%).

Capabilities

coding

46.8

#96 globally

reasoning

55.8

#55 globally

math

51.9

#84 globally

knowledge

58.9

#69 globally

language

66.0

#92 globally

Benchmark Scores

Compare All

Tested on 24 benchmarks · Ranked across 6 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

OpenCompass — LiveCodeBenchV6

OpenCompass Live Code Bench v6. Fresh competitive programming problems to evaluate code generation without memorization.

70.6—

LiveBench — Coding

Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

69.0—

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

41.0—

reasoningCompare reasoning →

LiveBench — Reasoning

Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

59.4—

LiveBench — Data Analysis

Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

52.2—

mathCompare math →

OpenCompass — AIME2025

OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.

90.9—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

86.7—

LiveBench — Mathematics

Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.

73.4—

Quick compare:

vs Qwen2-72B

vs DeepSeek V3

vs Kimi K2.5

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Alibaba Qwen Qwen 3

Qwen3 14BApr 2025

$0.06/M in41Kctx

Qwen3 235B A22BApr 2025

56.4+56.4

$0.46/M in(+0.40)131Kctx(+90K)8 benchmarks

Qwen3 235B A22B Instruct 2507Jul 2025

48.5-7.9

$0.07/M in(-0.38)262Kctx(+131K)20 benchmarks

Qwen3 235B A22B Thinking 2507Jul 2025

55.9+7.4

$0.15/M in(+0.08)131Kctx(-131K)24 benchmarks

Qwen3 30B A3BApr 2025

$0.09/M in(-0.06)41Kctx(-90K)1 benchmark

Qwen3 30B A3B Instruct 2507Jul 2025

55.3+55.3

$0.09/M in262Kctx(+221K)7 benchmarks

Qwen3 30B A3B Thinking 2507Aug 2025

66.2+10.9

$0.08/M in(-0.01)131Kctx(-131K)6 benchmarks

Qwen3 32BApr 2025

58.2-8.0

$0.08/M in41Kctx(-90K)8 benchmarks

Qwen3 4B (free)Apr 2025

$0.00/M in(-0.08)41Kctx

Qwen3 8BApr 2025

56.5+56.5

$0.05/M in(+0.05)41Kctx6 benchmarks

Qwen3 Coder 30B A3B InstructJul 2025

$0.07/M in(+0.02)160Kctx(+119K)

Qwen3 Coder 480B A35BJul 2025

$0.22/M in(+0.15)262Kctx(+102K)

Qwen3 Coder 480B A35B (free)Jul 2025

$0.00/M in(-0.22)262Kctx(0K)3 benchmarks

Qwen3 Coder FlashSep 2025

$0.20/M in(+0.20)1.0Mctx(+738K)

Qwen3 Coder NextFeb 2026

$0.12/M in(-0.08)262Kctx(-738K)3 benchmarks

Qwen3 Coder PlusSep 2025

$0.65/M in(+0.53)1.0Mctx(+738K)

Qwen3 MaxSep 2025

58.3+58.3

$0.78/M in(+0.13)262Kctx(-738K)8 benchmarks

Qwen3 Max ThinkingFeb 2026

$0.78/M in262Kctx3 benchmarks

Qwen3 Next 80B A3B InstructSep 2025

54.4+54.4

$0.09/M in(-0.69)262Kctx18 benchmarks

Qwen3 Next 80B A3B Instruct (free)Sep 2025

$0.00/M in(-0.09)262Kctx3 benchmarks

Qwen3 Next 80B A3B ThinkingSep 2025

61.6+61.6

$0.10/M in(+0.10)131Kctx(-131K)20 benchmarks

Qwen3 VL 235B A22B InstructSep 2025

$0.20/M in(+0.10)262Kctx(+131K)1 benchmark

Qwen3 VL 235B A22B ThinkingSep 2025

$0.26/M in(+0.06)131Kctx(-131K)1 benchmark

Qwen3 VL 30B A3B InstructOct 2025

$0.13/M in(-0.13)131Kctx

Qwen3 VL 30B A3B ThinkingOct 2025

$0.13/M in131Kctx

Qwen3 VL 32B InstructOct 2025

$0.10/M in(-0.03)131Kctx

Qwen3 VL 8B InstructOct 2025

$0.08/M in(-0.02)131Kctx

Qwen3 VL 8B ThinkingOct 2025

$0.12/M in(+0.04)131Kctx

See the full Qwen 3 family →

Similar Models

Links

Info

Alibaba Qwen Pricing explorer Developers · API

Research

Documentation

Community

Source Code

BenchGecko API

qwen3-235b-a22b-thinking-2507

Specifications

Typetext
Context262K tokens (~131 books)
ReleasedJul 2025
LicenseOpen Source
StatusActive
Cost / Message~$0.000

Available On

Alibaba Qwen$0.10

Frequently Asked Questions

Qwen3 235B A22B Thinking 2507 is an open-source text AI model by Alibaba Qwen, released in July 2025. It has an average benchmark score of 58.4. Context window: 262K tokens.

Benchmarks

Chatbot Arena Elo — Overall OpenCompass — AIME2025 OpenCompass — IFEval OTIS Mock AIME 2024-2025 Lech Mazur Writing

Alibaba Qwen · Provider Alibaba Qwen · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

Qwen3 235B A22B Thinking 2507

Frequently Asked Questions

Related Models

Benchmarks

Related Pages