Compare · ModelsLive · 2 picked · head to head
Qwen3 Max vs Gemini 3 Flash Preview
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Gemini 3 Flash Preview wins on 3/4 benchmarks
Gemini 3 Flash Preview wins 3 of 4 shared benchmarks. Leads in knowledge · math.
Category leads
knowledge·Gemini 3 Flash Previewmath·Gemini 3 Flash Preview
Hype vs Reality
Attention vs performance
Qwen3 Max
#49 by perf·no signal
Gemini 3 Flash Preview
#98 by perf·no signal
Best value
Gemini 3 Flash Preview
1.1x better value than Qwen3 Max
Qwen3 Max
24.9 pts/$
$2.34/M
Gemini 3 Flash Preview
28.1 pts/$
$1.75/M
Vendor risk
Who is behind the model
Alibaba (Qwen)
$293.0B·Tier 1
Google DeepMind
$4.00T·Tier 1
Head to head
4 benchmarks · 2 models
Qwen3 MaxGemini 3 Flash Preview
Chess Puzzles
Gemini 3 Flash Preview leads by +34.0
Chess Puzzles · tests strategic and tactical reasoning by having models solve chess puzzle positions, evaluating lookahead and pattern recognition abilities.
Qwen3 Max
4.0
Gemini 3 Flash Preview
38.0
GPQA diamond
Gemini 3 Flash Preview leads by +14.1
Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs.
Qwen3 Max
63.5
Gemini 3 Flash Preview
77.6
OTIS Mock AIME 2024-2025
Gemini 3 Flash Preview leads by +19.5
OTIS Mock AIME 2024-2025 · simulated American Invitational Mathematics Examination problems testing advanced problem-solving skills.
Qwen3 Max
73.3
Gemini 3 Flash Preview
92.8
SimpleQA Verified
Qwen3 Max leads by +0.1
SimpleQA Verified · short factual questions with verified answers, measuring factual accuracy and the tendency to hallucinate or provide incorrect information.
Qwen3 Max
67.5
Gemini 3 Flash Preview
67.4
Full benchmark table
| Benchmark | Qwen3 Max | Gemini 3 Flash Preview |
|---|---|---|
Chess Puzzles Chess Puzzles · tests strategic and tactical reasoning by having models solve chess puzzle positions, evaluating lookahead and pattern recognition abilities. | 4.0 | 38.0 |
GPQA diamond Graduate-Level Google-Proof QA (Diamond set) · expert-crafted questions in physics, biology, and chemistry that are difficult even for domain PhDs. | 63.5 | 77.6 |
OTIS Mock AIME 2024-2025 OTIS Mock AIME 2024-2025 · simulated American Invitational Mathematics Examination problems testing advanced problem-solving skills. | 73.3 | 92.8 |
SimpleQA Verified SimpleQA Verified · short factual questions with verified answers, measuring factual accuracy and the tendency to hallucinate or provide incorrect information. | 67.5 | 67.4 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.78 | $3.90 | 262K tokens (~131 books) | $15.60 | |
| $0.50 | $3.00 | 1.0M tokens (~524 books) | $11.25 |
People also compared