Compare · ModelsLive · 2 picked · head to head
Gemma 4 31B vs GPT-5.1-Codex-Max
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
GPT-5.1-Codex-Max wins on 6/8 benchmarks
GPT-5.1-Codex-Max wins 6 of 8 shared benchmarks. Leads in coding · math · knowledge.
Category leads
coding·GPT-5.1-Codex-Maxreasoning·Gemma 4 31Blanguage·Gemma 4 31Bmath·GPT-5.1-Codex-Maxknowledge·GPT-5.1-Codex-Max
Hype vs Reality
Attention vs performance
Gemma 4 31B
#31 by perf·no signal
GPT-5.1-Codex-Max
#10 by perf·no signal
Best value
Gemma 4 31B
18.9x better value than GPT-5.1-Codex-Max
Gemma 4 31B
241.6 pts/$
$0.26/M
GPT-5.1-Codex-Max
12.8 pts/$
$5.63/M
Vendor risk
Who is behind the model
Google DeepMind
$4.00T·Tier 1
OpenAI
$840.0B·Tier 1
Head to head
8 benchmarks · 2 models
Gemma 4 31BGPT-5.1-Codex-Max
LiveBench · Agentic Coding
GPT-5.1-Codex-Max leads by +16.7
Gemma 4 31B
40.0
GPT-5.1-Codex-Max
56.7
LiveBench · Coding
GPT-5.1-Codex-Max leads by +21.0
Gemma 4 31B
60.3
GPT-5.1-Codex-Max
81.4
LiveBench · Data Analysis
Gemma 4 31B leads by +3.9
Gemma 4 31B
58.8
GPT-5.1-Codex-Max
54.9
LiveBench · If
Gemma 4 31B leads by +0.5
Gemma 4 31B
67.6
GPT-5.1-Codex-Max
67.1
LiveBench · Language
GPT-5.1-Codex-Max leads by +4.0
Gemma 4 31B
71.3
GPT-5.1-Codex-Max
75.4
LiveBench · Mathematics
GPT-5.1-Codex-Max leads by +9.7
Gemma 4 31B
73.9
GPT-5.1-Codex-Max
83.7
LiveBench · Overall
GPT-5.1-Codex-Max leads by +10.3
Gemma 4 31B
61.6
GPT-5.1-Codex-Max
72.0
LiveBench · Reasoning
GPT-5.1-Codex-Max leads by +25.1
Gemma 4 31B
59.4
GPT-5.1-Codex-Max
84.6
Full benchmark table
| Benchmark | Gemma 4 31B | GPT-5.1-Codex-Max |
|---|---|---|
LiveBench · Agentic Coding | 40.0 | 56.7 |
LiveBench · Coding | 60.3 | 81.4 |
LiveBench · Data Analysis | 58.8 | 54.9 |
LiveBench · If | 67.6 | 67.1 |
LiveBench · Language | 71.3 | 75.4 |
LiveBench · Mathematics | 73.9 | 83.7 |
LiveBench · Overall | 61.6 | 72.0 |
LiveBench · Reasoning | 59.4 | 84.6 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $0.13 | $0.38 | 262K tokens (~131 books) | $1.93 | |
| $1.25 | $10.00 | 400K tokens (~200 books) | $34.38 |
People also compared
GPT-5.1-Codex-Max vs GPT-5 ChatClaude Mythos Preview vs GPT-5.1-Codex-MaxGPT-5.1-Codex-Max vs Qwen3.5 397B A17BDeepSeek V3.2 Speciale vs GPT-5.1-Codex-MaxClaude Instant vs GPT-5.1-Codex-MaxGPT-5.1-Codex-Max vs Step 3.5 FlashDeepSeek-V2 (MoE-236B, May 2024) vs GPT-5.1-Codex-MaxGPT-5.1-Codex-Max vs MiMo-V2-Flash