Compare · ModelsLive · 2 picked · head to head

Gemma 4 31B vs GPT-5.1-Codex-Max

Side by side · benchmarks, pricing, and signals you can act on.

CiteAdd another

Winner summary

GPT-5.1-Codex-Max wins on 6/8 benchmarks

GPT-5.1-Codex-Max wins 6 of 8 shared benchmarks. Leads in coding · math · knowledge.

Category leads

coding·GPT-5.1-Codex-Maxreasoning·Gemma 4 31Blanguage·Gemma 4 31Bmath·GPT-5.1-Codex-Maxknowledge·GPT-5.1-Codex-Max

Hype vs Reality

Attention vs performance

Gemma 4 31B

#33 by perf·no signal

QUIET

GPT-5.1-Codex-Max

#12 by perf·no signal

QUIET

See full mindshare →

Best value

Gemma 4 31B

18.9x better value than GPT-5.1-Codex-Max

Gemma 4 31B

241.6 pts/$

$0.26/M

GPT-5.1-Codex-Max

12.8 pts/$

$5.63/M

Explore pricing →

Vendor risk

Who is behind the model

Google DeepMind

$4.00T·Tier 1

Low risk

OpenAI

$840.0B·Tier 1

Medium risk

See the AI economy →

Head to head

8 benchmarks · 2 models

Gemma 4 31BGPT-5.1-Codex-Max

LiveBench · Agentic Coding

GPT-5.1-Codex-Max leads by +16.7

Gemma 4 31B

40.0

GPT-5.1-Codex-Max

56.7

LiveBench · Coding

GPT-5.1-Codex-Max leads by +21.0

Gemma 4 31B

60.3

GPT-5.1-Codex-Max

81.4

LiveBench · Data Analysis

Gemma 4 31B leads by +3.9

Gemma 4 31B

58.8

GPT-5.1-Codex-Max

54.9

LiveBench · If

Gemma 4 31B leads by +0.5

Gemma 4 31B

67.6

GPT-5.1-Codex-Max

67.1

LiveBench · Language

GPT-5.1-Codex-Max leads by +4.0

Gemma 4 31B

71.3

GPT-5.1-Codex-Max

75.4

LiveBench · Mathematics

GPT-5.1-Codex-Max leads by +9.7

Gemma 4 31B

73.9

GPT-5.1-Codex-Max

83.7

LiveBench · Overall

GPT-5.1-Codex-Max leads by +10.3

Gemma 4 31B

61.6

GPT-5.1-Codex-Max

72.0

LiveBench · Reasoning

GPT-5.1-Codex-Max leads by +25.1

Gemma 4 31B

59.4

GPT-5.1-Codex-Max

84.6

Full benchmark table

Benchmark	Gemma 4 31B	GPT-5.1-Codex-Max
LiveBench · Agentic Coding	40.0	56.7
LiveBench · Coding	60.3	81.4
LiveBench · Data Analysis	58.8	54.9
LiveBench · If	67.6	67.1
LiveBench · Language	71.3	75.4
LiveBench · Mathematics	73.9	83.7
LiveBench · Overall	61.6	72.0
LiveBench · Reasoning	59.4	84.6

Pricing · per 1M tokens · projected $/mo at 10M tokens

Model	Input	Output	Context	Projected $/mo
Gemma 4 31B	$0.13	$0.38	262K tokens (~131 books)	$1.93
GPT-5.1-Codex-Max	$1.25	$10.00	400K tokens (~200 books)	$34.38

People also compared

GPT-5.1-Codex-Max vs GPT-5.5 Pro GPT-5.1-Codex-Max vs GPT-5.5 GPT-5.1-Codex-Max vs GPT-5 Chat Claude Mythos Preview vs GPT-5.1-Codex-Max GPT-5.1-Codex-Max vs Qwen3.5 397B A17B DeepSeek V3.2 Speciale vs GPT-5.1-Codex-Max Claude Instant vs GPT-5.1-Codex-Max Gemma 4 31B vs GPT-5.5 Pro