Compare · ModelsLive · 2 picked · head to head
Grok 4 vs gpt-oss-120b (free)
Side by side · benchmarks, pricing, and signals you can act on.
Winner summary
Grok 4 wins on 1/1 benchmarks
Grok 4 wins 1 of 1 shared benchmarks. Leads in coding.
Category leads
coding·Grok 4
Hype vs Reality
Attention vs performance
Grok 4
#71 by perf·no signal
gpt-oss-120b (free)
#20 by perf·no signal
Vendor risk
Who is behind the model
xAI
$250.0B·Tier 1
OpenAI
$840.0B·Tier 1
Head to head
1 benchmark · 2 models
Grok 4gpt-oss-120b (free)
Aider polyglot
Grok 4 leads by +37.8
Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework.
Grok 4
79.6
gpt-oss-120b (free)
41.8
Full benchmark table
| Benchmark | Grok 4 | gpt-oss-120b (free) |
|---|---|---|
Aider polyglot Aider Polyglot · measures how well AI models can edit code across multiple programming languages using the Aider coding assistant framework. | 79.6 | 41.8 |
Pricing · per 1M tokens · projected $/mo at 10M tokens
| Model | Input | Output | Context | Projected $/mo |
|---|---|---|---|---|
| $3.00 | $15.00 | 256K tokens (~128 books) | $60.00 | |
| $0.00 | $0.00 | 131K tokens (~66 books) | — |