GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o); it accepts both text and image inputs and produces text outputs. As OpenAI's most advanced small model, it is many multiples more affordable...
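Since the model takes text and image inputs in the same request, a minimal sketch of the request payload may help. The message schema below follows OpenAI's published Chat Completions format for multimodal input; the image URL is a placeholder, not a real asset.

```python
# Sketch of a multimodal chat request payload (OpenAI Chat Completions schema).
# The image URL is a placeholder; no network call is made here.
payload = {
    "model": "gpt-4o-mini",
    "messages": [
        {
            "role": "user",
            "content": [
                # Text and image parts ride in the same user message.
                {"type": "text", "text": "What is shown in this image?"},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.png"},
                },
            ],
        }
    ],
}

print(payload["model"])
```

The reply from the model is text only, matching the text-output limitation noted above.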
Tested on 20 benchmarks with a 43.2% average. Top scores: Chatbot Arena Elo — Overall (1317.2), GSM8K (91.3%), HELM — WildBench (79.1%).
Llama 3.1 8B Instruct scores 34.3 (103% as good) at $0.02/1M input tokens · 87% cheaper
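The "87% cheaper" figure can be checked with simple arithmetic. The Llama 3.1 8B input price ($0.02/1M tokens) comes from the comparison above; the GPT-4o mini input price of $0.15/1M tokens is an assumption based on OpenAI's published pricing, not a figure from this page.

```python
# Verifying the "87% cheaper" claim from the comparison above.
gpt4o_mini_input = 0.15  # $ per 1M input tokens (assumed, from OpenAI pricing)
llama_8b_input = 0.02    # $ per 1M input tokens (from the comparison)

# Relative savings: (0.15 - 0.02) / 0.15 ≈ 0.867
savings = (gpt4o_mini_input - llama_8b_input) / gpt4o_mini_input
print(f"{savings:.0%}")  # → 87%
```

The rounded result matches the 87% shown in the comparison.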
Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.
Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.
Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.
ARC-AGI 2, the harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.
Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.
Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.
Stanford HELM evaluation of mathematical reasoning across diverse problem types.
- Type: multimodal
- Context: 128K tokens (~96,000 words)
- Released: Jul 2024
- License: Proprietary
- Status: Active
- Cost / Message: ~$0.001
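The ~$0.001 per-message figure can be sanity-checked with a rough calculation. The token counts and the $0.15/$0.60 per-1M input/output prices below are assumptions for illustration, not figures from this page.

```python
# Rough sanity check of the ~$0.001 cost-per-message figure.
# Prices and token counts are assumptions, not from this page.
input_price = 0.15 / 1_000_000   # $ per input token (assumed)
output_price = 0.60 / 1_000_000  # $ per output token (assumed)

# A typical chat turn: ~1,000 tokens in, ~1,000 tokens out (assumed).
cost = 1_000 * input_price + 1_000 * output_price
print(f"${cost:.4f}")
```

Under these assumptions a turn costs well under a tenth of a cent, consistent with the ~$0.001 figure above.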