How much does GPT-4.5 cost?

GPT-4.5 pricing information is not yet available.

What benchmarks has GPT-4.5 been tested on?

GPT-4.5 has been evaluated on 13 benchmarks. Top scores: MATH level 5: 78.6, Lech Mazur Writing: 75.6, Fiction.LiveBench: 63.9.

Is GPT-4.5 open source?

No, GPT-4.5 is a proprietary model by OpenAI.

How does GPT-4.5 compare to Gemma 3 27B (free)?

GPT-4.5 has an average score of 35.2 while Gemma 3 27B (free) scores 35.0. GPT-4.5 outperforms Gemma 3 27B (free) overall. See full comparison →

Home/Models/GPT-4.5

GPT-4.5

Name: GPT-4.5
Author: OpenAI

by OpenAI · Released Jan 2024

35.2

avg score

Rank #164

Compare

Better than 30% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text

License

Proprietary

Benchmarks

13 tested

Data updated today

About

Tested on 13 benchmarks with 35.9% average. Top scores: MATH level 5 (78.6%), Lech Mazur Writing (75.6%), Fiction.LiveBench (63.9%).

Capabilities

coding

33.9

#110 globally

reasoning

10.8

#131 globally

math

58.2

#53 globally

knowledge

43.2

#132 globally

Benchmark Scores

Compare All

Tested on 13 benchmarks · Ranked across 4 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

44.9—

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

39.4—

Cybench

Capture-the-flag cybersecurity challenges. Tests vulnerability analysis, reverse engineering, cryptography, and exploitation skills.

17.5—

reasoningCompare reasoning →

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

21.4—

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

10.3—

ARC-AGI-2

ARC-AGI 2, harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.

0.8—

mathCompare math →

MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

78.6—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

37.7—

Quick compare:

vs Gemma 3 27B (free)

vs Mistral Large 2407

vs Mistral Large 2411

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Links

Info

OpenAI Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@OpenAI

BenchGecko API

gpt-4-5

Specifications

Typetext
ContextN/A
ReleasedJan 2024
LicenseProprietary
Statusbenchmark-only

Available On

OpenAITBD

Frequently Asked Questions

GPT-4.5 is a proprietary text AI model by OpenAI, released in January 2024. It has an average benchmark score of 35.2.

Benchmarks

MATH level 5 Lech Mazur Writing Fiction.LiveBench GPQA diamond Aider polyglot

OpenAI · Provider OpenAI · Economy All Models Compare Models Pricing Developers · API

GPT-4.5

Frequently Asked Questions

Related Models

Benchmarks

Related Pages