How much does GPT-5 Chat cost?

GPT-5 Chat costs $1.25 per million input tokens and $10.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.013 per message.

What benchmarks has GPT-5 Chat been tested on?

GPT-5 Chat has been evaluated on 7 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1426.6, Aider polyglot: 88.0, HELM — IFEval: 87.5.

Is GPT-5 Chat open source?

No, GPT-5 Chat is a proprietary model by OpenAI.

How does GPT-5 Chat compare to Step 3.5 Flash?

GPT-5 Chat has an average score of 89.0 while Step 3.5 Flash scores 89.5. Step 3.5 Flash slightly outperforms GPT-5 Chat overall. GPT-5 Chat costs $1.25/1M input vs Step 3.5 Flash at $0.09/1M input. See full comparison →

Home/Models/GPT-5 Chat

GPT-5 Chat

Name: GPT-5 Chat
Price: 1.25 USD
Author: OpenAI

by OpenAI · Released Aug 2025

Multimodal

89.0

avg score

Rank #8

Compare

Top 3% of all models

Context

128K tokens (~64 books)

Input $/1M

$1.25

Output $/1M

$10.00

Type

multimodal

License

Proprietary

Benchmarks

7 tested

Data updated today

About

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

Tested on 7 benchmarks with 81.9% average. Top scores: Chatbot Arena Elo — Overall (1426.6%), Aider polyglot (88.0%), HELM — IFEval (87.5%).

Looking for similar performance at lower cost?
Step 3.5 Flash scores 89.5 (101% as good) at $0.09/1M input · 93% cheaper

Capabilities

coding

88.0

#1 globally

reasoning

85.7

#2 globally

math

64.7

#49 globally

knowledge

82.7

#4 globally

language

87.5

#25 globally

Benchmark Scores

Compare All

Tested on 7 benchmarks · Ranked across 6 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

88.0—

reasoningCompare reasoning →

HELM — WildBench

Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.

85.7—

mathCompare math →

HELM — Omni-MATH

Stanford HELM evaluation of mathematical reasoning across diverse problem types.

64.7—

Quick compare:

vs Step 3.5 Flash

vs GPT-5.4 Pro

vs Qwen2.5 72B Instruct Abliterated

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · OpenAI GPT-5

GPT-5Aug 2025

54.4

$1.25/M in400Kctx26 benchmarks

GPT-5 ChatAug 2025

81.9+27.5

$1.25/M in128Kctx(-272K)7 benchmarks

GPT-5 CodexSep 2025

$1.25/M in400Kctx(+272K)

GPT-5 ImageOct 2025

$10.00/M in(+8.75)400Kctx

GPT-5 Image MiniOct 2025

$2.50/M in(-7.50)400Kctx

GPT-5 MiniAug 2025

56.0+56.0

$0.25/M in(-2.25)400Kctx28 benchmarks

GPT-5 NanoAug 2025

45.3-10.7

$0.05/M in(-0.20)400Kctx26 benchmarks

GPT-5 ProOct 2025

43.3-2.0

$15.00/M in(+14.95)400Kctx8 benchmarks

See the full GPT-5 family →

Similar Models

Qwen2.5 72B Instruct Abliterated

HuiHui AI

87.5TBD

Links

Info

OpenAI Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@OpenAI

BenchGecko API

gpt-5-chat

Specifications

Typemultimodal
Context128K tokens (~64 books)
ReleasedAug 2025
LicenseProprietary
StatusActive
Cost / Message~$0.013

Available On

OpenAI$1.25

Frequently Asked Questions

GPT-5 Chat is a proprietary multimodal AI model by OpenAI, released in August 2025. It has an average benchmark score of 89.0. Context window: 128K tokens.

Benchmarks

Chatbot Arena Elo — Overall Aider polyglot HELM — IFEval HELM — MMLU-Pro HELM — WildBench

OpenAI · Provider OpenAI · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

GPT-5 Chat

Frequently Asked Questions

Related Models

Benchmarks

Related Pages