GPT
OpenAI's foundation-model family · started in 2018, scaled to frontier dominance with GPT-4 and beyond.
Basic
GPT stands for Generative Pre-trained Transformer. OpenAI published GPT-1 in 2018, GPT-2 in 2019, GPT-3 in 2020, GPT-4 in 2023, and GPT-5 in 2025. Each generation scaled up parameters, training compute, and training data. Modern GPT models (GPT-5 class) are multimodal, support reasoning, and serve as the foundation for ChatGPT's consumer product.
Deep
GPT architecture: decoder-only transformer with causal attention, pretrained on internet-scale text via next-token prediction. Post-training includes SFT, RLHF (later DPO), and safety tuning. GPT-4-class models are widely believed to use mixture-of-experts sparsity · GPT-5 likely continues this direction. OpenAI ships multiple tiers per generation: flagship (GPT-5), efficient variants (GPT-5 mini, GPT-5 nano), and reasoning-specialized (o1, o3, o4-mini). Pricing varies dramatically · frontier tier $10-$60 per million output tokens, efficient tier $0.10-$2.
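The core mechanism above · causal attention, where each token may only attend to earlier tokens · can be sketched in a few lines. This is a toy single-head version with hand-picked vectors and no learned projections, not anything resembling OpenAI's implementation:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def causal_attention(queries, keys, values):
    """Single-head attention where position i may only see positions <= i."""
    d = len(queries[0])
    out = []
    for i, q in enumerate(queries):
        # Scores against keys 0..i only -- this truncation IS the causal mask.
        scores = [sum(qc * kc for qc, kc in zip(q, keys[j])) / math.sqrt(d)
                  for j in range(i + 1)]
        weights = softmax(scores)
        # Weighted sum over the visible values.
        ctx = [sum(w * values[j][c] for j, w in enumerate(weights))
               for c in range(len(values[0]))]
        out.append(ctx)
    return out

# Toy 3-token sequence with 2-dim vectors (illustrative constants, not real weights).
q = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
v = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

ctx = causal_attention(q, k, v)
print(ctx[0])  # [1.0, 2.0] -- position 0 can only attend to itself
```

During next-token pretraining, outputs like `ctx` are projected to vocabulary logits and the model is penalized for mispredicting the actual next token at every position simultaneously · the mask is what makes all positions trainable in parallel without leaking future tokens.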
Expert
GPT-4 is believed to be ~1.8T total parameters with MoE sparsity (estimates vary · OpenAI has not disclosed). Training FLOPs estimated at 2e25 for GPT-4, possibly 5-10× that for GPT-5. Compute scaling followed Chinchilla ratios until GPT-4; post-GPT-4, the recipe shifted to training longer on the same parameter count. Post-training is OpenAI's proprietary differentiator · the data mix, RLHF/DPO recipes, and safety tuning are closely held. Reasoning models in the o-series are RL-tuned on verifiable-reward tasks and use hidden CoT at inference. OpenAI serves GPT models via the API, ChatGPT, Azure OpenAI (Microsoft), and select enterprise partnerships.
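The FLOPs figure above can be sanity-checked with the standard 6ND rule of thumb (~6 FLOPs per parameter per training token). The parameter and token counts below are unconfirmed third-party estimates, not OpenAI figures:

```python
def training_flops(n_active_params, n_tokens):
    """Rule of thumb: total training compute ~ 6 * N * D.
    For MoE models, N is the parameters active per token, not the total."""
    return 6 * n_active_params * n_tokens

def chinchilla_tokens(n_params):
    """Chinchilla heuristic: compute-optimal D ~ 20 tokens per parameter."""
    return 20 * n_params

# Rumored GPT-4 scale (unconfirmed estimates): ~280B active parameters
# (of ~1.8T total) trained on ~13T tokens.
flops = training_flops(2.8e11, 1.3e13)
print(f"{flops:.1e}")  # 2.2e+25 -- consistent with the ~2e25 estimate above
```

Note that `chinchilla_tokens(2.8e11)` gives ~5.6T tokens, so a ~13T-token run is well past the Chinchilla-optimal ratio · which matches the post-GPT-4 shift toward training longer on the same parameter count.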
GPT-5 set the 2026 pricing ceiling · $10/M output for the flagship, dropping to $0.25/M for GPT-5 nano within 6 months of launch.
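The tier gap is easiest to see as arithmetic. Using the prices stated above (the 5M-token workload is a hypothetical example):

```python
def output_cost_usd(n_output_tokens, price_per_million):
    """Cost of generated tokens at a given per-million-token price."""
    return n_output_tokens / 1_000_000 * price_per_million

job = 5_000_000  # hypothetical workload: 5M output tokens

flagship = output_cost_usd(job, 10.00)  # GPT-5 flagship at $10/M
nano = output_cost_usd(job, 0.25)       # GPT-5 nano at $0.25/M
print(flagship, nano)  # 50.0 1.25
```

A 40× price gap within one generation is why routing · flagship for hard queries, nano/mini for the bulk · dominates cost-sensitive deployments.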
Depending on why you're here
- Decoder-only transformer · MoE at frontier tier (unconfirmed for latest)
- o-series uses RL on verifiable rewards for reasoning
- Training FLOPs doubling per generation
- Use GPT-5 nano/mini for cost-sensitive workloads
- GPT-5 full for frontier quality · o3 for hard reasoning
- See /family/gpt for all variants
- OpenAI sets the frontier pricing ceiling · competitors price below
- Azure partnership is OpenAI's distribution moat
- ChatGPT consumer franchise drives most of OpenAI's revenue
- The AI family behind ChatGPT
- Made by OpenAI · powered the AI boom
- Newer versions are smarter and cheaper each year
GPT is the brand that made AI mainstream. Every frontier generation drags the rest of the market along with it.