LIVETracking 994 AI models from 267 providers.

Charts·Build live AI market views Open charts Build your own chart

Home/Models/o3 Pro

o3 Pro

by OpenAI · Released Jun 2025

Multimodal

59.9

avg score

Rank #85

Better than 69% of all models

Context

200K tokens (~100 books)

Input $/1M

$20.00

Output $/1M

$80.00

Type

multimodal

License

Proprietary

Benchmarks

8 tested

Data updated today

About

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

Tested on 8 benchmarks with 61.2% average. Top scores: Fiction.LiveBench (97.2%), Lech Mazur Writing (86.3%), Aider polyglot (84.9%).

Looking for similar performance at lower cost?
Qwen3 Max scores 60.4 (101% as good) at $0.78/1M input · 96% cheaper

Capabilities

coding

71.6

#21 globally

reasoning

32.1

#100 globally

knowledge

70.6

#18 globally

Benchmark Scores

Tested on 8 benchmarks · Ranked across 3 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

84.9—

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

58.2—

reasoningCompare reasoning →

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

59.3—

ARC-AGI 2, harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.

4.9—

knowledgeCompare knowledge →

Fiction.LiveBench

LiveBench fiction analysis. Tests literary comprehension and creative text understanding.

97.2—

Lech Mazur Writing

Writing quality evaluation by Lech Mazur. Tests prose quality, coherence, and stylistic ability.

86.3—

Professional Reasoning — Legal

SEAL Pro Reasoning Legal. Tests legal reasoning and case analysis ability.

49.7—

Quick compare:

vs Qwen3.6 Max Preview

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · OpenAI o3

$2.00/M in200Kctx33 benchmarks

o3 Deep ResearchOct 2025

$10.00/M in(+8)200Kctx

o3 MiniJan 2025

$1.10/M in(-8.90)200Kctx17 benchmarks

o3 Mini HighFeb 2025

$1.10/M in200Kctx2 benchmarks

$20.00/M in(+18.90)200Kctx8 benchmarks

See the full o3 family →

Similar Models

Qwen3.6 Max Preview

Links

Info

OpenAI Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

BenchGecko API

o3-pro

Specifications

Typemultimodal
Context200K tokens (~100 books)
ReleasedJun 2025
LicenseProprietary
StatusActive
Cost / Message~$0.120

Available On

OpenAI$20.00

Categories

coding reasoning knowledge

Learn More

context-window transformer tokens

Share & Export

Related Models

Qwen3.6 Max Preview

Frequently Asked Questions

o3 Pro is a proprietary multimodal AI model by OpenAI, released in June 2025. It has an average benchmark score of 59.9. Context window: 200K tokens.

Related Models

Qwen3.6 Max Preview · Alibaba Qwen MiMo-V2-Pro · xiaomi Qwen-14B · Alibaba Qwen Grok 4 · xAI Stable Beluga 2 · Unknown

Benchmarks

Fiction.LiveBench Lech Mazur Writing Aider polyglot ARC-AGI WeirdML

Related Pages

OpenAI · Provider OpenAI · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary