How much does GPT-5.1-Codex-Max cost?

GPT-5.1-Codex-Max costs $1.25 per million input tokens and $10.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.013 per message.
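The per-message figure follows from the two token rates, but the page does not say how the ~2,000 tokens divide between input and output. Below is a minimal sketch of the arithmetic. It assumes a hypothetical split of 800 input and 1,200 output tokens, chosen only because it reproduces the quoted figure. The function name is illustrative.

```python
INPUT_USD_PER_M = 1.25    # listed input price per 1M tokens
OUTPUT_USD_PER_M = 10.00  # listed output price per 1M tokens


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one message at the listed per-million rates."""
    return (input_tokens * INPUT_USD_PER_M
            + output_tokens * OUTPUT_USD_PER_M) / 1_000_000


# Assumed split of a ~2,000-token message: 800 in, 1,200 out.
print(f"${message_cost(800, 1200):.3f}")
```

An even 1,000/1,000 split would come to about $0.011, so the estimate depends on how output-heavy the message is.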

What benchmarks has GPT-5.1-Codex-Max been tested on?

GPT-5.1-Codex-Max has been evaluated on 8 benchmarks. Top scores: LiveBench — Reasoning: 84.6, LiveBench — Mathematics: 83.7, LiveBench — Coding: 81.4.

Is GPT-5.1-Codex-Max open source?

No, GPT-5.1-Codex-Max is a proprietary model by OpenAI.

How does GPT-5.1-Codex-Max compare to DeepSeek R1 Distill Qwen 14B?

GPT-5.1-Codex-Max has an average score of 85.6, while DeepSeek R1 Distill Qwen 14B scores 85.4, so GPT-5.1-Codex-Max leads overall by 0.2 points.

GPT-5.1-Codex-Max

Name: GPT-5.1-Codex-Max
Price: 1.25 USD per 1M input tokens
Author: OpenAI

by OpenAI · Released Dec 2025

Multimodal

Avg score: 85.6
Rank: #13 (top 5% of all models)
Context: 400K tokens (~200 books)
Input $/1M: $1.25
Output $/1M: $10.00
Type: multimodal
License: Proprietary
Benchmarks: 8 tested

Data updated today
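As a rough check on the "top 5%" label, the rank can be turned into a percentage of the field. This sketch assumes the rank is taken over the same 274 models mentioned in the score distribution elsewhere on this page, which the page does not confirm. The function name is illustrative.

```python
def top_percent(rank: int, total: int) -> float:
    """Share of the field at or above this rank, as a percentage."""
    return rank / total * 100


# Rank #13, assumed to be out of 274 models.
print(f"{top_percent(13, 274):.1f}%")
```

Under that assumption the model sits at roughly 4.7% from the top, which is consistent with the label.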

About

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic...

Tested on 8 benchmarks with 72.0% average. Top scores: LiveBench — Reasoning (84.6%), LiveBench — Mathematics (83.7%), LiveBench — Coding (81.4%).

Looking for similar performance at lower cost?
DeepSeek V4 Pro scores 86.5 (101% as good) at $0.43/1M input · 65% cheaper
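The two percentages in this comparison can be reproduced from the numbers quoted on the page. This is a minimal sketch with illustrative function names. It covers input pricing only, since no output price for DeepSeek V4 Pro is given here.

```python
def relative_score(alt_score: float, base_score: float) -> float:
    """Alternative's score as a percentage of the baseline's score."""
    return alt_score / base_score * 100


def input_savings(alt_price: float, base_price: float) -> float:
    """Percentage saved on input-token price versus the baseline."""
    return (1 - alt_price / base_price) * 100


print(round(relative_score(86.5, 85.6)))    # percent of baseline score
print(round(input_savings(0.43, 1.25), 1))  # percent cheaper on input
```

The score ratio rounds to 101%. The saving works out to 65.6%, which the page appears to round down to 65%.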

Capabilities

coding: 69.0 (#24 globally)
reasoning: 69.7 (#36 globally)
math: 83.7 (#17 globally)
knowledge: 72.0 (#15 globally)
language: 71.3 (#74 globally)

Benchmark Scores

Tested on 8 benchmarks · Ranked across 5 categories

[Chart: score distribution across all 274 models on a 0-100 scale, with this model's position marked]

coding

LiveBench — Coding: 81.4
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

LiveBench — Agentic Coding: 56.7
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.

reasoning

LiveBench — Reasoning: 84.6
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

LiveBench — Data Analysis: 54.9
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

math

LiveBench — Mathematics: 83.7
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
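The page does not state how the per-category scores under Capabilities are derived. One plausible reading is a simple mean of the benchmarks listed in each category. The sketch below tests that assumption for the three categories whose benchmarks appear on this page. The dictionary keys are shortened benchmark names.

```python
from statistics import mean

# Scores copied from the benchmark list, grouped by category.
scores = {
    "coding": {"Coding": 81.4, "Agentic Coding": 56.7},
    "reasoning": {"Reasoning": 84.6, "Data Analysis": 54.9},
    "math": {"Mathematics": 83.7},
}

# Assumption: a category score is the unweighted mean of its benchmarks.
category_means = {cat: mean(vals.values()) for cat, vals in scores.items()}

for cat, avg in category_means.items():
    print(f"{cat}: {avg:.2f}")
```

The means come to about 69.05, 69.75, and 83.70, against listed category scores of 69.0, 69.7, and 83.7. That fits an unweighted mean with the last digit truncated, though other weightings cannot be ruled out from this page alone.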

Score bands: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
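The legend's bands share their boundary values (70 and 85 each appear in two ranges), so any classifier has to pick a convention. This minimal sketch assumes each band includes its lower bound. The function name is illustrative.

```python
def score_band(score: float) -> str:
    """Map a 0-100 score to the legend's band name.

    Assumption: each band includes its lower bound, so 85.0 is
    "Excellent" and 70.0 is "Good". The legend is ambiguous there.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# A few scores that appear on this page.
for s in (85.6, 72.0, 56.7):
    print(s, score_band(s))
```

Under this convention, 85.6 falls in Excellent, 72.0 in Good, and the Agentic Coding score of 56.7 in Average.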

Model Family · OpenAI GPT-5.1

GPT-5.1 · Nov 2025 · score 49.6 · $1.25/M in · 400K ctx · 24 benchmarks
GPT-5.1 Chat · Nov 2025 · $1.25/M in · 128K ctx (-272K)
GPT-5.1-Codex · Nov 2025 · score 68.6 (+68.6) · $1.25/M in · 400K ctx (+272K) · 8 benchmarks
GPT-5.1-Codex-Max · Dec 2025 · score 72.0 (+3.4) · $1.25/M in · 400K ctx · 8 benchmarks
GPT-5.1-Codex-Mini · Nov 2025 · score 60.4 (-11.6) · $0.25/M in (-1) · 400K ctx · 8 benchmarks

Similar Models

DeepSeek R1 Distill Qwen 14B · DeepSeek · 85.4 · TBD
DeepSeek-V2 (MoE-236B, May 2024)

BenchGecko API: gpt-5-1-codex-max

Specifications

Type: multimodal
Context: 400K tokens (~200 books)
Released: Dec 2025
License: Proprietary
Status: Active
Cost / Message: ~$0.013

Available On

OpenAI: $1.25

Frequently Asked Questions

GPT-5.1-Codex-Max is a proprietary multimodal AI model by OpenAI, released in December 2025. It has an average benchmark score of 85.6. Context window: 400K tokens.

Benchmarks

LiveBench — Reasoning, LiveBench — Mathematics, LiveBench — Coding, LiveBench — Language, LiveBench — Overall
