How much does GPT-5.1-Codex cost?

GPT-5.1-Codex costs $1.25 per million input tokens and $10.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.013 per message.
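
The per-message figure depends on how the ~2,000 tokens split between input and output, which the page does not state. Below is a minimal sketch of the arithmetic. The function name `message_cost` and the 800 / 1,200 split are assumptions chosen for illustration; that split happens to land on roughly $0.013.

```python
# Listed prices in USD per one million tokens.
INPUT_PRICE_PER_M = 1.25
OUTPUT_PRICE_PER_M = 10.00


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one message at the listed prices."""
    input_cost = input_tokens * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost + output_cost


# Assumed split (not from the source): 800 input + 1,200 output tokens.
print(f"assumed 800/1200 split: ${message_cost(800, 1200):.4f}")

# An even split is cheaper, because output tokens cost 8x more than input.
print(f"even 1000/1000 split:   ${message_cost(1000, 1000):.5f}")
```

Because output tokens dominate the bill, the same 2,000-token message can cost noticeably more or less than the quoted estimate depending on how much of it is generated text.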

What benchmarks has GPT-5.1-Codex been tested on?

GPT-5.1-Codex has been evaluated on 8 benchmarks. Top scores: LiveBench — Reasoning: 82.0, LiveBench — Mathematics: 79.6, LiveBench — Coding: 71.8.

Is GPT-5.1-Codex open source?

No, GPT-5.1-Codex is a proprietary model by OpenAI.

How does GPT-5.1-Codex compare to Grok Build 0.1?

GPT-5.1-Codex has an average score of 77.7, narrowly ahead of Grok Build 0.1 at 77.1. On input pricing, GPT-5.1-Codex costs $1.25 per million tokens versus $1.00 per million for Grok Build 0.1.

GPT-5.1-Codex

Name: GPT-5.1-Codex
Price: 1.25 USD per 1M input tokens
Author: OpenAI

by OpenAI · Released Nov 2025

Multimodal

Average score: 77.7
Rank: #27 (better than 90% of all models)
Context: 400K tokens (~200 books)
Input $/1M: $1.25
Output $/1M: $10.00
Type: multimodal
License: Proprietary
Benchmarks: 8 tested

Data updated today

About

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

Tested on 8 benchmarks with 68.6% average. Top scores: LiveBench — Reasoning (82.0%), LiveBench — Mathematics (79.6%), LiveBench — Coding (71.8%).

Looking for similar performance at lower cost?
Grok Build 0.1 scores 77.1 (99% as good) at $1.00/1M input · 20% cheaper
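
The "99% as good" and "20% cheaper" figures follow from the scores and input prices quoted on this page. The sketch below shows the arithmetic; the helper names `relative_score` and `price_saving` are hypothetical and not part of any BenchGecko API.

```python
def relative_score(candidate: float, reference: float) -> float:
    """Candidate's score as a percentage of the reference score."""
    return candidate / reference * 100


def price_saving(candidate_price: float, reference_price: float) -> float:
    """Percentage by which the candidate is cheaper than the reference."""
    return (reference_price - candidate_price) / reference_price * 100


# Grok Build 0.1 (77.1, $1.00/1M input) vs GPT-5.1-Codex (77.7, $1.25/1M input).
print(round(relative_score(77.1, 77.7)))  # 99
print(round(price_saving(1.00, 1.25)))    # 20
```

Note that the saving here covers input tokens only; the page does not list an output price for Grok Build 0.1.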

Capabilities

coding: 62.6 (#42 globally)
reasoning: 71.4 (#34 globally)
math: 79.6 (#24 globally)
knowledge: 68.6 (#25 globally)
language: 66.4 (#91 globally)

Benchmark Scores

Tested on 8 benchmarks · Ranked across 5 categories

[Chart: score distribution across all 274 models on a 0 to 100 axis, with this model's position marked]

coding

LiveBench — Coding: 71.8
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

LiveBench — Agentic Coding: 53.3
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.

reasoning

LiveBench — Reasoning: 82.0
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

LiveBench — Data Analysis: 60.8
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

math

LiveBench — Mathematics: 79.6
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
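
The page does not say how the category scores under Capabilities are derived. For the three categories whose benchmarks are listed here, the figures are consistent with a plain mean of the benchmark scores, and the sketch below checks that arithmetic. Treat both the averaging rule and the half-up rounding as assumptions rather than a documented method.

```python
from decimal import Decimal, ROUND_HALF_UP

# Benchmark scores as listed on the page, grouped by category.
BENCHMARKS = {
    "coding": ["71.8", "53.3"],
    "reasoning": ["82.0", "60.8"],
    "math": ["79.6"],
}


def category_mean(scores: list[str]) -> Decimal:
    """Mean of the scores, rounded half-up to one decimal place."""
    total = sum(Decimal(s) for s in scores)
    mean = total / Decimal(len(scores))
    return mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


for category, scores in BENCHMARKS.items():
    print(category, category_mean(scores))
```

`Decimal` is used instead of `float` so that the coding mean of exactly 62.55 rounds predictably. The knowledge and language categories cannot be checked this way because their individual benchmark scores are not shown.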

Score bands: Excellent (85+) · Good (70-85) · Average (50-70) · Below (<50)

Model Family · OpenAI GPT-5.1

GPT-5.1 · Nov 2025 · score 49.6 · $1.25/M in · 400K ctx · 24 benchmarks
GPT-5.1 Chat · Nov 2025 · $1.25/M in · 128K ctx (-272K)
GPT-5.1-Codex · Nov 2025 · score 68.6 (+68.6) · $1.25/M in · 400K ctx (+272K) · 8 benchmarks
GPT-5.1-Codex-Max · Dec 2025 · score 72.0 (+3.4) · $1.25/M in · 400K ctx · 8 benchmarks
GPT-5.1-Codex-Mini · Nov 2025 · score 60.4 (-11.6) · $0.25/M in (-1) · 400K ctx · 8 benchmarks
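
The signed figures next to each family member's score and price (for example +3.4 and -11.6) are not explained on the page. They are consistent with a difference from the model listed immediately before, which the sketch below checks for the three Codex entries. This interpretation is an assumption, and the `FAMILY` list and `deltas` helper are hypothetical names used only for illustration.

```python
# (name, score, input price in USD per 1M tokens), in the order listed.
FAMILY = [
    ("GPT-5.1-Codex", 68.6, 1.25),
    ("GPT-5.1-Codex-Max", 72.0, 1.25),
    ("GPT-5.1-Codex-Mini", 60.4, 0.25),
]


def deltas(rows):
    """Score and price change of each row relative to the row before it."""
    result = []
    for previous, current in zip(rows, rows[1:]):
        score_delta = round(current[1] - previous[1], 1)
        price_delta = round(current[2] - previous[2], 2)
        result.append((current[0], score_delta, price_delta))
    return result


for name, score_delta, price_delta in deltas(FAMILY):
    print(f"{name}: score {score_delta:+}, price {price_delta:+}")
```

Under this reading, the Mini's -11.6 is measured against Codex-Max (72.0), not against GPT-5.1-Codex (68.6).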

BenchGecko API

gpt-5-1-codex

Specifications

Type: multimodal
Context: 400K tokens (~200 books)
Released: Nov 2025
License: Proprietary
Status: Active
Cost / Message: ~$0.013

Available On

OpenAI: $1.25

Frequently Asked Questions

GPT-5.1-Codex is a proprietary multimodal AI model by OpenAI, released in November 2025. It has an average benchmark score of 77.7. Context window: 400K tokens.

Benchmarks

LiveBench — Reasoning · LiveBench — Mathematics · LiveBench — Coding · LiveBench — Language · LiveBench — Overall