How much does GPT-4 (older v0314) cost?

GPT-4 (older v0314) costs $30.00 per million input tokens and $60.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.120 per message.

What benchmarks has GPT-4 (older v0314) been tested on?

GPT-4 (older v0314) has been evaluated on 7 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1285.8, GSM8K: 92.0, MMLU: 81.9.

Is GPT-4 (older v0314) open source?

No, GPT-4 (older v0314) is a proprietary model by OpenAI.

How does GPT-4 (older v0314) compare to Gemma 4 31B?

GPT-4 (older v0314) has an average score of 63.7 while Gemma 4 31B scores 63.9. Gemma 4 31B slightly outperforms GPT-4 (older v0314) overall. GPT-4 (older v0314) costs $30.00/1M input vs Gemma 4 31B at $0.12/1M input. See full comparison →

Home/Models/GPT-4 (older v0314)

GPT-4 (older v0314)

Name: GPT-4 (older v0314)
Price: 30 USD
Author: OpenAI

by OpenAI · Released May 2023

63.7

avg score

Rank #69

Compare

Better than 75% of all models

Context

8K tokens (~4 books)

Input $/1M

$30.00

Output $/1M

$60.00

Type

text

License

Proprietary

Benchmarks

7 tested

Data updated today

About

GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data: up to Sep 2021.

Tested on 7 benchmarks with 55.0% average. Top scores: Chatbot Arena Elo — Overall (1285.8%), GSM8K (92.0%), MMLU (81.9%).

Looking for similar performance at lower cost?
Qwen3 30B A3B Thinking 2507 scores 63.5 (100% as good) at $0.08/1M input · 100% cheaper

Capabilities

coding

66.2

#29 globally

math

46.2

#104 globally

knowledge

57.1

#79 globally

Benchmark Scores

Compare All

Tested on 7 benchmarks · Ranked across 4 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

Aider — Code Editing

Code editing benchmark from the Aider project. Measures ability to apply targeted code changes while maintaining correctness and style.

66.2—

mathCompare math →

GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

92.0—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

0.5—

knowledgeCompare knowledge →

MMLU

Massive Multitask Language Understanding. 57 subjects from STEM, humanities, and social sciences. The most widely-cited knowledge benchmark.

81.9—

Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

75.0—

GPQA diamond

Graduate-level science questions written by PhD experts. Diamond subset contains questions where experts disagree, testing deep understanding.

14.3—

Quick compare:

vs Gemma 4 31B

vs Qwen3 30B A3B Thinking 2507

vs Meta Llama 3 8B

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · OpenAI GPT-4

GPT-4May 2023

68.7

$30.00/M in8Kctx1 benchmark

GPT-4 (older v0314)May 2023

55.0-13.7

$30.00/M in8Kctx7 benchmarks

GPT-4 TurboApr 2024

51.0-4.0

$10.00/M in(-20)128Kctx(+120K)12 benchmarks

GPT-4 Turbo (older v1106)Nov 2023

65.4+14.4

$10.00/M in128Kctx2 benchmarks

GPT-4 Turbo PreviewJan 2024

$10.00/M in128Kctx

See the full GPT-4 family →

Similar Models

Gemma 4 31B

Google DeepMind

63.9$0.12/1M

Qwen3 30B A3B Thinking 2507

Alibaba Qwen

63.5$0.08/1M

Meta Llama 3 8B

Frequently Asked Questions

GPT-4 (older v0314) is a proprietary text AI model by OpenAI, released in May 2023. It has an average benchmark score of 63.7. Context window: 8K tokens.

Benchmarks

Chatbot Arena Elo — Overall GSM8K MMLU Winogrande Aider — Code Editing

OpenAI · Provider OpenAI · Economy All Models Compare Models Pricing Developers · API

GPT-4 (older v0314)

Frequently Asked Questions

Related Models

Benchmarks

Related Pages