How much does GPT-4.1 Nano cost?

GPT-4.1 Nano costs $0.10 per million input tokens and $0.40 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.001 per message.

What benchmarks has GPT-4.1 Nano been tested on?

GPT-4.1 Nano has been evaluated on 14 benchmarks. Top scores: HELM — IFEval: 84.3, HELM — WildBench: 81.1, MATH level 5: 70.0.

Is GPT-4.1 Nano open source?

No, GPT-4.1 Nano is a proprietary model by OpenAI.

How does GPT-4.1 Nano compare to Voxtral Small 24B 2507?

GPT-4.1 Nano has an average score of 25.8 while Voxtral Small 24B 2507 scores 26.3. Voxtral Small 24B 2507 slightly outperforms GPT-4.1 Nano overall. GPT-4.1 Nano costs $0.10/1M input vs Voxtral Small 24B 2507 at $0.10/1M input. See full comparison →

Home/Models/GPT-4.1 Nano

GPT-4.1 Nano

Name: GPT-4.1 Nano
Price: 0.1 USD
Author: OpenAI

by OpenAI · Released Apr 2025

Multimodal1M Context

25.8

avg score

Rank #192

Compare

Better than 18% of all models

Context

1.0M tokens (~524 books)

Input $/1M

$0.10

Output $/1M

$0.40

Type

multimodal

License

Proprietary

Benchmarks

14 tested

Data updated today

About

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

Tested on 14 benchmarks with 35.2% average. Top scores: HELM — IFEval (84.3%), HELM — WildBench (81.1%), MATH level 5 (70.0%).

Looking for similar performance at lower cost?
Gemma 3 27B scores 25.1 (97% as good) at $0.08/1M input · 20% cheaper

Capabilities

coding

13.9

#134 globally

reasoning

27.1

#91 globally

math

34.1

#122 globally

knowledge

40.7

#149 globally

language

84.3

#34 globally

Benchmark Scores

Compare All

Tested on 14 benchmarks · Ranked across 5 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

19.0—

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

8.9—

reasoningCompare reasoning →

HELM — WildBench

Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.

81.1—

ARC-AGI-2

ARC-AGI 2, harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.

0.1—

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

0.1—

mathCompare math →

MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

70.0—

HELM — Omni-MATH

Stanford HELM evaluation of mathematical reasoning across diverse problem types.

36.7—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

28.8—

Quick compare:

vs Voxtral Small 24B 2507

vs DeepSeek Coder 33B

vs Gemma 3 27B

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · OpenAI GPT-4.1

GPT-4.1Apr 2025

43.3

$2.00/M in1.0Mctx22 benchmarks

GPT-4.1 MiniApr 2025

44.5+1.2

$0.40/M in(-1.60)1.0Mctx16 benchmarks

GPT-4.1 NanoApr 2025

35.2-9.3

$0.10/M in(-0.30)1.0Mctx14 benchmarks

See the full GPT-4.1 family →

Similar Models

Voxtral Small 24B 2507

Links

Info

OpenAI Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@OpenAI

BenchGecko API

gpt-4-1-nano

Specifications

Typemultimodal
Context1.0M tokens (~524 books)
ReleasedApr 2025
LicenseProprietary
StatusActive
Cost / Message~$0.001

Available On

OpenAI$0.10

Frequently Asked Questions

GPT-4.1 Nano is a proprietary multimodal AI model by OpenAI, released in April 2025. It has an average benchmark score of 25.8. Context window: 1M tokens.

Benchmarks

HELM — IFEval HELM — WildBench MATH level 5 HELM — MMLU-Pro HELM — GPQA

OpenAI · Provider OpenAI · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

GPT-4.1 Nano

Frequently Asked Questions

Related Models

Benchmarks

Related Pages