GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...
Tested on 26 benchmarks with 45.3% average. Top scores: MATH level 5 (95.2%), HELM — IFEval (93.2%), OTIS Mock AIME 2024-2025 (81.1%).
gpt-oss-20b scores 44.4 (100% as good) at $0.03/1M input · 40% cheaper
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.
Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.
SWE-bench Verified solved using only bash commands, no specialized frameworks. Tests raw terminal-based problem solving.
Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.
Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
- Typemultimodal
- Context400K tokens (~200 books)
- ReleasedAug 2025
- LicenseProprietary
- StatusActive
- Cost / Message~$0.001