Beta
Home/Comparar/GPT-4.1 vs gpt-oss-120b

GPT-4.1 vs gpt-oss-120b

Lado a lado. Cada métrica. Cada benchmark.

OpenAI
43.3
puntuación promedio
7/12
benchmarks
OpenAI
46.9
puntuación promedio
5/12
benchmarks
TipoGPT-4.1gpt-oss-120b
ProviderOpenAI logoOpenAIOpenAI logoOpenAI
puntuación promedio43.346.9
Precio de entrada$2.00$0.04
Precio de salida$8.00$0.19
Ventana de contexto1.0M tokens (~524 books)131K tokens (~66 books)
Publicado el2025-04-142025-08-05
Código abiertoProprietaryOpen Source

12 benchmarks · GPT-4.1: 7, gpt-oss-120b: 5

BenchmarkCategoríaGPT-4.1gpt-oss-120b
Aider polyglotcoding52.441.8
Fiction.LiveBenchknowledge63.944.4
GPQA diamondknowledge55.967.7
HELM — GPQAknowledge65.968.4
HELM — IFEvallanguage83.883.6
HELM — MMLU-Proknowledge81.179.5
HELM — Omni-MATHmath47.168.8
HELM — WildBenchreasoning85.484.5
OTIS Mock AIME 2024-2025math38.388.9
SimpleBenchreasoning12.46.5
SWE-Bench Verified (Bash Only)coding39.626.0
WeirdMLcoding39.048.2