Beta
Home/Comparar/o3 vs gpt-oss-120b

o3 vs gpt-oss-120b

Lado a lado. Cada métrica. Cada benchmark.

OpenAI logoo3Ganador
OpenAI
55.2
puntuación promedio
13/14
benchmarks
OpenAI
46.9
puntuación promedio
1/14
benchmarks
Tipoo3gpt-oss-120b
ProviderOpenAI logoOpenAIOpenAI logoOpenAI
puntuación promedio55.246.9
Precio de entrada$2.00$0.04
Precio de salida$8.00$0.19
Ventana de contexto200K tokens (~100 books)131K tokens (~66 books)
Publicado el2025-04-162025-08-05
Código abiertoProprietaryOpen Source

14 benchmarks · o3: 13, gpt-oss-120b: 1

BenchmarkCategoríao3gpt-oss-120b
Aider polyglotcoding81.341.8
Fiction.LiveBenchknowledge88.944.4
GPQA diamondknowledge75.867.7
HELM — GPQAknowledge75.368.4
HELM — IFEvallanguage86.983.6
HELM — MMLU-Proknowledge85.979.5
HELM — Omni-MATHmath71.468.8
HELM — WildBenchreasoning86.184.5
Lech Mazur Writingknowledge83.977.3
OTIS Mock AIME 2024-2025math83.988.9
SimpleBenchreasoning43.76.5
SimpleQA Verifiedknowledge53.013.9
SWE-Bench Verified (Bash Only)coding58.426.0
WeirdMLcoding52.448.2