gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...
Tested on 9 benchmarks with 66.4% average. Top scores: OpenCompass — IFEval (88.9%), OpenCompass — AIME2025 (87.9%), OpenCompass — MMLU-Pro (72.8%).
OpenCompass Live Code Bench v6. Fresh competitive programming problems to evaluate code generation without memorization.
OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.
OpenCompass MMLU-Pro evaluation. Harder knowledge test with more answer choices.
OpenCompass evaluation of GPQA Diamond. PhD-level science questions from the hardest subset.
OpenCompass evaluation of Humanitys Last Exam. Expert-level cross-discipline knowledge test.
- Typetext
- Context131K tokens (~66 books)
- ReleasedAug 2025
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.000