NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Tested on 11 benchmarks with 51.8% average. Top scores: LiveBench — Coding (71.3%), LiveBench — If (58.2%), LiveBench — Mathematics (54.5%).
Phi 4 Mini Instruct scores 48.9 (102% as good) at $0.08/1M input · 84% cheaper
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
- Typetext
- Context1.0M tokens (~500 books)
- ReleasedJun 2026
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.003