Home/Models/Llama 3.1 Nemotron Ultra 253B v1
NVIDIA logo

Llama 3.1 Nemotron Ultra 253B v1

by NVIDIA · Released Apr 2025

Open Source
Compare
Context
131K tokens (~66 books)
Input $/1M
$0.60
Output $/1M
$1.80
Type
text
License
Open Source
Benchmarks
4 tested
Data updated today
About

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

Tested on 4 benchmarks with 0.0% average. Top scores: Chatbot Arena Elo — Overall (1346.8%), Artificial Analysis — Quality Index (15.0%), Artificial Analysis — Coding Index (13.1%).

Capabilities
speed
17.7
#56 globally
Benchmark Scores
Compare All
Tested on 4 benchmarks · Ranked across 2 categories
Score Distribution (all 233 models)
0255075100
Chatbot Arena Elo — Overall

Chatbot Arena overall Elo rating. Crowdsourced human preference ranking from blind head-to-head comparisons across all topics.

1347
Artificial Analysis — Quality Index

Artificial Analysis Quality Index. Composite quality score combining multiple benchmark results into a single metric.

15.0
Artificial Analysis — Coding Index

Artificial Analysis Coding Index. Composite coding quality score from multiple code benchmarks.

13.1
Artificial Analysis — Agentic Index

Artificial Analysis Agentic Index. Composite score measuring agent capability across tool use and planning tasks.

3.8
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
llama-3-1-nemotron-ultra-253b-v1
Specifications
  • Typetext
  • Context131K tokens (~66 books)
  • ReleasedApr 2025
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.003
Available On
NVIDIA logoNVIDIA$0.60
Categories
Share & Export
Tweet
Llama 3.1 Nemotron Ultra 253B v1 is an open-source text AI model by NVIDIA, released in April 2025. Context window: 131K tokens.