How much does Llama 3.1 Nemotron Ultra 253B v1 cost?

Llama 3.1 Nemotron Ultra 253B v1 costs $0.60 per million input tokens and $1.80 per million output tokens. For a typical message of about 2,000 tokens, that works out to approximately $0.003, with the exact figure depending on how the tokens split between input and output.
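The per-message estimate can be checked with a few lines of arithmetic. The sketch below uses only the two prices listed on this page; the function name `message_cost` and the input/output splits are illustrative assumptions, since the listing does not say how the 2,000 tokens are divided.

```python
# Listed prices in USD per one million tokens (taken from this page).
INPUT_PRICE_PER_M = 0.60
OUTPUT_PRICE_PER_M = 1.80


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical splits of a 2,000-token message (assumed, not from the listing).
for input_tokens, output_tokens in [(2000, 0), (1000, 1000), (500, 1500), (0, 2000)]:
    cost = message_cost(input_tokens, output_tokens)
    print(f"{input_tokens} in / {output_tokens} out: ${cost:.4f}")
```

Under these assumptions a 2,000-token message costs between $0.0012 (all input) and $0.0036 (all output). The $0.003 figure matches a split of 500 input and 1,500 output tokens.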

Is Llama 3.1 Nemotron Ultra 253B v1 open source?

Yes, Llama 3.1 Nemotron Ultra 253B v1 is open source.

How does Llama 3.1 Nemotron Ultra 253B v1 compare to GPT-4o Audio?

Both Llama 3.1 Nemotron Ultra 253B v1 and GPT-4o Audio show an average score of 0.0 in this listing, so the averages do not distinguish the two models. On price, Llama 3.1 Nemotron Ultra 253B v1 costs $0.60 per million input tokens, compared with $2.50 per million input tokens for GPT-4o Audio.

Llama 3.1 Nemotron Ultra 253B v1

Name: Llama 3.1 Nemotron Ultra 253B v1
Price: $0.60 per 1M input tokens
Author: NVIDIA

by NVIDIA · Released Apr 2025

Open Source

Context: 131K tokens
Input $/1M: $0.60
Output $/1M: $1.80
Type: text
License: Open Source
Benchmarks: 4 tested

About

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

Tested on 4 benchmarks. The listing shows a 0.0% average, which does not match the individual scores. Top scores: Chatbot Arena Elo, Overall: 1346.8 (an Elo rating, not a percentage); Artificial Analysis Quality Index: 15.0; Artificial Analysis Coding Index: 13.1.

Capabilities

speed: 17.7 (#81 globally)

Benchmark Scores

Tested on 4 benchmarks · Ranked across 2 categories

Score Distribution (all 274 models)