Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...
Tested on 4 benchmarks with 0.0% average. Top scores: Chatbot Arena Elo — Overall (1346.8%), Artificial Analysis — Quality Index (15.0%), Artificial Analysis — Coding Index (13.1%).
Chatbot Arena overall Elo rating. Crowdsourced human preference ranking from blind head-to-head comparisons across all topics.
Artificial Analysis Quality Index. Composite quality score combining multiple benchmark results into a single metric.
Artificial Analysis Coding Index. Composite coding quality score from multiple code benchmarks.
Artificial Analysis Agentic Index. Composite score measuring agent capability across tool use and planning tasks.
- Typetext
- Context131K tokens (~66 books)
- ReleasedApr 2025
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.003