Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...
Tested on 5 benchmarks with 0.0% average. Top scores: Chatbot Arena Elo — Overall (1347.0%), Chatbot Arena Elo — Coding (1182.2%), Artificial Analysis — Agentic Index (39.7%).
Chatbot Arena overall Elo rating. Crowdsourced human preference ranking from blind head-to-head comparisons across all topics.
Chatbot Arena coding Elo. Human preference ranking specifically for coding tasks and technical questions.
Artificial Analysis Agentic Index. Composite score measuring agent capability across tool use and planning tasks.
Artificial Analysis Quality Index. Composite quality score combining multiple benchmark results into a single metric.
Artificial Analysis Coding Index. Composite coding quality score from multiple code benchmarks.
- Typetext
- Context128K tokens (~64 books)
- ReleasedMar 2026
- LicenseProprietary
- StatusActive
- Cost / Message~$0.001