MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like...
Tested on 13 benchmarks, with a 58.1% average. Top scores: Chatbot Arena Elo, Overall (1445.0); Chatbot Arena Elo, Coding (1433.4); LiveBench, Mathematics (77.0%).
Alternative: Qwen3 30B A3B Thinking 2507 scores 63.5 (101% of this model's score) at $0.08 per 1M input tokens, 92% cheaper.
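The listing does not state the price that the "92% cheaper" figure is measured against. As a rough sketch only, and assuming the percentage refers to the per-1M-input-token price (an assumption, not something the page confirms), the implied reference price can be back-calculated. The function name `implied_reference_price` is hypothetical, and the rounding band assumes the 92% was rounded to the nearest whole percent.

```python
def implied_reference_price(alt_price: float, pct_cheaper: float) -> float:
    """Return the price that `alt_price` undercuts by `pct_cheaper` percent."""
    if not 0 <= pct_cheaper < 100:
        raise ValueError("pct_cheaper must be in [0, 100)")
    return alt_price / (1 - pct_cheaper / 100)


# Figure taken from the listing: USD per 1M input tokens for the cheaper model.
alt_price = 0.08

# Point estimate, plus the band implied if "92%" was rounded (assumption).
point = implied_reference_price(alt_price, 92)
low = implied_reference_price(alt_price, 91.5)
high = implied_reference_price(alt_price, 92.5)

print(f"implied reference price: ~${point:.2f} per 1M input tokens")
print(f"range if 92% is a rounded figure: ${low:.2f} to ${high:.2f}")
```

Treat the result as an order-of-magnitude check, not a quoted price: if the percentage is based on a blended input/output rate instead, the estimate does not apply.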
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
- Type: text
- Context: 1.0M tokens (~524 books)
- Released: Mar 2026
- License: Proprietary
- Status: Active
- Cost / Message: ~$0.005