DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning...
Tested on 9 benchmarks with 78.2% average. Top scores: OpenCompass — AIME2025 (96.0%), OpenCompass — IFEval (91.7%), OpenCompass — GPQA-Diamond (86.7%).
Qwen3.5 397B A17B scores 96.3 (101% as good) at $0.39/1M input · 3% cheaper
OpenCompass Live Code Bench v6. Fresh competitive programming problems to evaluate code generation without memorization.
OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.
OpenCompass evaluation of GPQA Diamond. PhD-level science questions from the hardest subset.
OpenCompass MMLU-Pro evaluation. Harder knowledge test with more answer choices.
OpenCompass evaluation of Humanitys Last Exam. Expert-level cross-discipline knowledge test.
- Typetext
- Context164K tokens (~82 books)
- ReleasedDec 2025
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.002