Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....
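As a rough illustration of what the sparse activation means in practice, the short sketch below computes the share of parameters used per token from the two figures quoted above (11B active, 196B total). The function name `active_fraction` is hypothetical and chosen only for this example; it is not part of any StepFun tooling.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Return the share of parameters activated per token (0.0 to 1.0)."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Figures taken from the description above, in billions of parameters.
fraction = active_fraction(11, 196)
print(f"{fraction:.1%} of parameters active per token")  # prints "5.6% ..."
```

In other words, each token is processed by roughly one eighteenth of the full model, which is why per-token compute sits closer to that of an 11B dense model than a 196B one.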
Tested on 10 benchmarks, with an average score of 76.9%. Top scores: Chatbot Arena Elo, Overall (1391.4); OpenCompass AIME2025 (95.7%); OpenCompass IFEval (93.2%).
OpenCompass evaluation on LiveCodeBench v6. Fresh competitive programming problems that evaluate code generation without memorization.
OpenCompass evaluation on AIME 2025 problems. Tests mathematical reasoning on fresh competition problems.
OpenCompass evaluation of GPQA Diamond. PhD-level science questions from the hardest subset.
OpenCompass MMLU-Pro evaluation. Harder knowledge test with more answer choices.
OpenCompass evaluation of Humanity's Last Exam. Expert-level cross-discipline knowledge test.
- Type: text
- Context: 262K tokens (~131 books)
- Released: Jan 2026
- License: Open Source
- Status: Active
- Cost / Message: ~$0.001