Home/Models/Qwen3.5-Flash
Alibaba Qwen logo

Qwen3.5-Flash

by Alibaba Qwen · Released Feb 2026

Open SourceMultimodal1M Context
38.3
avg score
Rank #182
Compare
Better than 34% of all models
Context
1.0M tokens (~500 books)
Input $/1M
$0.07
Output $/1M
$0.26
Type
multimodal
License
Open Source
Benchmarks
8 tested
Data updated today
About

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Tested on 8 benchmarks with 35.0% average. Top scores: Chatbot Arena Elo — Overall (1397.0%), Chatbot Arena Elo — Coding (1237.8%), OTIS Mock AIME 2024-2025 (85.5%).

Looking for similar performance at lower cost?
Mistral Nemo scores 39.0 (102% as good) at $0.02/1M input · 69% cheaper
Capabilities
math
30.6
#158 globally
knowledge
39.4
#184 globally
Benchmark Scores
Compare All
Tested on 8 benchmarks · Ranked across 3 categories
Score Distribution (all 274 models)
0255075100
▲ You are here
OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

85.5
FrontierMath-2025-02-28-Private

Original research-level math problems created by professional mathematicians. Problems are unpublished and cannot be memorized.

6.2
FrontierMath-Tier-4-2025-07-01-Private

Hardest tier of FrontierMath. Problems at the frontier of human mathematical ability, many unsolved by most mathematicians.

0.0
GPQA diamond

Graduate-level science questions written by PhD experts. Diamond subset contains questions where experts disagree, testing deep understanding.

78.5
Chess Puzzles

Tactical chess puzzles testing pattern recognition and multi-move calculation. Measures strategic reasoning ability.

20.0
SimpleQA Verified

Simple factual questions with verified correct answers. Tests accuracy of basic knowledge retrieval. Low scores indicate hallucination.

19.8
Chatbot Arena Elo — Overall

Chatbot Arena overall Elo rating. Crowdsourced human preference ranking from blind head-to-head comparisons across all topics.

1397
Chatbot Arena Elo — Coding

Chatbot Arena coding Elo. Human preference ranking specifically for coding tasks and technical questions.

1238
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Recently Happened
Qwen3.5-Flash released by Alibaba Qwen
Mar 3, 2026
Links
Documentation
Community
BenchGecko API
qwen3-5-flash-02-23
Specifications
  • Typemultimodal
  • Context1.0M tokens (~500 books)
  • ReleasedFeb 2026
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.000
Available On
Alibaba Qwen logoAlibaba Qwen$0.07
Categories
Share & Export
Tweet
Qwen3.5-Flash is an open-source multimodal AI model by Alibaba Qwen, released in February 2026. It has an average benchmark score of 38.3. Context window: 1M tokens.