Cheapest 1M context LLMs

Every LLM with a 1,000,000+ token context window. Ranked by input price per 1M tokens.

Models: 40
Cheapest: $-1000000.00
Min context: 1M tokens
What this page is
This page lists every priced model with a context window of at least one million tokens. A 1M context unlocks whole-repo coding, book-length analysis, and massive multi-document RAG without chunking. A full-window call bills roughly a million input tokens (about $2.00 for input alone at the top end of this list), so compare carefully and lean on context caching whenever possible.
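As a rough illustration of how per-1M prices translate into a per-call bill, here is a minimal sketch. The function name `call_cost` and the 2,000-token answer length are assumptions made for the example; the prices are copied from this page's table. It ignores caching discounts and any tiered long-context pricing a provider may apply, so treat the results as ballpark figures only.

```python
def call_cost(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Return the dollar cost of one call, given prices quoted per 1M tokens."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000


# A full 1M-token prompt with a 2,000-token answer (answer length is an assumption).
gemini_25_pro = call_cost(1_000_000, 2_000, 1.25, 10.00)  # listed at $1.25 in, $10.00 out
gpt_41_nano = call_cost(1_000_000, 2_000, 0.10, 0.40)     # listed at $0.10 in, $0.40 out

print(f"Gemini 2.5 Pro: ${gemini_25_pro:.4f} per call")
print(f"GPT-4.1 Nano:   ${gpt_41_nano:.4f} per call")
```

With a prompt this large, the input side dominates the bill, which is why the page ranks by input price.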

1M+ context models, cheapest first.

| # | Provider | Model | In $/1M | Out $/1M | Type |
|---|----------|-------|---------|----------|------|
| 1 | OpenRouter | Auto Router | $-1000000.00 | $-1000000.00 | Closed |
| 2 | Google DeepMind | Lyria 3 Clip Preview | $0.00 | $0.00 | Closed |
| 3 | Google DeepMind | Lyria 3 Pro Preview | $0.00 | $0.00 | Closed |
| 4 | Alibaba Qwen | Qwen3.6 Plus (free) | $0.00 | $0.00 | Closed |
| 5 | Alibaba Qwen | Qwen3.6 Plus Preview (free) | $0.00 | $0.00 | OSS |
| 6 | Alibaba Qwen | Qwen3.5-Flash | $0.07 | $0.26 | OSS |
| 7 | Google DeepMind | Gemini 2.0 Flash Lite | $0.07 | $0.30 | Closed |
| 8 | Google DeepMind | Gemini 2.0 Flash | $0.10 | $0.40 | Closed |
| 9 | Google DeepMind | Gemini 2.5 Flash Lite | $0.10 | $0.40 | Closed |
| 10 | Google DeepMind | Gemini 2.5 Flash Lite Preview 09-2025 | $0.10 | $0.40 | Closed |
| 11 | OpenAI | GPT-4.1 Nano | $0.10 | $0.40 | Closed |
| 12 | Meta | Llama 4 Maverick | $0.15 | $0.60 | OSS |
| 13 | Alibaba Qwen | Qwen3 Coder Flash | $0.20 | $0.97 | OSS |
| 14 | xAI | Grok 4 Fast | $0.20 | $0.50 | Closed |
| 15 | xAI | Grok 4.1 Fast | $0.20 | $0.50 | Closed |
| 16 | MiniMax | MiniMax-01 | $0.20 | $1.10 | OSS |
| 17 | Google DeepMind | Gemini 3.1 Flash Lite Preview | $0.25 | $1.50 | Closed |
| 18 | Alibaba Qwen | Qwen Plus 0728 | $0.26 | $0.78 | OSS |
| 19 | Alibaba Qwen | Qwen Plus 0728 (thinking) | $0.26 | $0.78 | OSS |
| 20 | Alibaba Qwen | Qwen-Plus | $0.26 | $0.78 | OSS |
| 21 | Alibaba Qwen | Qwen3.5 Plus 2026-02-15 | $0.26 | $1.56 | OSS |
| 22 | Google DeepMind | Gemini 2.5 Flash | $0.30 | $2.50 | Closed |
| 23 | Amazon | Nova 2 Lite | $0.30 | $2.50 | Closed |
| 24 | Alibaba Qwen | Qwen3.6 Plus | $0.33 | $1.95 | OSS |
| 25 | OpenAI | GPT-4.1 Mini | $0.40 | $1.60 | Closed |
| 26 | MiniMax | MiniMax M1 | $0.40 | $2.20 | Closed |
| 27 | Google DeepMind | Gemini 3 Flash Preview | $0.50 | $3.00 | Closed |
| 28 | Writer | Palmyra X5 | $0.60 | $6.00 | Closed |
| 29 | Alibaba Qwen | Qwen3 Coder Plus | $0.65 | $3.25 | OSS |
| 30 | Xiaomi | MiMo-V2-Pro | $1.00 | $3.00 | Closed |
| 31 | Google DeepMind | Gemini 2.5 Pro | $1.25 | $10.00 | Closed |
| 32 | Google DeepMind | Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | Closed |
| 33 | Google DeepMind | Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | Closed |
| 34 | Google DeepMind | Gemini 3.1 Pro Preview | $2.00 | $12.00 | Closed |
| 35 | Google DeepMind | Gemini 3.1 Pro Preview Custom Tools | $2.00 | $12.00 | Closed |
| 36 | OpenAI | GPT-4.1 | $2.00 | $8.00 | Closed |
| 37 | xAI | Grok 4.20 | $2.00 | $6.00 | Closed |
| 38 | xAI | Grok 4.20 Beta | $2.00 | $6.00 | Closed |
| 39 | xAI | Grok 4.20 Multi-Agent | $2.00 | $6.00 | Closed |
| 40 | xAI | Grok 4.20 Multi-Agent Beta | $2.00 | $6.00 | Closed |
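
The cheapest-first ordering can be reproduced with a plain sort. The sketch below rests on two assumptions that the page itself does not state: that a negative price (as shown for Auto Router) is a placeholder rather than a real rate, and that ties on input price are broken by output price. The names `Model` and `rank_by_input_price` are hypothetical, and only a handful of rows are copied in.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    in_price: float   # $ per 1M input tokens
    out_price: float  # $ per 1M output tokens


# A few rows copied from the listing, deliberately out of order.
ROWS = [
    Model("Gemini 2.5 Pro", 1.25, 10.00),
    Model("Auto Router", -1_000_000.00, -1_000_000.00),
    Model("Qwen3.5-Flash", 0.07, 0.26),
    Model("Qwen3.6 Plus (free)", 0.00, 0.00),
    Model("GPT-4.1 Nano", 0.10, 0.40),
]


def rank_by_input_price(rows):
    """Sort cheapest first, skipping rows with a negative price.

    Assumption: a negative price is a placeholder, not a real rate.
    Ties on input price fall back to output price (also an assumption).
    """
    valid = [m for m in rows if m.in_price >= 0 and m.out_price >= 0]
    return sorted(valid, key=lambda m: (m.in_price, m.out_price))


ranked = rank_by_input_price(ROWS)
# First entry with a non-zero input price, i.e. the cheapest paid model.
cheapest_paid = next(m for m in ranked if m.in_price > 0)

for position, model in enumerate(ranked, start=1):
    print(position, model.name, f"${model.in_price:.2f}", f"${model.out_price:.2f}")
```

Separating free and placeholder rows from paid ones gives a more useful "cheapest" figure than taking the raw minimum.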
Cheapest: Auto Router, $-1000000.00 per 1M input tokens
Why the gap

With premium 1M-context models, you pay for better accuracy at the tail of the window and faster ingestion. For research and one-shot analysis, the cheap end often delivers comparable answers on most prompts.

Most expensive: Gemini 3.1 Pro Preview, $2.00 per 1M input tokens
Gemini 2.5 Pro and Flash were among the first to ship a real 1M window. Claude Sonnet has since been extended to 1M. Qwen3 Long is a strong open-source option. MiniMax and several other Chinese labs also ship 1M+ windows. See the table above for the current live list.